Publication: Performance Analysis of Rule Based Automatic SNN Algorithm on Big Data Sets
Loading...
Date
Authors
Journal Title
Journal ISSN
Volume Title
Abstract
Clustering is defined as the classification of patterns into groups (clusters) without supervision. The clustering of similarities of data is a complex process that can not be done with human hands. There are various clustering algorithms based on different principles in the literature. The SNN (Shared Nearest Neighborhood) algorithm is a density-based clustering algorithm that identifies similarities between the data by looking at the shared nearest neighbors by two data. The SNN algorithm uses parameters specifying the radius (Eps) that a user enters when clustering, a radius that limits a neighborhood of a point, and the minimum number of points (minPorts) that must be in an eps-neighborhood. This leads to clustering performans has dependency of user experience. A rule-based automatic SNN algorithm has been proposed to remove this dependency from the user. In this study, the performance of the rule-based automatic SNN algorithm over the data sets with 2000 and over sample numbers is examined and presented. © 2018 IEEE.
Description
Aselsan; et al.; Huawei; IEEE Signal Processing Society; IEEE Turkey Section; Netas
Citation
WoS Q
N/A
Scopus Q
N/A
Source
-- 26th IEEE Signal Processing and Communications Applications Conference, SIU 2018 -- 2018-05-02 through 2018-05-05 -- Izmir -- 137780
Volume
Issue
Start Page
1
End Page
4
