Publication:
On the Effect of K Values and Distance Metrics in KNN Algorithm for Android Malware Detection

dc.authorwosidKiliç, Erdal/Hjy-2853-2023
dc.authorwosidSahin, Durmus/Aaj-7961-2020
dc.authorwosidAkleylek, Sedat/D-2090-2015
dc.contributor.authorSahin, Durmus Ozkan
dc.contributor.authorAkleylek, Sedat
dc.contributor.authorKilic, Erdal
dc.contributor.authorIDKiliç, Erdal/0000-0003-1585-0991
dc.date.accessioned2025-12-11T01:04:21Z
dc.date.issued2021
dc.departmentOndokuz Mayıs Üniversitesien_US
dc.department-temp[Sahin, Durmus Ozkan; Akleylek, Sedat; Kilic, Erdal] Ondokuz Mays Univ, Dept Comp Engn, Atakum, Samsun, Turkeyen_US
dc.descriptionKiliç, Erdal/0000-0003-1585-0991;en_US
dc.description.abstractThere is a remarkable increase in mobile device usage in recent years. The Android operating system is by far the most preferred open-source mobile operating system around the world. Besides, the Android operating system is preferred in many devices on the Internet of Things (IoT) devices are used in many areas of daily life. Smart cities, smart environment, health, home automation, agriculture, and livestock are some of the usage areas. Health is one of the most frequently used areas. Since the Android operating system is both the widely used operating system and open-source, the vast majority of malware released on the market is now designed for Android platforms. Therefore, devices using the Android operating system are under serious threat. In this study, a system that detects malware on Android operating systems based on machine learning is proposed. Besides, feature vectors are created with permissions that have an important place in the security of the Android operating system. Feature vectors created using the k-nearest neighbor algorithm (KNN), one of the machine learning techniques, are given as input to this algorithm, and a classification of malicious software and benign software is provided. In the KNN algorithm, the k value and the distance metric used to find the closest sample directly affect the classification performance. In addition, the study examining the parameters of the KNN algorithm in detail in permission-based studies is limited. For this reason, the performance of the malware detection system is presented comparatively using five different k values and five different distance metrics under different data sets. When the results are examined, it is observed that higher classification performances are obtained when values such as 1, 3 are given to k and metrics such as Euclidean and Minkowski are chosen instead of the Chebyshev distance metric.en_US
dc.description.woscitationindexEmerging Sources Citation Index
dc.identifier.doi10.1142/S2424922X21410011
dc.identifier.issn2424-922X
dc.identifier.issn2424-9238
dc.identifier.urihttps://doi.org/10.1142/S2424922X21410011
dc.identifier.urihttps://hdl.handle.net/20.500.12712/41109
dc.identifier.volume13en_US
dc.identifier.wosWOS:000772405800007
dc.language.isoenen_US
dc.publisherWorld Scientific Publishing Co Pte Ltden_US
dc.relation.ispartofAdvances in Data Science and Adaptive Analysisen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectMachine Learningen_US
dc.subjectK-Nearest Neighbor Algorithmen_US
dc.subjectAndroid Malwareen_US
dc.subjectAndroid IoT Devicesen_US
dc.subjectHealth Device Securityen_US
dc.subjectMalware Detectionen_US
dc.subjectStatic Analysisen_US
dc.titleOn the Effect of K Values and Distance Metrics in KNN Algorithm for Android Malware Detectionen_US
dc.typeArticleen_US
dspace.entity.typePublication

Files