Publication:
Development of Various Stacking Ensemble-Based HIDs Using ADFA Datasets

dc.authorscopusid57212210447
dc.authorscopusid15833929800
dc.authorscopusid59661691200
dc.authorwosidAkleylek, Sedat/D-2090-2015
dc.contributor.authorSatilmis, Hami
dc.contributor.authorAkleylek, Sedat
dc.contributor.authorTok, Zaliha Yuce
dc.contributor.authorIDAkleylek, Sedat/0000-0001-7005-6489
dc.contributor.authorIDSatılmış, Hami/0000-0002-6611-7549
dc.date.accessioned2025-12-11T01:21:16Z
dc.date.issued2025
dc.departmentOndokuz Mayıs Üniversitesien_US
dc.department-temp[Satilmis, Hami] Ondokuz Mayis Univ, Dept Comp Engn, TR-55200 Samsun, Turkiye; [Akleylek, Sedat] ?Istinye Univ, Dept Comp Engn, TR-34010 Istanbul, Turkiye; [Akleylek, Sedat] Univ Tartu, Chair Secur & Theoret Comp Sci, EE-50090 Tartu, Estonia; [Tok, Zaliha Yuce] ASELSAN, TR-06200 Ankara, Turkiyeen_US
dc.descriptionAkleylek, Sedat/0000-0001-7005-6489; Satılmış, Hami/0000-0002-6611-7549en_US
dc.description.abstractThe rapid increase in the number of cyber attacks and the emergence of various attack variations pose significant threats to the security of computer systems and networks. Various intrusion detection systems (IDS) are developed to defend computer systems and networks in response to these threats. One type of IDS, known as a host-based intrusion detection system (HIDS), focuses on securing a single host. Numerous HIDS have been proposed in the literature, incorporating various detection methods. This study develops multiple machine learning (ML) models and stacking ensemble based HIDS that can be used as detection methods in HIDS. Initially, n-grams, standard bag-of-words (BoW), binary BoW, probability BoW, and term frequency-inverse document frequency (TF-IDF) BoW methods are applied to the ADFA-LD and ADFA-WD datasets. Mutual information and k-means methods are used together for feature selection on the resulting BoW datasets. Individual models are created using either selected features or all features. Subsequently, the outputs of these individual models are used in extreme gradient boosting (XGBoost) and adaptive boosting (AdaBoost) models to develop stacking ensemble based models. The experimental results show that the best accuracy (ACC) among models using ADFA-LD based BoW datasets is achieved by the stacking ensemble based XGBoost model, which has an ACC of 0.9747. This XGBoost model utilizes the standard BoW dataset and selected features. Among models using ADFA-WD based BoW datasets, the stacking ensemble based XGBoost is also the most successful in terms of ACC, with an ACC of 0.9163, using the standard BoW dataset and all features.en_US
dc.description.sponsorshipTUBITAK 1515 Frontier RD Laboratories Support Programen_US
dc.description.sponsorshipThe work of Zaliha Yuce Tok was supported by TUBITAK 1515 Frontier RD Laboratories Support Program.en_US
dc.description.woscitationindexEmerging Sources Citation Index
dc.identifier.doi10.1109/OJCOMS.2025.3538101
dc.identifier.endpage1189en_US
dc.identifier.issn2644-125X
dc.identifier.scopus2-s2.0-85217487776
dc.identifier.scopusqualityQ1
dc.identifier.startpage1170en_US
dc.identifier.urihttps://doi.org/10.1109/OJCOMS.2025.3538101
dc.identifier.urihttps://hdl.handle.net/20.500.12712/43142
dc.identifier.volume6en_US
dc.identifier.wosWOS:001425520800001
dc.language.isoenen_US
dc.publisherIEEE-Inst Electrical Electronics Engineers Incen_US
dc.relation.ispartofIEEE Open Journal of the Communications Societyen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectFeature Extractionen_US
dc.subjectLong Short Term Memoryen_US
dc.subjectDetectorsen_US
dc.subjectStackingen_US
dc.subjectComputer Crimeen_US
dc.subjectSupport Vector Machinesen_US
dc.subjectComputer Securityen_US
dc.subjectAdaptation Modelsen_US
dc.subjectStandardsen_US
dc.subjectIntrusion Detectionen_US
dc.subjectIntrusion Detection Systemen_US
dc.subjectHost-Based Intrusion Detection Systemen_US
dc.subjectInformation Securityen_US
dc.subjectMachine Learningen_US
dc.titleDevelopment of Various Stacking Ensemble-Based HIDs Using ADFA Datasetsen_US
dc.typeArticleen_US
dspace.entity.typePublication

Files