Publication: Development of Various Stacking Ensemble-Based HIDs Using ADFA Datasets
| dc.authorscopusid | 57212210447 | |
| dc.authorscopusid | 15833929800 | |
| dc.authorscopusid | 59661691200 | |
| dc.authorwosid | Akleylek, Sedat/D-2090-2015 | |
| dc.contributor.author | Satilmis, Hami | |
| dc.contributor.author | Akleylek, Sedat | |
| dc.contributor.author | Tok, Zaliha Yuce | |
| dc.contributor.authorID | Akleylek, Sedat/0000-0001-7005-6489 | |
| dc.contributor.authorID | Satılmış, Hami/0000-0002-6611-7549 | |
| dc.date.accessioned | 2025-12-11T01:21:16Z | |
| dc.date.issued | 2025 | |
| dc.department | Ondokuz Mayıs Üniversitesi | en_US |
| dc.department-temp | [Satilmis, Hami] Ondokuz Mayis Univ, Dept Comp Engn, TR-55200 Samsun, Turkiye; [Akleylek, Sedat] ?Istinye Univ, Dept Comp Engn, TR-34010 Istanbul, Turkiye; [Akleylek, Sedat] Univ Tartu, Chair Secur & Theoret Comp Sci, EE-50090 Tartu, Estonia; [Tok, Zaliha Yuce] ASELSAN, TR-06200 Ankara, Turkiye | en_US |
| dc.description | Akleylek, Sedat/0000-0001-7005-6489; Satılmış, Hami/0000-0002-6611-7549 | en_US |
| dc.description.abstract | The rapid increase in the number of cyber attacks and the emergence of various attack variations pose significant threats to the security of computer systems and networks. Various intrusion detection systems (IDS) are developed to defend computer systems and networks in response to these threats. One type of IDS, known as a host-based intrusion detection system (HIDS), focuses on securing a single host. Numerous HIDS have been proposed in the literature, incorporating various detection methods. This study develops multiple machine learning (ML) models and stacking ensemble based HIDS that can be used as detection methods in HIDS. Initially, n-grams, standard bag-of-words (BoW), binary BoW, probability BoW, and term frequency-inverse document frequency (TF-IDF) BoW methods are applied to the ADFA-LD and ADFA-WD datasets. Mutual information and k-means methods are used together for feature selection on the resulting BoW datasets. Individual models are created using either selected features or all features. Subsequently, the outputs of these individual models are used in extreme gradient boosting (XGBoost) and adaptive boosting (AdaBoost) models to develop stacking ensemble based models. The experimental results show that the best accuracy (ACC) among models using ADFA-LD based BoW datasets is achieved by the stacking ensemble based XGBoost model, which has an ACC of 0.9747. This XGBoost model utilizes the standard BoW dataset and selected features. Among models using ADFA-WD based BoW datasets, the stacking ensemble based XGBoost is also the most successful in terms of ACC, with an ACC of 0.9163, using the standard BoW dataset and all features. | en_US |
| dc.description.sponsorship | TUBITAK 1515 Frontier RD Laboratories Support Program | en_US |
| dc.description.sponsorship | The work of Zaliha Yuce Tok was supported by TUBITAK 1515 Frontier RD Laboratories Support Program. | en_US |
| dc.description.woscitationindex | Emerging Sources Citation Index | |
| dc.identifier.doi | 10.1109/OJCOMS.2025.3538101 | |
| dc.identifier.endpage | 1189 | en_US |
| dc.identifier.issn | 2644-125X | |
| dc.identifier.scopus | 2-s2.0-85217487776 | |
| dc.identifier.scopusquality | Q1 | |
| dc.identifier.startpage | 1170 | en_US |
| dc.identifier.uri | https://doi.org/10.1109/OJCOMS.2025.3538101 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12712/43142 | |
| dc.identifier.volume | 6 | en_US |
| dc.identifier.wos | WOS:001425520800001 | |
| dc.language.iso | en | en_US |
| dc.publisher | IEEE-Inst Electrical Electronics Engineers Inc | en_US |
| dc.relation.ispartof | IEEE Open Journal of the Communications Society | en_US |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | Feature Extraction | en_US |
| dc.subject | Long Short Term Memory | en_US |
| dc.subject | Detectors | en_US |
| dc.subject | Stacking | en_US |
| dc.subject | Computer Crime | en_US |
| dc.subject | Support Vector Machines | en_US |
| dc.subject | Computer Security | en_US |
| dc.subject | Adaptation Models | en_US |
| dc.subject | Standards | en_US |
| dc.subject | Intrusion Detection | en_US |
| dc.subject | Intrusion Detection System | en_US |
| dc.subject | Host-Based Intrusion Detection System | en_US |
| dc.subject | Information Security | en_US |
| dc.subject | Machine Learning | en_US |
| dc.title | Development of Various Stacking Ensemble-Based HIDs Using ADFA Datasets | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication |
