Publication: Feature Selection in Text Classification
| dc.authorscopusid | 56589621700 | |
| dc.authorscopusid | 56779819400 | |
| dc.authorscopusid | 22953804000 | |
| dc.contributor.author | Şahin, D.O. | |
| dc.contributor.author | Ates, N. | |
| dc.contributor.author | Kilic, E. | |
| dc.date.accessioned | 2025-12-10T23:16:55Z | |
| dc.date.issued | 2016 | |
| dc.department | Ondokuz Mayıs Üniversitesi | en_US |
| dc.department-temp | [Şahin] Durmuş Ozkan, Bilgisayar Mühendisliǧi Bölümü, Ondokuz Mayis Üniversitesi, Samsun, Turkey; [Ates] Nurullah, Bilgisayar Mühendisliǧi Bölümü, Ondokuz Mayis Üniversitesi, Samsun, Turkey; [Kilic] Erdal, Bilgisayar Mühendisliǧi Bölümü, Ondokuz Mayis Üniversitesi, Samsun, Turkey | en_US |
| dc.description.abstract | In recent years, text classification have been widely used. Dimension of text data has increased more and more. Working of almost all classification algorithms is directly related to dimension. In high dimension data set, working of classification algorithms both takes time and occurs over fitting problem. So feature selection is crucial for machine learning techniques. In this study, frequently used feature selection metrics Chi Square (CHI), Information Gain (IG) and Odds Ratio (OR) have been applied. At the same time the method Relevancy Frequency (RF) proposed as term weighting method has been used as feature selection method in this study. It is used for tf.idf term as weighting method, Sequential Minimal Optimization (SMO) and Naive Bayes (NB) in the classification algorithm. Experimental results show that RF gives successful results. © 2016 IEEE. | en_US |
| dc.identifier.doi | 10.1109/SIU.2016.7496105 | |
| dc.identifier.endpage | 1780 | en_US |
| dc.identifier.isbn | 9781509016792 | |
| dc.identifier.scopus | 2-s2.0-84982792161 | |
| dc.identifier.scopusquality | N/A | |
| dc.identifier.startpage | 1777 | en_US |
| dc.identifier.uri | https://doi.org/10.1109/SIU.2016.7496105 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12712/35324 | |
| dc.identifier.wosquality | N/A | |
| dc.language.iso | tr | en_US |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
| dc.relation.ispartof | -- 24th Signal Processing and Communication Application Conference, SIU 2016 -- 2016-05-16 Through 2016-05-19 -- Zonguldak -- 122605 | en_US |
| dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.subject | Feature Selection | en_US |
| dc.subject | Term Weighting | en_US |
| dc.subject | Text Classification | en_US |
| dc.title | Feature Selection in Text Classification | en_US |
| dc.title.alternative | Metin Sınıflandırmada Öznitelik Seçimi | en_US |
| dc.type | Conference Object | en_US |
| dspace.entity.type | Publication |
