Publication:
Two New Feature Selection Metrics for Text Classification

dc.authorscopusid56589621700
dc.authorscopusid22953804000
dc.contributor.authorŞahin, D.Ö.
dc.contributor.authorKilic, E.
dc.date.accessioned2020-06-21T13:05:11Z
dc.date.available2020-06-21T13:05:11Z
dc.date.issued2019
dc.departmentOndokuz Mayıs Üniversitesien_US
dc.department-temp[Şahin] Durmuş Ozkan, Department of Computer Engineering, Ondokuz Mayis Üniversitesi, Samsun, Turkey; [Kilic] Erdal, Department of Computer Engineering, Ondokuz Mayis Üniversitesi, Samsun, Turkeyen_US
dc.description.abstractObtaining meaningful information from data has become the main problem. Hence data mining techniques have gained importance. Text classification is one of the most commonly studied areas of data mining. The main problem about text classification is the increase in the required time and a decrease in the success of classification because of data size. To determine the right feature selection methods for text classification is the main purpose of this study. Metrics that are used frequently for feature selection like Chi-square and Information Gain were applied over different data sets and performance was measured. In this study two feature selection metrics, which are based on filtration, are recommended as alternatives to the current ones. The first recommended metric is Relevance Frequency Feature Selection metric that was obtained by adding new parameters to Relevance Frequency method that is used for term weighting in text classification. The second one is the alternative Accuracy2 metric, which was obtained by changing the parameters of Accuracy2 metric. It was observed that the suggested Relevance Frequency Feature Selection and Alternative Accuracy2 metrics offer successful results as the current metrics used frequently. © 2019, © 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.en_US
dc.identifier.doi10.1080/00051144.2019.1602293
dc.identifier.endpage171en_US
dc.identifier.issn0005-1144
dc.identifier.issue2en_US
dc.identifier.scopus2-s2.0-85064462639
dc.identifier.scopusqualityQ2
dc.identifier.startpage162en_US
dc.identifier.urihttps://doi.org/10.1080/00051144.2019.1602293
dc.identifier.volume60en_US
dc.identifier.wosWOS:000475484900004
dc.identifier.wosqualityQ3
dc.language.isoenen_US
dc.publisherTaylor and Francis Ltd. michael.wagreich@univie.ac.aten_US
dc.relation.ispartofAutomatikaen_US
dc.relation.journalAutomatikaen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectFeature Selectionen_US
dc.subjectTerm Selectionen_US
dc.subjectText Classificationen_US
dc.subjectText Miningen_US
dc.titleTwo New Feature Selection Metrics for Text Classificationen_US
dc.typeArticleen_US
dspace.entity.typePublication

Files