Back to Search
Journal ArticleOpen Access

Relevant‐Based Feature Ranking (RBFR) Method for Text Classification Based on Machine Learning Algorithm

Author Affiliations
Siddhartha Medical College, Vellore Institute of Technology University, Graphic Era University, KPR Institute of Engineering and Technology, ...
Published InJournal of Nanomaterials
Year2022
Citations18

Abstract

High dimensionality of the feature space is one of the problems in the field of text classification. Identification of optimal subset of features can optimize text classification process in terms of processing time and performance. In this paper, we propose a novel Relevant‐Based Feature Ranking (RBFR) algorithm which identifies and selects smaller subsets of more relevant features in the feature space. We compared the performance of the RBFR against other existing feature selection methods such as balanced accuracy measure, information gain, Gini index, and odds ratio on 3 datasets, namely, 20 newsgroup, Reuters, and WAP datasets. We have used 5 machine learning models (SVM, NB, kNN, RF, and LR) to test and evaluate the proposed feature selection method. We found…
View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.