Rough set based approach to text classification
Contribuinte(s) |
Raghaven, Vijay Hu, Xiaolin Liau, Churn-Jung Treur, Jan |
---|---|
Data(s) |
2013
|
Resumo |
Textual document set has become an important and rapidly growing information source in the web. Text classification is one of the crucial technologies for information organisation and management. Text classification has become more and more important and attracted wide attention of researchers from different research fields. In this paper, many feature selection methods, the implement algorithms and applications of text classification are introduced firstly. However, because there are much noise in the knowledge extracted by current data-mining techniques for text classification, it leads to much uncertainty in the process of text classification which is produced from both the knowledge extraction and knowledge usage, therefore, more innovative techniques and methods are needed to improve the performance of text classification. It has been a critical step with great challenge to further improve the process of knowledge extraction and effectively utilization of the extracted knowledge. Rough Set decision making approach is proposed to use Rough Set decision techniques to more precisely classify the textual documents which are difficult to separate by the classic text classification methods. The purpose of this paper is to give an overview of existing text classification technologies, to demonstrate the Rough Set concepts and the decision making approach based on Rough Set theory for building more reliable and effective text classification framework with higher precision, to set up an innovative evaluation metric named CEI which is very effective for the performance assessment of the similar research, and to propose a promising research direction for addressing the challenging problems in text classification, text mining and other relative fields. |
Identificador | |
Publicador |
IEEE |
Relação |
DOI:10.1109/WI-IAT.2013.190 Zhang, Libiao, Li, Yuefeng, Sun, Chao, & Nadee, Wanvimol (2013) Rough set based approach to text classification. In Raghaven, Vijay, Hu, Xiaolin, Liau, Churn-Jung, & Treur, Jan (Eds.) Proceedings of the 2013 IEEE/WIC/ACM International Conferences on Web Intelligence (WI) and Intelligent Agent Technology (IAT), IEEE, Atlanta, GA, pp. 245-252. |
Direitos |
Copyright 2013 by The Institute of Electrical and Electronics Engineers, Inc. |
Fonte |
School of Electrical Engineering & Computer Science; Science & Engineering Faculty |
Palavras-Chave | #Machine learning #Text classification #Feature selection #Rough set #Decision making |
Tipo |
Conference Paper |