Concept learning of text documents


Autoria(s): An, Jiyuan; Chen, Yi-Ping Phoebe
Contribuinte(s)

Zhong, Ning

Tirri, Henry

Yao, Yiyu

Zhou, Lizhu

Liu, Jiming

Cercone, Nick

Data(s)

01/01/2004

Resumo

Concept learning of text documents can be viewed as the problem of acquiring the definition of a general category of documents. To definite the category of a text document, the Conjunctive of keywords is usually be used. These keywords should be fewer and comprehensible. A naïve method is enumerating all combinations of keywords to extract suitable ones. However, because of the enormous number of keyword combinations, it is impossible to extract the most relevant keywords to describe the categories of documents by enumerating all possible combinations of keywords. Many heuristic methods are proposed, such as GA-base, immune based algorithm. In this work, we introduce pruning power technique and propose a robust enumeration-based concept learning algorithm. Experimental results show that the rules produce by our approach has more comprehensible and simplicity than by other methods. <br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30005267

Idioma(s)

eng

Publicador

IEEE Xplore

Relação

http://dro.deakin.edu.au/eserv/DU:30005267/chen-conceptlearningoftext-2004.pdf

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1410900

Direitos

2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Tipo

Conference Paper