Mining specific features for acquiring user information needs
Data(s) |
2013
|
---|---|
Resumo |
Term-based approaches can extract many features in text documents, but most include noise. Many popular text-mining strategies have been adapted to reduce noisy information from extracted features; however, text-mining techniques suffer from low frequency. The key issue is how to discover relevance features in text documents to fulfil user information needs. To address this issue, we propose a new method to extract specific features from user relevance feedback. The proposed approach includes two stages. The first stage extracts topics (or patterns) from text documents to focus on interesting topics. In the second stage, topics are deployed to lower level terms to address the low-frequency problem and find specific terms. The specific terms are determined based on their appearances in relevance feedback and their distribution in topics or high-level patterns. We test our proposed method with extensive experiments in the Reuters Corpus Volume 1 dataset and TREC topics. Results show that our proposed approach significantly outperforms the state-of-the-art models. |
Identificador | |
Publicador |
Springer Berlin Heidelberg |
Relação |
DOI:10.1007/978-3-642-37453-1_44 Abdulmohsen, Algarni & Li, Yuefeng (2013) Mining specific features for acquiring user information needs. Lecture Notes in Computer Science : Advances in Knowledge Discovery and Data Mining, 7818, pp. 532-543. http://purl.org/au-research/grants/ARC/DP0988007 |
Direitos |
Springer-Verlag Berlin Heidelberg |
Fonte |
School of Electrical Engineering & Computer Science; Science & Engineering Faculty |
Palavras-Chave | #Feature extraction #Pattern mining #Relevance feedback #Text classification |
Tipo |
Journal Article |