Mining specific features for acquiring user information needs


Author(s): Abdulmohsen, Algarni; Li, Yuefeng
Date(s)

2013

Abstract

Term-based approaches can extract many features from text documents, but most of these features include noise. Many popular text-mining strategies have been adapted to reduce noisy information in the extracted features; however, these techniques suffer from the low-frequency problem. The key issue is how to discover relevance features in text documents that fulfil user information needs. To address this issue, we propose a new method for extracting specific features from user relevance feedback. The proposed approach has two stages. The first stage extracts topics (or patterns) from text documents to focus on interesting topics. In the second stage, topics are deployed to lower-level terms to address the low-frequency problem and find specific terms. The specific terms are determined by their appearances in relevance feedback and their distribution in topics or high-level patterns. We test the proposed method with extensive experiments on the Reuters Corpus Volume 1 dataset and TREC topics. The results show that the proposed approach significantly outperforms state-of-the-art models.
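
The two stages described in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: the abstract gives no formulas, so the pattern miner (frequent term sets), the even split of a pattern's support across its terms, the specificity ratio, and every function and parameter name below are assumptions chosen only to make the idea concrete.

```python
from collections import Counter
from itertools import combinations


def mine_patterns(relevant_docs, min_support=2, max_size=2):
    """Stage 1 (assumed form): term sets that occur in at least
    min_support relevant documents."""
    counts = Counter()
    for doc in relevant_docs:
        terms = sorted(set(doc))
        for size in range(1, max_size + 1):
            counts.update(combinations(terms, size))
    return {p: c for p, c in counts.items() if c >= min_support}


def deploy_to_terms(patterns):
    """Stage 2 (assumed form): spread each pattern's support evenly
    over the terms it contains."""
    weights = Counter()
    for pattern, support in patterns.items():
        for term in pattern:
            weights[term] += support / len(pattern)
    return dict(weights)


def rank_specific_terms(relevant_docs, irrelevant_docs,
                        min_support=2, min_specificity=0.8):
    """Keep terms that appear mostly in relevant feedback, scored by
    deployed weight times a hypothetical specificity ratio."""
    weights = deploy_to_terms(mine_patterns(relevant_docs, min_support))
    all_docs = relevant_docs + irrelevant_docs
    scores = {}
    for term, weight in weights.items():
        rel_df = sum(term in doc for doc in relevant_docs)
        total_df = sum(term in doc for doc in all_docs)
        specificity = rel_df / total_df
        if specificity >= min_specificity:
            scores[term] = weight * specificity
    return scores


# Toy feedback, invented for illustration only.
relevant = [["stock", "market", "crash"], ["stock", "market", "rally"]]
irrelevant = [["market", "garden", "produce"]]
print(rank_specific_terms(relevant, irrelevant))
```

In this toy data, "market" also appears in the irrelevant document, so it falls below the assumed specificity threshold, while "stock" is kept because it occurs only in relevant feedback.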

Identifier

http://eprints.qut.edu.au/68727/

Publisher

Springer Berlin Heidelberg

Relation

DOI:10.1007/978-3-642-37453-1_44

Abdulmohsen, Algarni & Li, Yuefeng (2013) Mining specific features for acquiring user information needs. Lecture Notes in Computer Science : Advances in Knowledge Discovery and Data Mining, 7818, pp. 532-543.

http://purl.org/au-research/grants/ARC/DP0988007

Rights

Springer-Verlag Berlin Heidelberg

Source

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Keywords #Feature extraction #Pattern mining #Relevance feedback #Text classification

Type

Journal Article