Pattern-based topic models for information filtering
Contribuinte(s) |
Cambria, Erik Chen, Ping |
---|---|
Data(s) |
07/12/2013
|
Resumo |
Topic modelling, such as Latent Dirichlet Allocation (LDA), was proposed to generate statistical models to represent multiple topics in a collection of documents, which has been widely utilized in the fields of machine learning and information retrieval, etc. But its effectiveness in information filtering is rarely known. Patterns are always thought to be more representative than single terms for representing documents. In this paper, a novel information filtering model, Pattern-based Topic Model(PBTM) , is proposed to represent the text documents not only using the topic distributions at general level but also using semantic pattern representations at detailed specific level, both of which contribute to the accurate document representation and document relevance ranking. Extensive experiments are conducted to evaluate the effectiveness of PBTM by using the TREC data collection Reuters Corpus Volume 1. The results show that the proposed model achieves outstanding performance. |
Formato |
application/pdf |
Identificador | |
Relação |
http://eprints.qut.edu.au/66686/1/Pattern-based_Topic_Models_for_Information_Filtering_.pdf Gao, Yang, Xu, Yue, & Li, Yuefeng (2013) Pattern-based topic models for information filtering. In Cambria, Erik & Chen, Ping (Eds.) Sentiment Elicitation from Natural Text for Information Retrieval and Extraction (SENTIRE), 7 December 2013, Dallas, Texas. |
Direitos |
Copyright 2013 Please consult the authors |
Fonte |
School of Electrical Engineering & Computer Science; Faculty of Science and Technology; Science & Engineering Faculty |
Palavras-Chave | #080600 INFORMATION SYSTEMS #Topic modeling #Pattern mining #Information filtering #User interest |
Tipo |
Conference Paper |