A topic based document relevance ranking model


Autoria(s): Gao, Yang; Xu, Yue; Li, Yuefeng
Data(s)

07/04/2014

Resumo

Topic modelling has been widely used in the fields of information retrieval, text mining, machine learning, etc. In this paper, we propose a novel model, Pattern Enhanced Topic Model (PETM), which makes improvements to topic modelling by semantically representing topics with discriminative patterns, and also makes innovative contributions to information filtering by utilising the proposed PETM to determine document relevance based on topics distribution and maximum matched patterns proposed in this paper. Extensive experiments are conducted to evaluate the effectiveness of PETM by using the TREC data collection Reuters Corpus Volume 1. The results show that the proposed model significantly outperforms both state-of-the-art term-based models and pattern-based models.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/67263/

Relação

http://eprints.qut.edu.au/67263/1/A_Topic_based_Document_Relevance_Ranking_Model.pdf

Gao, Yang, Xu, Yue, & Li, Yuefeng (2014) A topic based document relevance ranking model. In 23rd International World Wide Web Conference, 7-11 April 2014, Seoul, Korea.

Direitos

Copyright 2014 Please consult the authors

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080000 INFORMATION AND COMPUTING SCIENCES #080505 Web Technologies (excl. Web Search) #Relevance Ranking #Topic Modelling #Pattern Mining
Tipo

Conference Paper