Mining topical relevant patterns for multi-document summarization


Autoria(s): Wu, Yutong; Gao, Yang; Li, Yuefeng; Xu, Yue; Chen, Meihua
Data(s)

2015

Resumo

Multi-document summarization addressing the problem of information overload has been widely utilized in the various real-world applications. Most of existing approaches adopt term-based representation for documents which limit the performance of multi-document summarization systems. In this paper, we proposed a novel pattern-based topic model (PBTMSum) for the task of the multi-document summarization. PBTMSum combining pattern mining techniques with LDA topic modelling could generate discriminative and semantic rich representations for topics and documents so that the most representative and non-redundant sentences can be selected to form a succinct and informative summary. Extensive experiments are conducted on the data of document understanding conference (DUC) 2007. The results prove the effectiveness and efficiency of our proposed approach.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/94109/

Publicador

IEEE

Relação

http://eprints.qut.edu.au/94109/1/PID3878471%281%29.pdf

DOI:10.1109/WI-IAT.2015.136

Wu, Yutong, Gao, Yang, Li, Yuefeng, Xu, Yue, & Chen, Meihua (2015) Mining topical relevant patterns for multi-document summarization. In 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), IEEE, Singapore, pp. 114-117.

Direitos

Copyright 2015 IEEE

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080107 Natural Language Processing #080109 Pattern Recognition and Data Mining #multi-document summarization #pattern mining #topic model
Tipo

Conference Paper