COMPENDIUM: a text summarisation tool for generating summaries of multiple purposes, domains, and genres
Contribuinte(s) |
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos Procesamiento del Lenguaje y Sistemas de Información (GPLSI) |
---|---|
Data(s) |
26/03/2014
26/03/2014
16/07/2012
|
Resumo |
In this paper, we present a Text Summarisation tool, compendium, capable of generating the most common types of summaries. Regarding the input, single- and multi-document summaries can be produced; as the output, the summaries can be extractive or abstractive-oriented; and finally, concerning their purpose, the summaries can be generic, query-focused, or sentiment-based. The proposed architecture for compendium is divided in various stages, making a distinction between core and additional stages. The former constitute the backbone of the tool and are common for the generation of any type of summary, whereas the latter are used for enhancing the capabilities of the tool. The main contributions of compendium with respect to the state-of-the-art summarisation systems are that (i) it specifically deals with the problem of redundancy, by means of textual entailment; (ii) it combines statistical and cognitive-based techniques for determining relevant content; and (iii) it proposes an abstractive-oriented approach for facing the challenge of abstractive summarisation. The evaluation performed in different domains and textual genres, comprising traditional texts, as well as texts extracted from the Web 2.0, shows that compendium is very competitive and appropriate to be used as a tool for generating summaries. This research has been supported by the project “Desarrollo de Técnicas Inteligentes e Interactivas de Minería de Textos” (PROMETEO/2009/119) and the project reference ACOMP/2011/001 from the Valencian Government, as well as by the Spanish Government (grant no. TIN2009-13391-C04-01). |
Identificador |
Natural Language Engineering. 2013, 19(2): 147-186. doi:10.1017/S1351324912000198 1351-3249 (Print) 1469-8110 (Online) http://hdl.handle.net/10045/36344 10.1017/S1351324912000198 |
Idioma(s) |
eng |
Publicador |
Cambridge University Press |
Relação |
http://dx.doi.org/10.1017/S1351324912000198 |
Direitos |
© Cambridge University Press 2012 info:eu-repo/semantics/openAccess |
Palavras-Chave | #Text summarisation tool #COMPENDIUM #Generating summaries #Lenguajes y Sistemas Informáticos |
Tipo |
info:eu-repo/semantics/article |