COMPENDIUM: a text summarisation tool for generating summaries of multiple purposes, domains, and genres


Autoria(s): Lloret, Elena; Palomar, Manuel
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Procesamiento del Lenguaje y Sistemas de Información (GPLSI)

Data(s)

26/03/2014

26/03/2014

16/07/2012

Resumo

In this paper, we present a Text Summarisation tool, compendium, capable of generating the most common types of summaries. Regarding the input, single- and multi-document summaries can be produced; as the output, the summaries can be extractive or abstractive-oriented; and finally, concerning their purpose, the summaries can be generic, query-focused, or sentiment-based. The proposed architecture for compendium is divided in various stages, making a distinction between core and additional stages. The former constitute the backbone of the tool and are common for the generation of any type of summary, whereas the latter are used for enhancing the capabilities of the tool. The main contributions of compendium with respect to the state-of-the-art summarisation systems are that (i) it specifically deals with the problem of redundancy, by means of textual entailment; (ii) it combines statistical and cognitive-based techniques for determining relevant content; and (iii) it proposes an abstractive-oriented approach for facing the challenge of abstractive summarisation. The evaluation performed in different domains and textual genres, comprising traditional texts, as well as texts extracted from the Web 2.0, shows that compendium is very competitive and appropriate to be used as a tool for generating summaries.

This research has been supported by the project “Desarrollo de Técnicas Inteligentes e Interactivas de Minería de Textos” (PROMETEO/2009/119) and the project reference ACOMP/2011/001 from the Valencian Government, as well as by the Spanish Government (grant no. TIN2009-13391-C04-01).

Identificador

Natural Language Engineering. 2013, 19(2): 147-186. doi:10.1017/S1351324912000198

1351-3249 (Print)

1469-8110 (Online)

http://hdl.handle.net/10045/36344

10.1017/S1351324912000198

Idioma(s)

eng

Publicador

Cambridge University Press

Relação

http://dx.doi.org/10.1017/S1351324912000198

Direitos

© Cambridge University Press 2012

info:eu-repo/semantics/openAccess

Palavras-Chave #Text summarisation tool #COMPENDIUM #Generating summaries #Lenguajes y Sistemas Informáticos
Tipo

info:eu-repo/semantics/article