COMPENDIUM: A text summarization system for generating abstracts of research papers


Author(s): Lloret, Elena; Romá-Ferri, María Teresa; Palomar, Manuel
Contributor(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Universidad de Alicante. Departamento de Enfermería

Procesamiento del Lenguaje y Sistemas de Información (GPLSI)

Date(s)

11/10/2013

11/10/2013

14/08/2013

Abstract

This article analyzes the appropriateness of a text summarization system, COMPENDIUM, for generating abstracts of biomedical papers. Two approaches are suggested: an extractive one (COMPENDIUM E), which only selects and extracts the most relevant sentences of the documents, and an abstractive-oriented one (COMPENDIUM E–A), which also takes on the challenge of abstractive summarization. This novel strategy combines extractive information with pieces of information from the article that have previously been compressed or fused. Specifically, in this article we study: i) whether COMPENDIUM produces good summaries in the biomedical domain; ii) which summarization approach is more suitable; and iii) the opinion of real users towards automatic summaries. To this end, two types of evaluation were performed, quantitative and qualitative, to assess both the information contained in the summaries and user satisfaction. Results show that extractive and abstractive-oriented summaries perform similarly with respect to the information they contain, so both approaches are able to keep the relevant information of the source documents, but the latter is more appropriate from a human perspective when user satisfaction is assessed. This also confirms the suitability of our suggested approach for generating summaries following an abstractive-oriented paradigm.

This research was partially supported by the FPI grant (BES-2007-16268) and the project grants TEXT-MESS (TIN2006-15265-C06-01), TEXT-MESS 2.0 (TIN2009-13391-C04) and LEGOLANG (TIN2012-31224) from the Spanish Government. It was also funded by the Valencian Government (grants no. PROMETEO/2009/119 and ACOMP/2011/001).

Identifier

LLORET, Elena; ROMÁ-FERRI, María Teresa; PALOMAR, Manuel. "COMPENDIUM: A text summarization system for generating abstracts of research papers". Data & Knowledge Engineering. Article In Press (Available online 14 August 2013). ISSN 0169-023X

0169-023X (Print)

1872-6933 (Online)

http://hdl.handle.net/10045/33138

10.1016/j.datak.2013.08.005

A7147967

Language(s)

eng

Publisher

Elsevier

Relation

http://dx.doi.org/10.1016/j.datak.2013.08.005

Rights

info:eu-repo/semantics/restrictedAccess

Keywords #Human language technologies #NLP applications #Text summarization #Information systems #Lenguajes y Sistemas Informáticos

Type

info:eu-repo/semantics/article