Extractive text summarization: can we use the same techniques for any text?


Autoria(s): Vodolazova, Tatiana; Lloret, Elena; Muñoz, Rafael; Palomar, Manuel
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Procesamiento del Lenguaje y Sistemas de Información (GPLSI)

Data(s)

11/10/2013

11/10/2013

2013

Resumo

In this paper we address two issues. The first one analyzes whether the performance of a text summarization method depends on the topic of a document. The second one is concerned with how certain linguistic properties of a text may affect the performance of a number of automatic text summarization methods. For this we consider semantic analysis methods, such as textual entailment and anaphora resolution, and we study how they are related to proper noun, pronoun and noun ratios calculated over original documents that are grouped into related topics. Given the obtained results, we can conclude that although our first hypothesis is not supported, since it has been found no evident relationship between the topic of a document and the performance of the methods employed, adapting summarization systems to the linguistic properties of input documents benefits the process of summarization.

This research work has been partially funded by the European Commission under the Seventh (FP7 - 2007-2013) Framework Programme for Research and Technological Development through the FIRST project (FP7-287607); the Spanish Government through the project TEXTMESS 2.0 (TIN2009-13391-C04), ”Análisis de Tendencias Mediante Técnicas de Opinión Semántica” (TIN2012-38536-C03-03 ) and “Técnicas de Deconstrucción en la Tecnologías del Lenguaje Humano” (TIN2012-31224); and by the Valencian Government through the project PROMETEO (PROMETEO/2009/199).

Identificador

VODOLAZOVA, Tatiana, et al. "Extractive text summarization: can we use the same techniques for any text?". En: Natural Language Processing and Information Systems : 18th International Conference on Applications of Natural Language to Information Systems, NLDB 2013, Salford, UK, June 19-21, 2013, Proceedings. Berlin : Springer, 2013. (Lecture Notes in Computer Science; 7934). ISBN 978-3-642-38823-1, pp. 164-175

978-3-642-38823-1

0302-9743 (Print)

1611-3349 (Online)

http://hdl.handle.net/10045/33139

10.1007/978-3-642-38824-8_14

A7147986

Idioma(s)

eng

Publicador

Springer Berlin / Heidelberg

Relação

http://dx.doi.org/10.1007/978-3-642-38824-8_14

info:eu-repo/grantAgreement/EC/FP7/287607

Direitos

The original publication is available at www.springerlink.com

info:eu-repo/semantics/restrictedAccess

Palavras-Chave #Text summarization #Textual entailment #Anaphora resolution #Lenguajes y Sistemas Informáticos
Tipo

info:eu-repo/semantics/article