Application of Text Summarization techniques to the Geographical Information Retrieval task


Autoria(s): Perea Ortega, José Manuel; Lloret, Elena; Ureña López, Luis Alfonso; Palomar, Manuel
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Procesamiento del Lenguaje y Sistemas de Información (GPLSI)

Data(s)

08/09/2014

08/09/2014

15/06/2013

Resumo

Automatic Text Summarization has been shown to be useful for Natural Language Processing tasks such as Question Answering or Text Classification and other related fields of computer science such as Information Retrieval. Since Geographical Information Retrieval can be considered as an extension of the Information Retrieval field, the generation of summaries could be integrated into these systems by acting as an intermediate stage, with the purpose of reducing the document length. In this manner, the access time for information searching will be improved, while at the same time relevant documents will be also retrieved. Therefore, in this paper we propose the generation of two types of summaries (generic and geographical) applying several compression rates in order to evaluate their effectiveness in the Geographical Information Retrieval task. The evaluation has been carried out using GeoCLEF as evaluation framework and following an Information Retrieval perspective without considering the geo-reranking phase commonly used in these systems. Although single-document summarization has not performed well in general, the slight improvements obtained for some types of the proposed summaries, particularly for those based on geographical information, made us believe that the integration of Text Summarization with Geographical Information Retrieval may be beneficial, and consequently, the experimental set-up developed in this research work serves as a basis for further investigations in this field.

This work has been partially funded by the European Commission under the Seventh (FP7-2007-2013) Framework Programme for Research and Technological Development through the FIRST project (FP7-287607). It has also been partially supported by a grant from the Fondo Europeo de Desarrollo Regional (FEDER), projects TEXT-MESS 2.0 (TIN2009-13391-C04-01) and TEXT-COOL 2.0 (TIN2009-13391-C04-02) from the Spanish Government, a Grant from the Valencian Government, project "Desarrollo de Técnicas Inteligentes e Interactivas de Minería de Textos" (PROMETEO/2009/119), and a Grant No. ACOMP/2011/001.

Identificador

Expert Systems with Applications. 2013, 40(8): 2966-2974. doi:10.1016/j.eswa.2012.12.012

0957-4174 (Print)

1873-6793 (Online)

http://hdl.handle.net/10045/40113

10.1016/j.eswa.2012.12.012

Idioma(s)

eng

Publicador

Elsevier

Relação

http://dx.doi.org/10.1016/j.eswa.2012.12.012

info:eu-repo/grantAgreement/EC/FP7/287607

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #Geographical information retrieval #Text summarization #Information retrieval #GeoCLEF #Lenguajes y Sistemas Informáticos
Tipo

info:eu-repo/semantics/article