Towards automatic tweet generation: A comparative study from the text summarization perspective in the journalism genre


Autoria(s): Lloret, Elena; Palomar, Manuel
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Procesamiento del Lenguaje y Sistemas de Información (GPLSI)

Data(s)

08/09/2014

08/09/2014

15/11/2013

Resumo

In recent years, Twitter has become one of the most important microblogging services of the Web 2.0. Among the possible uses it allows, it can be employed for communicating and broadcasting information in real time. The goal of this research is to analyze the task of automatic tweet generation from a text summarization perspective in the context of the journalism genre. To achieve this, different state-of-the-art summarizers are selected and employed for producing multi-lingual tweets in two languages (English and Spanish). A wide experimental framework is proposed, comprising the creation of a new corpus, the generation of the automatic tweets, and their assessment through a quantitative and a qualitative evaluation, where informativeness, indicativeness and interest are key criteria that should be ensured in the proposed context. From the results obtained, it was observed that although the original tweets were considered as model tweets with respect to their informativeness, they were not among the most interesting ones from a human viewpoint. Therefore, relying only on these tweets may not be the ideal way to communicate news through Twitter, especially if a more personalized and catchy way of reporting news wants to be performed. In contrast, we showed that recent text summarization techniques may be more appropriate, reflecting a balance between indicativeness and interest, even if their content was different from the tweets delivered by the news providers.

This research work has been partially funded by the Spanish Government (Ministerio de Economía y competitividad) through the project “Técnicas de Deconstrucción en la Tecnologías del Lenguaje Humano” (TIN2012–31224), and by the Valencian Government through projects PROMETEO (PROMETEO/2009/199) and ACOMP/2011/001.

Identificador

Expert Systems with Applications. 2013, 40(16): 6624-6630. doi:10.1016/j.eswa.2013.06.021

0957-4174 (Print)

1873-6793 (Online)

http://hdl.handle.net/10045/40120

10.1016/j.eswa.2013.06.021

Idioma(s)

eng

Publicador

Elsevier

Relação

http://dx.doi.org/10.1016/j.eswa.2013.06.021

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #Generation #Twitter #Automatic tweet generation #Text summarization #Informativeness #Indicativeness #Interest #Lenguajes y Sistemas Informáticos
Tipo

info:eu-repo/semantics/article