Automatic Generation of Titles for a Corpus of Questions


Author(s): Cardeñosa, Jesús; Carolina, Carolina
Date(s)

08/04/2010

08/04/2010

2008

Abstract

This paper describes the methodology used to automatically generate titles for a corpus of questions drawn from sociological opinion polls. Titles for questions serve two functions: (1) they serve as the input for user searches and (2) they convey the full content of the question and its possible answer options. Generating titles can therefore be treated as a case of automatic summarization. However, because summarization had to be performed over very short texts, and because of the quality conditions above imposed on the newly generated titles, the authors adopted knowledge-rich, domain-dependent summarization strategies rather than the more common extractive techniques.

Identifier

1313-0455

http://hdl.handle.net/10525/1037

Language(s)

en

Publisher

Institute of Information Theories and Applications FOI ITHEA

Keywords

#Summarization #Text Processing #Subjective Clustering #Content Analysis and Indexing

Type

Article