A new hybrid summarizer based on vector Space model, statistical physics and linguistics


Autoria(s): Cunha Fanego, Iria da; Fernández, Silvia; Velázquez Morales, Patricia; Vivaldi, Jordi; SanJuan, Eric; Torres Moreno, Juan Manuel
Contribuinte(s)

Universitat Pompeu Fabra

Data(s)

02/07/2013

Resumo

In this article we present a hybrid approach for automatic summarization of Spanish medical texts. There are a lot of systems for automatic summarization using statistics or linguistics, but only a few of them combining both techniques. Our idea is that to reach a good summary we need to use linguistic aspects of texts, but as well we should benefit of the advantages of statistical techniques. We have integrated the Cortex (Vector Space Model) and Enertex (statistical physics) systems coupled with the Yate term extractor, and the Disicosum system (linguistics). We have compared these systems and afterwards we have integrated them in a hybrid approach. Finally, we have applied this hybrid system over a corpora of medical articles and we have evaluated their performances obtaining good results.

Identificador

http://hdl.handle.net/10230/16226

Idioma(s)

eng

Publicador

Springer

Direitos

© Springer-Verlag Berlin Heidelberg 2007.

info:eu-repo/semantics/openAccess

Palavras-Chave #Anàlisi automàtica (Lingüística) -- Congressos
Tipo

info:eu-repo/semantics/conferenceObject

info:eu-repo/semantics/acceptedVersion