An Information Theoretic Approach to Automatic Query Expansion


Author(s): Carpineto, Claudio; De Mori, Renato; Romano, Giovanni; Bigi, Brigitte
Contributor(s)

Fondazione Ugo Bordoni ; Fondazione Ugo Bordoni

Laboratoire Informatique d'Avignon (LIA) ; Université d'Avignon et des Pays de Vaucluse (UAPV) - Centre d'Enseignement et de Recherche en Informatique - CERI

Date(s)

2001

Abstract

International audience

Techniques for automatic query expansion from top retrieved documents have shown promise for improving retrieval effectiveness on large collections; however, they often rest on empirical grounds, and there is a shortage of cross-system comparisons. Using ideas from Information Theory, we present a computationally simple and theoretically justified method for assigning scores to candidate expansion terms. Such scores are used to select and weight expansion terms within Rocchio's framework for query reweighting. We compare ranking with information-theoretic query expansion versus ranking with other query expansion techniques, showing that the former achieves better retrieval effectiveness on several performance measures. We also discuss the effect on retrieval effectiveness of the main parameters involved in automatic query expansion, such as data sparseness, query difficulty, number of selected documents, and number of selected terms, pointing out interesting relationships.
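
The pipeline outlined in the abstract (score candidate terms from the top retrieved documents, then select and weight them in a Rocchio-style update) can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so the sketch assumes one common information-theoretic choice: each term's contribution to the Kullback-Leibler divergence between the feedback-document distribution and the collection distribution. The function names `kld_term_scores` and `expand_query`, the parameters `alpha`, `beta`, and `n_terms`, and the toy data are all hypothetical and are not taken from the article.

```python
import math
from collections import Counter


def kld_term_scores(feedback_docs, collection_docs):
    """Score terms by p_R(t) * log(p_R(t) / p_C(t)).

    p_R is the term distribution in the feedback (top retrieved) documents,
    p_C the distribution in the whole collection. Assumed scoring function.
    """
    fb = Counter(t for doc in feedback_docs for t in doc)
    coll = Counter(t for doc in collection_docs for t in doc)
    fb_total = sum(fb.values())
    coll_total = sum(coll.values())
    scores = {}
    for term, count in fb.items():
        p_r = count / fb_total
        p_c = coll[term] / coll_total
        if p_c == 0:
            continue  # term absent from the collection statistics; skip it
        scores[term] = p_r * math.log(p_r / p_c)
    return scores


def expand_query(query_weights, scores, n_terms=2, alpha=1.0, beta=0.5):
    """Rocchio-style reweighting: alpha * original + beta * normalized score."""
    top = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:n_terms]
    max_score = max((s for _, s in top), default=0.0)
    expanded = {t: alpha * w for t, w in query_weights.items()}
    if max_score <= 0:
        return expanded  # no term is more typical of the feedback set
    for term, score in top:
        if score > 0:
            expanded[term] = expanded.get(term, 0.0) + beta * score / max_score
    return expanded


# Toy collection of tokenized documents; the first two play the role of
# the top retrieved (pseudo-relevant) documents.
collection = [
    ["query", "expansion", "term"],
    ["query", "expansion", "score"],
    ["weather", "rain", "sun"],
    ["weather", "cloud", "sun"],
]
scores = kld_term_scores(collection[:2], collection)
expanded = expand_query({"query": 1.0}, scores)
```

In this sketch, terms that are much more frequent in the feedback documents than in the collection receive the highest scores, and the normalization by the top score keeps the expansion weights on the same scale as the original query weights.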

Identifier

hal-01392277

https://hal.archives-ouvertes.fr/hal-01392277

Language(s)

en

Publisher

HAL CCSD

Association for Computing Machinery

Source

ISSN: 1046-8188

ACM Transactions on Information Systems

ACM Transactions on Information Systems, Association for Computing Machinery, 2001, 19 (1), http://dl.acm.org/citation.cfm?doid=366836.366860

Keywords

#Query expansion #[INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL] #[SHS.INFO] Humanities and Social Sciences/Library and information sciences #[INFO.INFO-IR] Computer Science [cs]/Information Retrieval [cs.IR]

Type

info:eu-repo/semantics/article

Journal articles