Automatic query expansion : a structural linguistic perspective


Author(s)

Symonds, Michael; Bruza, Peter D.; Zuccon, Guido; Koopman, Bevan; Sitbon, Laurianne; Turner, Ian

Date(s)

03/02/2014

Abstract

A user’s query is considered to be an imprecise description of their information need. Automatic query expansion is the process of reformulating the original query with the goal of improving retrieval effectiveness. Many successful query expansion techniques ignore information about the dependencies that exist between words in natural language. However, more recent approaches have demonstrated that explicitly modeling associations between terms can achieve significant improvements in retrieval effectiveness over techniques that ignore these dependencies. State-of-the-art dependency-based approaches have been shown to primarily model syntagmatic associations. A syntagmatic association between two terms indicates that they co-occur more often than would be expected by chance. However, structural linguistics relies on both syntagmatic and paradigmatic associations to deduce the meaning of a word. Given the success of dependency-based approaches and the reliance on word meanings in the query formulation process, we argue that modeling both syntagmatic and paradigmatic information in the query expansion process will improve retrieval effectiveness. This article develops and evaluates a new query expansion technique that is based on a formal, corpus-based model of word meaning that models syntagmatic and paradigmatic associations. We demonstrate that when sufficient statistical information exists, as in the case of longer queries, including paradigmatic information alone provides significant improvements in retrieval effectiveness across a wide variety of data sets. More generally, when our new query expansion approach is applied to large-scale web retrieval, it demonstrates significant improvements in retrieval effectiveness over a strong baseline system based on a commercial search engine.
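The distinction between the two association types can be illustrated with a toy sketch. This is not the article's model (the keywords name a Tensor Encoding Model, which is not reproduced here); it is a minimal illustration under stated assumptions: a pointwise-mutual-information score stands in for syntagmatic association, and cosine similarity between context vectors stands in for paradigmatic association. The function names, the window size, and the four-sentence corpus are all hypothetical.

```python
from collections import Counter, defaultdict
import math


def cooccurrence_counts(sentences, window=2):
    """For each word, count the words seen within `window` positions of it."""
    contexts = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    contexts[word][tokens[j]] += 1
    return contexts


def syntagmatic(contexts, a, b):
    """PMI-style score (an assumed stand-in): positive when a and b
    co-occur more often than their individual frequencies predict."""
    total = sum(sum(c.values()) for c in contexts.values())
    joint = contexts[a][b]
    if joint == 0:
        return float("-inf")  # the pair never co-occurs in the window
    p_a = sum(contexts[a].values()) / total
    p_b = sum(contexts[b].values()) / total
    return math.log((joint / total) / (p_a * p_b))


def paradigmatic(contexts, a, b):
    """Cosine similarity of the two words' context vectors (an assumed
    stand-in): high when a and b appear with the same neighbours,
    whether or not they ever appear together."""
    va = {k: v for k, v in contexts[a].items() if k not in (a, b)}
    vb = {k: v for k, v in contexts[b].items() if k not in (a, b)}
    dot = sum(v * vb.get(k, 0) for k, v in va.items())
    norm_a = math.sqrt(sum(v * v for v in va.values()))
    norm_b = math.sqrt(sum(v * v for v in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy corpus: "doctor" and "physician" never co-occur,
# but they share the same neighbours.
corpus = [
    "the doctor prescribed medicine",
    "the physician prescribed medicine",
    "the doctor examined the patient",
    "the physician examined the patient",
]
ctx = cooccurrence_counts(corpus)

print("syntagmatic(prescribed, medicine):", syntagmatic(ctx, "prescribed", "medicine"))
print("syntagmatic(doctor, physician):   ", syntagmatic(ctx, "doctor", "physician"))
print("paradigmatic(doctor, physician):  ", paradigmatic(ctx, "doctor", "physician"))
```

In this toy corpus, "prescribed" and "medicine" receive a positive syntagmatic score, while "doctor" and "physician" never co-occur yet have near-identical contexts. A measure based on co-occurrence alone would miss the second pair, which is the kind of gap the abstract attributes to approaches that model only syntagmatic associations.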

Format

application/pdf

Identifier

http://eprints.qut.edu.au/61104/

Publisher

John Wiley & Sons, Inc.

Relation

http://eprints.qut.edu.au/61104/1/TQE.pdf

DOI:10.1002/asi.23065

Symonds, Michael, Bruza, Peter D., Zuccon, Guido, Koopman, Bevan, Sitbon, Laurianne, & Turner, Ian (2014) Automatic query expansion : a structural linguistic perspective. Journal of the American Society for Information Science and Technology.

Rights

Copyright 2013 American Society for Information Science and Technology

This is a preprint of an article accepted for publication in Journal of the American Society for Information Science and Technology copyright (C) 2013 (American Society for Information Science and Technology).

Source

Computer Science; School of Information Systems; School of Mathematical Sciences; Science & Engineering Faculty

Keywords

#080600 INFORMATION SYSTEMS #080699 Information Systems not elsewhere classified #Information Retrieval #Automatic Query Expansion #Tensor Encoding Model #Web Search #Structural Linguistics

Type

Journal Article