Combining word semantics within complex Hilbert space for information retrieval


Autoria(s): Wittek, Peter; Koopman, Bevan; Zuccon, Guido; Darányi, Sándor
Data(s)

2013

Resumo

Complex numbers are a fundamental aspect of the mathematical formalism of quantum physics. Quantum-like models developed outside physics often overlooked the role of complex numbers. Specifically, previous models in Information Retrieval (IR) ignored complex numbers. We argue that to advance the use of quantum models of IR, one has to lift the constraint of real-valued representations of the information space, and package more information within the representation by means of complex numbers. As a first attempt, we propose a complex-valued representation for IR, which explicitly uses complex valued Hilbert spaces, and thus where terms, documents and queries are represented as complex-valued vectors. The proposal consists of integrating distributional semantics evidence within the real component of a term vector; whereas, ontological information is encoded in the imaginary component. Our proposal has the merit of lifting the role of complex numbers from a computational byproduct of the model to the very mathematical texture that unifies different levels of semantic information. An empirical instantiation of our proposal is tested in the TREC Medical Record task of retrieving cohorts for clinical studies.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/69312/

Publicador

Springer

Relação

http://eprints.qut.edu.au/69312/1/wittek2013a.pdf

http://download.springer.com/static/pdf/185/chp%253A10.1007%252F978-3-642-54943-4_14.pdf?auth66=1399593606_0198d0912456754a3cb908001a69ca70&ext=.pdf

DOI:10.1007/978-3-642-54943-4

Wittek, Peter, Koopman, Bevan, Zuccon, Guido, & Darányi, Sándor (2013) Combining word semantics within complex Hilbert space for information retrieval. In Lecture Notes in Computer Science : Quantum Interaction - 7th International Conference, QI 2013, Leicester, UK, Springer, University of Leicester, England.

Direitos

Copyright 2014 Springer-Verlag Berlin Heidelberg

Fonte

Institute for Future Environments; School of Information Systems; Science & Engineering Faculty

Tipo

Conference Paper