Analysis of the effect of negation on information retrieval of medical data


Autoria(s): Koopman, Bevan; Bruza, Peter D.; Sitbon, Laurianne; Lawley, Michael J.
Data(s)

2010

Resumo

Most information retrieval (IR) models treat the presence of a term within a document as an indication that the document is somehow "about" that term, they do not take into account when a term might be explicitly negated. Medical data, by its nature, contains a high frequency of negated terms - e.g. "review of systems showed no chest pain or shortness of breath". This papers presents a study of the effects of negation on information retrieval. We present a number of experiments to determine whether negation has a significant negative affect on IR performance and whether language models that take negation into account might improve performance. We use a collection of real medical records as our test corpus. Our findings are that negation has some affect on system performance, but this will likely be confined to domains such as medical data where negation is prevalent.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/38615/

Publicador

University of Melbourne

Relação

http://eprints.qut.edu.au/38615/1/c38615.pdf

http://www.cs.rmit.edu.au/adcs2010/index.php

Koopman, Bevan, Bruza, Peter D., Sitbon, Laurianne, & Lawley, Michael J. (2010) Analysis of the effect of negation on information retrieval of medical data. In Proceedings of 15th Australasian Document Computing Symposium (ADCS), University of Melbourne, University of Melbourne, Melbourne, Victoria.

Direitos

Copyright 2010 [please consult the authors]

Fonte

Faculty of Science and Technology; Information Systems

Palavras-Chave #080704 Information Retrieval and Web Search
Tipo

Conference Paper