An evaluation of corpus-driven measures of medical concept similarity for information retrieval


Autoria(s): Koopman, Bevan; Zuccon, Guido; Bruza, Peter D.; Sitbon, Laurianne; Lawley, Michael J.
Contribuinte(s)

Lebanon, Guy

Zaki, Mohammed

Wang, Haixun

Data(s)

2012

Resumo

Measures of semantic similarity between medical concepts are central to a number of techniques in medical informatics, including query expansion in medical information retrieval. Previous work has mainly considered thesaurus-based path measures of semantic similarity and has not compared different corpus-driven approaches in depth. We evaluate the effectiveness of eight common corpus-driven measures in capturing semantic relatedness and compare these against human judged concept pairs assessed by medical professionals. Our results show that certain corpus-driven measures correlate strongly (approx 0.8) with human judgements. An important finding is that performance was significantly affected by the choice of corpus used in priming the measure, i.e., used as evidence from which corpus-driven similarities are drawn. This paper provides guidelines for the implementation of semantic similarity measures for medical informatics and concludes with implications for medical information retrieval.

Identificador

http://eprints.qut.edu.au/58993/

Publicador

ACM

Relação

DOI:10.1145/2396761.2398661

Koopman, Bevan, Zuccon, Guido, Bruza, Peter D., Sitbon, Laurianne, & Lawley, Michael J. (2012) An evaluation of corpus-driven measures of medical concept similarity for information retrieval. In Lebanon, Guy, Zaki, Mohammed, & Wang, Haixun (Eds.) Proceedings of the 21st ACM international conference on Information and knowledge management, ACM, Hawaii, The United States of America, pp. 2439-2442.

Direitos

Copyright 2012 ACM.

Fonte

School of Information Systems; Science & Engineering Faculty

Palavras-Chave #080600 INFORMATION SYSTEMS #Semantic similarity #Medical information retrieval
Tipo

Conference Paper