Identify disorders in health records using Conditional Random Fields and Metamap: AEHRC at ShARe/CLEF 2013 eHealth Evaluation Lab Task 1


Autoria(s): Zuccon, G.; Holloway, A.; Koopman, B.; Nguyen, A.
Data(s)

01/09/2013

Resumo

The Australian e-Health Research Centre (AEHRC) recently participated in the ShARe/CLEF eHealth Evaluation Lab Task 1. The goal of this task is to individuate mentions of disorders in free-text electronic health records and map disorders to SNOMED CT concepts in the UMLS metathesaurus. This paper details our participation to this ShARe/CLEF task. Our approaches are based on using the clinical natural language processing tool Metamap and Conditional Random Fields (CRF) to individuate mentions of disorders and then to map those to SNOMED CT concepts. Empirical results obtained on the 2013 ShARe/CLEF task highlight that our instance of Metamap (after ltering irrelevant semantic types), although achieving a high level of precision, is only able to identify a small amount of disorders (about 21% to 28%) from free-text health records. On the other hand, the addition of the CRF models allows for a much higher recall (57% to 79%) of disorders from free-text, without sensible detriment in precision. When evaluating the accuracy of the mapping of disorders to SNOMED CT concepts in the UMLS, we observe that the mapping obtained by our ltered instance of Metamap delivers state-of-the-art e ectiveness if only spans individuated by our system are considered (`relaxed' accuracy).

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/62875/

Relação

http://eprints.qut.edu.au/62875/1/clef2013_t1_workingnotes.pdf

http://www.clef-initiative.eu/documents/71612/79de7556-04ef-4578-8265-4ab7058cecd7

Zuccon, G., Holloway, A., Koopman, B., & Nguyen, A. (2013) Identify disorders in health records using Conditional Random Fields and Metamap: AEHRC at ShARe/CLEF 2013 eHealth Evaluation Lab Task 1. In Proceedings of CLEF Workshop on Cross-Language Evaluation of Methods, Applications, and Resources for eHealth Document Analysis, Valencia, Spain.

Direitos

Copyright 2013 Please consult the authors

Fonte

School of Information Systems; Science & Engineering Faculty

Tipo

Conference Paper