Active learning: A step towards automating medical concept extraction
Data(s) |
2015
|
---|---|
Resumo |
Objective This paper presents an automatic active learning-based system for the extraction of medical concepts from clinical free-text reports. Specifically, (1) the contribution of active learning in reducing the annotation effort, and (2) the robustness of incremental active learning framework across different selection criteria and datasets is determined. Materials and methods The comparative performance of an active learning framework and a fully supervised approach were investigated to study how active learning reduces the annotation effort while achieving the same effectiveness as a supervised approach. Conditional Random Fields as the supervised method, and least confidence and information density as two selection criteria for active learning framework were used. The effect of incremental learning vs. standard learning on the robustness of the models within the active learning framework with different selection criteria was also investigated. Two clinical datasets were used for evaluation: the i2b2/VA 2010 NLP challenge and the ShARe/CLEF 2013 eHealth Evaluation Lab. Results The annotation effort saved by active learning to achieve the same effectiveness as supervised learning is up to 77%, 57%, and 46% of the total number of sequences, tokens, and concepts, respectively. Compared to the Random sampling baseline, the saving is at least doubled. Discussion Incremental active learning guarantees robustness across all selection criteria and datasets. The reduction of annotation effort is always above random sampling and longest sequence baselines. Conclusion Incremental active learning is a promising approach for building effective and robust medical concept extraction models, while significantly reducing the burden of manual annotation. |
Formato |
application/pdf |
Identificador | |
Publicador |
Oxford University Press |
Relação |
http://eprints.qut.edu.au/85672/3/85672.pdf DOI:10.1093/jamia/ocv069 Kholghi, Mahnoosh, Sitbon, Laurianne, Zuccon, Guido, & Nguyen, Anthony (2015) Active learning: A step towards automating medical concept extraction. Journal of the American Medical Informatics Association. |
Direitos |
Copyright 2015 Oxford University Press This is a pre-copyedited, author-produced PDF of an article accepted for publication in Journal of the American Medical Informatics Association following peer review. The version of record [insert complete citation information here] is available online at: xxxxxxx [insert URL that the author will receive upon publication here]. |
Fonte |
School of Electrical Engineering & Computer Science; School of Information Systems; Science & Engineering Faculty |
Palavras-Chave | #Medical Concept Extraction #Clinical Free Text #Active Learning #Conditional Random Fields #Robustness Analysis |
Tipo |
Journal Article |