Analysis of word embeddings and sequence features for clinical information extraction


Autoria(s): De Vine, Lance; Kholghi, Mahnoosh; Zuccon, Guido; Sitbon, Laurianne; Nguyen, Anthony
Data(s)

02/10/2015

Resumo

This study investigates the use of unsupervised features derived from word embedding approaches and novel sequence representation approaches for improving clinical information extraction systems. Our results corroborate previous findings that indicate that the use of word embeddings significantly improve the effectiveness of concept extraction models; however, we further determine the influence that the corpora used to generate such features have. We also demonstrate the promise of sequence-based unsupervised features for further improving concept extraction.

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/90401/

Relação

http://eprints.qut.edu.au/90401/1/ALTA_2015_paper_15.pdf

De Vine, Lance, Kholghi, Mahnoosh, Zuccon, Guido, Sitbon, Laurianne, & Nguyen, Anthony (2015) Analysis of word embeddings and sequence features for clinical information extraction. In 13th Annual Workshop of the Australasian Language Technology Association, 8 - 9 December 2015, University of Western Sydney, Parramatta, NSW.

Direitos

Copyright 2015 [please consult the authors]

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty; School of Information Systems

Palavras-Chave #080000 INFORMATION AND COMPUTING SCIENCES
Tipo

Conference Paper