Stabilized sparse ordinal regression for medical risk stratification


Autoria(s): Tran, Truyen; Phung, Dinh; Luo, Wei; Venkatesh, Svetha
Data(s)

01/06/2015

Resumo

The recent wide adoption of electronic medical records (EMRs) presents great opportunities and challenges for data mining. The EMR data are largely temporal, often noisy, irregular and high dimensional. This paper constructs a novel ordinal regression framework for predicting medical risk stratification from EMR. First, a conceptual view of EMR as a temporal image is constructed to extract a diverse set of features. Second, ordinal modeling is applied for predicting cumulative or progressive risk. The challenges are building a transparent predictive model that works with a large number of weakly predictive features, and at the same time, is stable against resampling variations. Our solution employs sparsity methods that are stabilized through domain-specific feature interaction networks. We introduces two indices that measure the model stability against data resampling. Feature networks are used to generate two multivariate Gaussian priors with sparse precision matrices (the Laplacian and Random Walk). We apply the framework on a large short-term suicide risk prediction problem and demonstrate that our methods outperform clinicians to a large margin, discover suicide risk factors that conform with mental health knowledge, and produce models with enhanced stability. © 2014 Springer-Verlag London.

Identificador

http://hdl.handle.net/10536/DRO/DU:30067690

Idioma(s)

eng

Publicador

Springer

Relação

http://dro.deakin.edu.au/eserv/DU:30067690/luo-stabilizedsparse-2014.pdf

http://dro.deakin.edu.au/eserv/DU:30067690/truyen-stabilizedsparse-inpress-2014.pdf

http://www.dx.doi.org/10.1007/s10115-014-0740-4

Direitos

2014, Springer

Palavras-Chave #electronic medical record #feature graph #medical risk stratification #sparse ordinal regression #stability
Tipo

Journal Article