Discovering linear causal model from incomplete data


Autoria(s): Li, Gang; Dai, Honghua; Tu, Y.
Contribuinte(s)

Dhompongsa, Sompong

Data(s)

01/01/2003

Resumo

One common drawback in algorithms for learning Linear Causal Models is that they can not deal with incomplete data set. This is unfortunate since many real problems involve missing data or even hidden variable. In this paper, based on multiple imputation, we propose a three-step process to learn linear causal models from incomplete data set. Experimental results indicate that this algorithm is better than the single imputation method (EM algorithm) and the simple list deletion method, and for lower missing rate, this algorithm can even find models better than the results from the greedy learning algorithm MLGS working in a complete data set. In addition, the method is amenable to parallel or distributed processing, which is an important characteristic for data mining in large data sets.<br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30005210

Idioma(s)

eng

Publicador

Chiang Mai University, Institute for Science and Technology Research and Development

Relação

http://dro.deakin.edu.au/eserv/DU:30005210/dai-discoveringlinear-2003.pdf

http://www.ist.cmu.ac.th/intech/paper/InTech0346.pdf

Direitos

2003, InTech

Tipo

Conference Paper