Biblioteca Digital

**Autoria(s):** Corduneanu, Adrian; Jaakkola, Tommi
Data(s)	08/10/2004 08/10/2004 08/11/2001
Resumo	An increasing number of parameter estimation tasks involve the use of at least two information sources, one complete but limited, the other abundant but incomplete. Standard algorithms such as EM (or em) used in this context are unfortunately not stable in the sense that they can lead to a dramatic loss of accuracy with the inclusion of incomplete observations. We provide a more controlled solution to this problem through differential equations that govern the evolution of locally optimal solutions (fixed points) as a function of the source weighting. This approach permits us to explicitly identify any critical (bifurcation) points leading to choices unsupported by the available complete data. The approach readily applies to any graphical model in O(n^3) time where n is the number of parameters. We use the naive Bayes model to illustrate these ideas and demonstrate the effectiveness of our approach in the context of text classification problems.
Formato	9 p. 1207127 bytes 733599 bytes application/postscript application/pdf
Identificador	AIM-2001-030 http://hdl.handle.net/1721.1/6679
Idioma(s)	en_US
Relação	AIM-2001-030
Palavras-Chave	#AI #semi-supervised learning #incomplete data #EM #stable estimation

Acesso ao item digital