Text document clustering with metric learning


Autoria(s): Wang, Jinlong; Wu, Shunyao; Vu, Huy Quan; Li, Gang
Contribuinte(s)

Chen, Hsin-Hsi

Efthimiadis, Efthimis N.

Savoy, Jacques

Crestani, Fabio

Marchand-Maillet, Stephane

Data(s)

01/01/2010

Resumo

One reason for semi-supervised clustering fail to deliver satisfactory performance in document clustering is that the transformed optimization problem could have many candidate solutions, but existing methods provide no mechanism to select a suitable one from all those candidates. This paper alleviates this problem by posing the same task as a soft-constrained optimization problem, and introduces the salient degree measure as an information guide to control the searching of an optimal solution. Experimental results show the effectiveness of the proposed method in the improvement of the performance, especially when the amount of priori domain knowledge is limited.<br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30034433

Idioma(s)

eng

Publicador

Association for Computing Machinery

Relação

http://dro.deakin.edu.au/eserv/DU:30034433/vu-sigirconference-2010.pdf

http://dro.deakin.edu.au/eserv/DU:30034433/vu-sigirpaperreview-2010.pdf

http://dro.deakin.edu.au/eserv/DU:30034433/vu-textdocument-2010.pdf

Direitos

2010, by the Association for Computing Machinery, Inc.

Palavras-Chave #document clustering #metric learning
Tipo

Conference Paper