Visual data mining: integrating machine learning with information visualization


Autoria(s): Maniyar, Dharmesh M.; Nabney, Ian T
Contribuinte(s)

Zhang, Zhongfei

Masseglia, Florent

Jain, Ramesh

Del Bimbo, Alberto

Data(s)

20/08/2006

Resumo

Today, the data available to tackle many scientific challenges is vast in quantity and diverse in nature. The exploration of heterogeneous information spaces requires suitable mining algorithms as well as effective visual interfaces. Most existing systems concentrate either on mining algorithms or on visualization techniques. Though visual methods developed in information visualization have been helpful, for improved understanding of a complex large high-dimensional dataset, there is a need for an effective projection of such a dataset onto a lower-dimension (2D or 3D) manifold. This paper introduces a flexible visual data mining framework which combines advanced projection algorithms developed in the machine learning domain and visual techniques developed in the information visualization domain. The framework follows Shneiderman’s mantra to provide an effective user interface. The advantage of such an interface is that the user is directly involved in the data mining process. We integrate principled projection methods, such as Generative Topographic Mapping (GTM) and Hierarchical GTM (HGTM), with powerful visual techniques, such as magnification factors, directional curvatures, parallel coordinates, billboarding, and user interaction facilities, to provide an integrated visual data mining framework. Results on a real life high-dimensional dataset from the chemoinformatics domain are also reported and discussed. Projection results of GTM are analytically compared with the projection results from other traditional projection methods, and it is also shown that the HGTM algorithm provides additional value for large datasets. The computational complexity of these algorithms is discussed to demonstrate their suitability for the visual data mining framework.

Formato

application/pdf

Identificador

http://eprints.aston.ac.uk/9288/1/MDM_full.pdf

Maniyar, Dharmesh M. and Nabney, Ian T (2006). Visual data mining: integrating machine learning with information visualization. IN: Workshop on Multimedia Data Mining “Merging Multimedia and Data Mining Research”. Zhang, Zhongfei; Masseglia, Florent; Jain, Ramesh and Del Bimbo, Alberto (eds) ACM.

Publicador

ACM

Relação

http://eprints.aston.ac.uk/9288/

Tipo

Book Section

NonPeerReviewed