Semi-supervised dimensionality reduction based on partial least squares for visual analysis of high dimensional data


Autoria(s): Paiva, José Gustavo de Souza; Schwartz, William Robson; Pedrini, Helio; Minghim, Rosane
Contribuinte(s)

UNIVERSIDADE DE SÃO PAULO

Data(s)

05/11/2013

05/11/2013

2012

Resumo

Dimensionality reduction is employed for visual data analysis as a way to obtaining reduced spaces for high dimensional data or to mapping data directly into 2D or 3D spaces. Although techniques have evolved to improve data segregation on reduced or visual spaces, they have limited capabilities for adjusting the results according to user's knowledge. In this paper, we propose a novel approach to handling both dimensionality reduction and visualization of high dimensional data, taking into account user's input. It employs Partial Least Squares (PLS), a statistical tool to perform retrieval of latent spaces focusing on the discriminability of the data. The method employs a training set for building a highly precise model that can then be applied to a much larger data set very effectively. The reduced data set can be exhibited using various existing visualization techniques. The training data is important to code user's knowledge into the loop. However, this work also devises a strategy for calculating PLS reduced spaces when no training data is available. The approach produces increasingly precise visual mappings as the user feeds back his or her knowledge and is capable of working with small and unbalanced training sets.

CNPq

FAPESP

Identificador

COMPUTER GRAPHICS FORUM, HOBOKEN, v. 31, n. 3, Part 4, pp. 1345-1354, JUN, 2012

0167-7055

http://www.producao.usp.br/handle/BDPI/41531

10.1111/j.1467-8659.2012.03126.x

http://dx.doi.org/10.1111/j.1467-8659.2012.03126.x

Idioma(s)

eng

Publicador

WILEY-BLACKWELL

HOBOKEN

Relação

COMPUTER GRAPHICS FORUM

Direitos

restrictedAccess

Copyright WILEY-BLACKWELL

Palavras-Chave #MULTIDIMENSIONAL PROJECTION #VARIABLE SELECTION #REGRESSION #CLASSIFICATION #SIMILARITY #COMPUTER SCIENCE, SOFTWARE ENGINEERING
Tipo

article

original article

publishedVersion