A Bayesian framework for learning shared and individual subspaces from multiple data sources


Autoria(s): Gupta, Sunil Kumar; Phung, Dinh; Adams, Brett; Venkatesh, Svetha
Contribuinte(s)

Huang, Joshua Zhexue

Cao, Longbing

Srivastava, Jaideep

Data(s)

01/01/2011

Resumo

This paper presents a novel Bayesian formulation to exploit shared structures across multiple data sources, constructing foundations for effective mining and retrieval across disparate domains. We jointly analyze diverse data sources using a unifying piece of metadata (textual tags). We propose a method based on Bayesian Probabilistic Matrix Factorization (BPMF) which is able to explicitly model the partial knowledge common to the datasets using shared subspaces and the knowledge specific to each dataset using individual subspaces. For the proposed model, we derive an efficient algorithm for learning the joint factorization based on Gibbs sampling. The effectiveness of the model is demonstrated by social media retrieval tasks across single and multiple media. The proposed solution is applicable to a wider context, providing a formal framework suitable for exploiting individual as well as mutual knowledge present across heterogeneous data sources of many kinds.

Identificador

http://hdl.handle.net/10536/DRO/DU:30044674

Idioma(s)

eng

Publicador

Springer-Verlag

Relação

http://dro.deakin.edu.au/eserv/DU:30044674/gupta-bayesianframework-2011.pdf

http://dro.deakin.edu.au/eserv/DU:30044674/gupta-bayesianframework-evidence-2011.pdf

http://dx.doi.org/10.1007/978-3-642-20841-6_12

Direitos

2011, Springer-Verlag Berlin Heidelberg

Palavras-Chave #Bayesian formulation #Bayesian frameworks #data sets #data source #efficient algorithm #formal framework #Gibbs sampling #heterogeneous data sources #matrix factorizations #multiple data sources #mutual knowledge #partial knowledge #social media
Tipo

Conference Paper