Discovering user access pattern based on probabilistic latent factor model


Autoria(s): Xu, Guandong; Zhang, Yanchun; Ma, Jiangang; Zhou, Xiaofang
Contribuinte(s)

Hugh E. Williams

Gill Dobbie

Data(s)

01/01/2005

Resumo

There has been an increased demand for characterizing user access patterns using web mining techniques since the informative knowledge extracted from web server log files can not only offer benefits for web site structure improvement but also for better understanding of user navigational behavior. In this paper, we present a web usage mining method, which utilize web user usage and page linkage information to capture user access pattern based on Probabilistic Latent Semantic Analysis (PLSA) model. A specific probabilistic model analysis algorithm, EM algorithm, is applied to the integrated usage data to infer the latent semantic factors as well as generate user session clusters for revealing user access patterns. Experiments have been conducted on real world data set to validate the effectiveness of the proposed approach. The results have shown that the presented method is capable of characterizing the latent semantic factors and generating user profile in terms of weighted page vectors, which may reflect the common access interest exhibited by users among same session cluster.

Identificador

http://espace.library.uq.edu.au/view/UQ:103183

Idioma(s)

eng

Publicador

Australian Computer Society

Palavras-Chave #Web usage mining #Web linkage information #User profile #Probabilistic latent semantic model
Tipo

Conference Paper