Utilizing hyperlink transitivity to improve web page clustering


Autoria(s): Hou, Jingyu; Zhang, Yanchun
Contribuinte(s)

Dieter-Schewe, Klaus

Data(s)

01/01/2003

Resumo

The rapid increase of web complexity and size makes web searched results far from satisfaction in many cases due to a huge amount of information returned by search engines. How to find intrinsic relationships among the web pages at a higher level to implement efficient web searched information management and retrieval is becoming a challenge problem. In this paper, we propose an approach to measure web page similarity. This approach takes hyperlink transitivity and page importance into consideration. From this new similarity measurement, an effective hierarchical web page clustering algorithm is proposed. The primary evaluations show the effectiveness of the new similarity measurement and the improvement of web page clustering. The proposed page similarity, as well as the matrix-based hyperlink analysis methods, could be applied to other web-based research areas..<br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30005062

Idioma(s)

eng

Publicador

Australian Computer Society

Relação

http://dro.deakin.edu.au/eserv/DU:30005062/hou-utilizinghyperlinktransitivity-2003.pdf

http://crpit.com/confpapers/CRPITV17Hou.pdf

Direitos

2003, Australian Computer Society

Palavras-Chave #World Wide Web #hyperlink analysis #web page similarity #web clustering
Tipo

Conference Paper