Automated cross-lingual link discovery in Wikipedia


Autoria(s): Tang, Ling-Xiang; Cavanagh, Daniel; Trotman, Andrew; Geva, Shlomo; Xu, Yue
Contribuinte(s)

Kando, Noriko

Ishikawa, Daisuke

Sugimoto, Miho

Data(s)

2011

Resumo

At NTCIR-9, we participated in the cross-lingual link discovery (Crosslink) task. In this paper we describe our approaches to discovering Chinese, Japanese, and Korean (CJK) cross-lingual links for English documents in Wikipedia. Our experimental results show that a link mining approach that mines the existing link structure for anchor probabilities and relies on the “translation” using cross-lingual document name triangulation performs very well. The evaluation shows encouraging results for our system.

Identificador

http://eprints.qut.edu.au/49125/

Publicador

National Institute of Informatics

Relação

http://research.nii.ac.jp/ntcir/workshop/OnlineProceedings9/NTCIR/11-NTCIR9-CROSSLINK-TangL.pdf

Tang, Ling-Xiang, Cavanagh, Daniel, Trotman, Andrew, Geva, Shlomo, & Xu, Yue (2011) Automated cross-lingual link discovery in Wikipedia. In Kando, Noriko, Ishikawa, Daisuke, & Sugimoto, Miho (Eds.) Proceedings of the NTCIR-9 Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, Question Answering and Cross-Lingual Information Access, National Institute of Informatics, National Center of Sciences, Tokyo, pp. 512-519.

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080107 Natural Language Processing #080306 Open Software #NTCIR #Crosslink #Wikipedia #Link probability #Page name matching #Transliteration
Tipo

Conference Paper