Integrating WordNet and Wiktionary with lemon


Author(s): McCrae, J.; Montiel-Ponsoda, Elena; Cimiano, Philipp
Date(s)

2012

Abstract

Nowadays, there is a significant quantity of linguistic data available on the Web. However, linguistic resources are often published in proprietary formats and, as such, can be difficult to interface with one another, so they end up confined in “data silos”. The creation of web standards for publishing data on the Web and projects to create Linked Data have led to interest in the creation of resources that can be published using Web principles. One of the most important aspects of “Lexical Linked Data” is the sharing of lexica and machine-readable dictionaries. It is for this reason that the lemon format has been proposed, which we briefly describe. We then consider two resources that seem ideal candidates for the Linked Data cloud, namely WordNet 3.0 and Wiktionary, a large document-based dictionary. We discuss the challenges of converting both resources to lemon and, in particular for Wiktionary, the challenge of processing the mark-up and handling inconsistencies and underspecification in the source material. Finally, we turn to the task of creating links between the two resources and present a novel algorithm for linking lexica as lexical Linked Data.

Format

application/pdf

Identifier

http://oa.upm.es/21448/

Language(s)

spa

Publisher

Facultad de Informática (UPM)

Relation

http://oa.upm.es/21448/1/Integrating_WordNet_and_Wiktionary_with_lemon.pdf

info:eu-repo/semantics/altIdentifier/doi/10.1007/978-3-642-28249-2_3

Rights

(c) Publisher/Author

info:eu-repo/semantics/openAccess

Source

Integrating WordNet and Wiktionary with lemon | In: Linked Data in Linguistics | pp. 25-34 | Springer Berlin Heidelberg | 2012

Keywords #Computer Science
Type

info:eu-repo/semantics/bookPart

Book Section

PeerReviewed