Language Resources – a Part of World Cultural Heritage


Autoria(s): Dimitrova, Ludmila
Data(s)

15/11/2013

15/11/2013

2011

Resumo

This article briefly reviews multilingual language resources for Bulgarian, developed in the frame of some international projects: the first-ever annotated Bulgarian MTE digital lexical resources, Bulgarian-Polish corpus, Bulgarian-Slovak parallel and aligned corpus, and Bulgarian-Polish-Lithuanian corpus. These resources are valuable multilingual dataset for language engineering research and development for Bulgarian language. The multilingual corpora are large repositories of language data with an important role in preserving and supporting the world's cultural heritage, because the natural language is an outstanding part of the human cultural values and collective memory, and a bridge between cultures.

*1 EU COP Project 106 MULTEXT-East Multilingual Text Tools and Corpora for Central and Eastern European Languages *2 Semantics and Contrastive Linguistics with a Focus on a Bilingual Electronic Dictionary

Identificador

Digital Presentation and Preservation of Cultural and Scientific Heritage, Vol. 1, No 1, (2011), 151p-160p

1314-4006

http://hdl.handle.net/10525/2029

Idioma(s)

en

Publicador

Institute of Mathematics and Informatics Bulgarian Academy of Sciences

Palavras-Chave #Natural Language #Multilingual Corpus #Parallel Corpus #Aligned Corpus #Comparable Corpus #Annotation
Tipo

Article