English-Malayalam Cross-Lingual Information Retrieval – an Experience


Autoria(s): Sumam, Mary Idicula; Nikesh, P L; David, Peter S
Data(s)

18/07/2014

18/07/2014

2008

Resumo

This paper describes about an English-Malayalam Cross-Lingual Information Retrieval system. The system retrieves Malayalam documents in response to query given in English or Malayalam. Thus monolingual information retrieval is also supported in this system. Malayalam is one of the most prominent regional languages of Indian subcontinent. It is spoken by more than 37 million people and is the native language of Kerala state in India. Since we neither had any full-fledged online bilingual dictionary nor any parallel corpora to build the statistical lexicon, we used a bilingual dictionary developed in house for translation. Other language specific resources like Malayalam stemmer, Malayalam morphological root analyzer etc developed in house were used in this work

Cochin University of Science & Technology

Identificador

http://dyuthi.cusat.ac.in/purl/4102

Idioma(s)

en

Publicador

IEEE

Palavras-Chave #Cross-Lingual Information Retrieval #Vector space model #Malayalam #Document ranking #Bilingual dictionary #Content based retrieval
Tipo

Article