English-Malayalam Cross-Lingual Information Retrieval – an Experience
Date(s) |
18/07/2014
18/07/2014
2008
|
---|---|
Abstract |
This paper describes an English-Malayalam Cross-Lingual Information Retrieval system. The system retrieves Malayalam documents in response to a query given in English or Malayalam, so monolingual information retrieval is also supported. Malayalam is one of the most prominent regional languages of the Indian subcontinent. It is spoken by more than 37 million people and is the native language of Kerala state in India. Since we had neither a full-fledged online bilingual dictionary nor any parallel corpora from which to build a statistical lexicon, we used a bilingual dictionary developed in house for translation. Other language-specific resources developed in house, such as a Malayalam stemmer and a Malayalam morphological root analyzer, were also used in this work. Cochin University of Science & Technology |
Identifier | |
Language(s) |
en |
Publisher |
IEEE |
Keywords | #Cross-Lingual Information Retrieval #Vector space model #Malayalam #Document ranking #Bilingual dictionary #Content based retrieval |
Type |
Article |