Alignment Model and Training Technique in SMT from English to Malayalam


Author(s): Santhosh Kumar, G; Sheena Kurian, K; Mary, Priya Sebastian
Date(s)

19/07/2014

30/08/2010

Abstract

This paper investigates certain training methods adopted in a Statistical Machine Translation (SMT) system from English to Malayalam. In English-Malayalam SMT, word-to-word translations are determined by training on the parallel corpus. Our primary goal is to improve the alignment model by reducing the number of possible alignments across all sentence pairs in the bilingual corpus. Incorporating morphological information into the parallel corpus with the help of a part-of-speech tagger has brought about better training results with improved accuracy.

Cochin University of Science and Technology

Identifier

http://dyuthi.cusat.ac.in/purl/4140

Language(s)

en

Keywords #Alignment #Parallel Corpus #PoS Tagging #Malayalam #Statistical Machine Translation
Type

Article