Alignment Model and Training Technique in SMT from English to Malayalam
Data(s) |
19/07/2014
19/07/2014
30/08/2010
|
---|---|
Resumo |
This paper investigates certain methods of training adopted in the Statistical Machine Translator (SMT) from English to Malayalam. In English Malayalam SMT, the word to word translation is determined by training the parallel corpus. Our primary goal is to improve the alignment model by reducing the number of possible alignments of all sentence pairs present in the bilingual corpus. Incorporating morphological information into the parallel corpus with the help of the parts of speech tagger has brought around better training results with improved accuracy Cochin University of Science and Technology |
Identificador | |
Idioma(s) |
en |
Palavras-Chave | #Alignment #Parallel Corpus #PoS Tagging #Malayalam #Statistical Machine Translation |
Tipo |
Article |