Studies on a syntax based approach for translation between structurally different languages and the development of a prototype for Malayalam to English translation


Autoria(s): Latha, R. Nair; Dr. David, Peter S; Dr.Sumam Mary,Idicula
Data(s)

22/05/2014

22/05/2014

01/03/2013

Resumo

This thesis summarizes the results on the studies on a syntax based approach for translation between Malayalam, one of Dravidian languages and English and also on the development of the major modules in building a prototype machine translation system from Malayalam to English. The development of the system is a pioneering effort in Malayalam language unattempted by previous researchers. The computational models chosen for the system is first of its kind for Malayalam language. An in depth study has been carried out in the design of the computational models and data structures needed for different modules: morphological analyzer , a parser, a syntactic structure transfer module and target language sentence generator required for the prototype system. The generation of list of part of speech tags, chunk tags and the hierarchical dependencies among the chunks required for the translation process also has been done. In the development process, the major goals are: (a) accuracy of translation (b) speed and (c) space. Accuracy-wise, smart tools for handling transfer grammar and translation standards including equivalent words, expressions, phrases and styles in the target language are to be developed. The grammar should be optimized with a view to obtaining a single correct parse and hence a single translated output. Speed-wise, innovative use of corpus analysis, efficient parsing algorithm, design of efficient Data Structure and run-time frequency-based rearrangement of the grammar which substantially reduces the parsing and generation time are required. The space requirement also has to be minimised

Department of Computer Science, Cochin University of Science and Technology

Cochin University of Science and Technology

Identificador

http://dyuthi.cusat.ac.in/purl/3808

Idioma(s)

en

Publicador

Cochin University of Science and Technology

Palavras-Chave #Direct Machine Translation #Rule Based Machine Translation #Corpus Based Machine Translation #,Language Morphology and Morphological Analysis
Tipo

Thesis