3 results for romance languages and literature
at Cochin University of Science
Abstract:
This thesis summarizes the results of studies on a syntax-based approach to translation between Malayalam, one of the Dravidian languages, and English, and on the development of the major modules of a prototype machine translation system from Malayalam to English. The development of the system is a pioneering effort for the Malayalam language, unattempted by previous researchers. The computational models chosen for the system are the first of their kind for Malayalam. An in-depth study has been carried out on the design of the computational models and data structures needed for the different modules of the prototype system: a morphological analyzer, a parser, a syntactic structure transfer module, and a target language sentence generator. The list of part-of-speech tags, the chunk tags, and the hierarchical dependencies among the chunks required for the translation process have also been generated. In the development process, the major goals are (a) accuracy of translation, (b) speed, and (c) space. For accuracy, smart tools are to be developed for handling the transfer grammar and translation standards, including equivalent words, expressions, phrases, and styles in the target language. The grammar should be optimized with a view to obtaining a single correct parse and hence a single translated output. For speed, the system requires innovative use of corpus analysis, an efficient parsing algorithm, efficient data structures, and run-time frequency-based rearrangement of the grammar, which substantially reduces parsing and generation time. The space requirement also has to be minimised.
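The run-time frequency-based rearrangement of the grammar mentioned above can be pictured with a minimal Python sketch. It is an illustration only, not the thesis's implementation: the class `RuleTable`, its methods, and the sample rules are all hypothetical. The idea shown is simply that rules which fire more often are moved to the front, so a parser that tries rules in order reaches the likely ones first.

```python
from collections import Counter


class RuleTable:
    """Alternative rules for one nonterminal, kept in most-used-first order.

    Hypothetical helper for illustration; not taken from the thesis.
    """

    def __init__(self, rules):
        self.rules = list(rules)   # current trial order
        self.hits = Counter()      # how many times each rule has succeeded

    def record_use(self, rule):
        """Count a successful use of `rule` and re-rank the table."""
        self.hits[rule] += 1
        # list.sort is stable, so rules with equal counts keep their
        # current relative order.
        self.rules.sort(key=lambda r: -self.hits[r])

    def candidates(self):
        """Return the rules in the order a parser should try them."""
        return list(self.rules)
```

A parser built this way would call `record_use` each time a rule contributes to a successful parse and iterate over `candidates()` when expanding a nonterminal.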
Abstract:
Biodegradation is the chemical degradation of materials brought about by the action of naturally occurring microorganisms. It is a relatively rapid process under suitable conditions of moisture, temperature, and oxygen availability. The logic behind blending biopolymers such as starch with inert polymers like polyethylene is that if the biopolymer component is present in a sufficient amount, and if it is removed by microorganisms in the waste disposal environment, then the base inert plastic should slowly degrade and disappear. The present work focuses on the preparation of biodegradable and photodegradable blends based on low-density polyethylene (LDPE) incorporating small quantities of ionomers as compatibilizers. The thesis consists of eight chapters. The first chapter presents an introduction to the present research work and a literature survey. The materials used and the experimental procedures undertaken for the study are described in the second chapter. The preparation and characterization of LDPE-biopolymer (starch/dextrin) blends are described in the third chapter. The results of investigations on the effect of polyethylene-co-methacrylic acid ionomers on the compatibility of LDPE and starch are reported in chapter 4. Chapter 5 is divided into two parts. The first part deals with the effect of metal oxides on the photodegradation of LDPE. The second part describes the role of metal stearates in the photodegradation of LDPE. The results of investigations on the role of various metal oxides as pro-oxidants in the degradation of ionomer-compatibilized LDPE-starch blends are reported in chapter 6. Chapter 7 deals with the results of investigations on the role of various metal stearates as pro-oxidants in the degradation of ionomer-compatibilized LDPE-starch blends. The conclusions of the investigations are presented in the last chapter of the thesis.
Abstract:
This work presents a named-entity-based question answering system for the Malayalam language. Although a vast amount of information is available today in digital form, no effective mechanism exists to give people convenient access to it. Information retrieval and question answering systems are the two mechanisms now available for information access. Information retrieval systems typically return a long list of documents in response to a user's query, and the user must skim them to determine whether they contain an answer. A question answering system, by contrast, allows users to state their information need as a natural language question and receive the most appropriate answer as a word, a sentence, or a paragraph. This system is based on named entity tagging and question classification. Document tagging extracts useful information from the documents, which is then used in finding the answer to the question. Question classification extracts useful information from the question to determine the type of the question and the way in which it is to be answered. Various machine learning methods are used to tag the documents. A rule-based approach is used for question classification. Malayalam belongs to the Dravidian family of languages and is one of the four major languages of this family. It is one of the 22 Scheduled Languages of India, with official language status in the state of Kerala. It is spoken by 40 million people. Malayalam is a morphologically rich, agglutinative language with relatively free word order. Malayalam also has a productive morphology that allows the creation of complex words, which are often highly ambiguous. Document tagging tools such as a part-of-speech tagger, a phrase chunker, a named entity tagger, and a compound word splitter are developed as part of this research work. No such tools were available for the Malayalam language. 
Finite state transducers, higher-order conditional random fields, artificial immune system principles, and support vector machines are the techniques used in the design of these document preprocessing tools. This research work describes how named entities are used to represent the documents. Single-sentence questions are used to test the system. The overall precision and recall obtained are 88.5% and 85.9%, respectively. This work can be extended in several directions. The coverage of non-factoid questions can be increased, and the system can be extended to open-domain applications. Reference resolution and word sense disambiguation techniques are suggested as future enhancements.
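The rule-based question classification described in this abstract can be illustrated with a small Python sketch. This is an assumption-laden toy, not the system's actual rule set: the function `classify_question`, the `RULES` table, and the answer-type labels are hypothetical, and English cue words stand in for the Malayalam interrogatives a real system would have to match.

```python
import re

# Hypothetical rules: (pattern, expected answer type).
# English cue words are stand-ins for Malayalam interrogatives.
RULES = [
    (re.compile(r"^\s*who\b", re.IGNORECASE), "PERSON"),
    (re.compile(r"^\s*where\b", re.IGNORECASE), "LOCATION"),
    (re.compile(r"^\s*when\b", re.IGNORECASE), "DATE"),
    (re.compile(r"^\s*how\s+(many|much)\b", re.IGNORECASE), "NUMBER"),
]


def classify_question(question):
    """Return the expected answer type for a single-sentence question.

    Rules are tried in order; the first matching pattern wins.
    Questions that match no rule fall back to "OTHER".
    """
    for pattern, answer_type in RULES:
        if pattern.search(question):
            return answer_type
    return "OTHER"
```

In a pipeline of the kind the abstract outlines, the returned answer type would then be compared against the named-entity tags attached to the documents to narrow down candidate answers.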