2 results for Table of Contents

in Cochin University of Science


Relevance:

80.00%

Publisher:

Abstract:

Statistical Machine Translation (SMT) is one of the potential applications in the field of Natural Language Processing. The translation process in SMT is carried out by acquiring translation rules automatically from parallel corpora. However, for many language pairs (e.g. Malayalam-English), such corpora are available only in very limited quantities. Therefore, for these language pairs a large portion of the phrases encountered at run-time will be unknown. This paper focuses on methods for handling such out-of-vocabulary (OOV) words in Malayalam that cannot be translated to English using conventional phrase-based statistical machine translation systems. The OOV words in the source sentence are pre-processed to obtain the root word and its suffix. Different inflected forms of the OOV root are generated, and a match for these word variants is looked up in the phrase translation table of the translation model. A vocabulary filter chooses the best among the translations of these word variants by finding the unigram count. A match for the OOV suffix is also looked up in the phrase entries, and the target translations are filtered out. The filtered phrases are then structured, and the SMT translation model is extended by adding the OOV word with its new phrase translations. The results of the manual evaluation show that the number of OOV words in the input is reduced considerably.
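The pipeline described in this abstract (root and suffix split, variant generation, phrase-table lookup, unigram filtering, model extension) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the suffix list, the toy phrase table, the unigram counts, the choice of the highest count as "best", the suffix-before-root ordering and all function names are hypothetical, and the romanized tokens are simplified placeholders rather than accurate Malayalam morphology.

```python
from collections import Counter

# Hypothetical toy resources; a real system would read these from the
# trained phrase table and a target-language corpus.
SUFFIXES = ["kal", "il", "ude"]
PHRASE_TABLE = {
    "veedu": ["house", "home"],
    "veedukal": ["houses"],
    "il": ["in"],
}
UNIGRAM_COUNTS = Counter({"house": 120, "home": 95, "houses": 40, "in": 900})


def split_root_suffix(word, suffixes=SUFFIXES):
    """Strip the longest matching suffix and return (root, suffix)."""
    for suf in sorted(suffixes, key=len, reverse=True):
        if word.endswith(suf) and len(word) > len(suf):
            return word[: -len(suf)], suf
    return word, ""


def generate_variants(root, suffixes=SUFFIXES):
    """Return the bare root plus the root combined with every known suffix."""
    return [root] + [root + suf for suf in suffixes]


def translate_oov(word, phrase_table=PHRASE_TABLE, counts=UNIGRAM_COUNTS):
    """Propose a translation for an OOV word, or None if no variant matches."""
    root, suffix = split_root_suffix(word)
    candidates = []
    for variant in generate_variants(root):
        candidates.extend(phrase_table.get(variant, []))
    if not candidates:
        return None
    # Assumption: the vocabulary filter keeps the candidate with the
    # highest target-side unigram count.
    best_root = max(candidates, key=lambda t: counts[t])
    suffix_options = phrase_table.get(suffix, [])
    best_suffix = max(suffix_options, key=lambda t: counts[t]) if suffix_options else ""
    # Assumption: "structuring" places the suffix translation before the root.
    return f"{best_suffix} {best_root}".strip()


def extend_phrase_table(word, phrase_table=PHRASE_TABLE):
    """Add the OOV word with its proposed translation to the phrase table."""
    translation = translate_oov(word, phrase_table)
    if translation is not None and word not in phrase_table:
        phrase_table[word] = [translation]
    return translation
```

Calling `extend_phrase_table("veeduil")` on this toy data adds a new entry for the unseen form, so that a later lookup of the same word would succeed. A dictionary stands in for the phrase table here purely for brevity.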

Relevance:

80.00%

Publisher:

Abstract:

This paper aims to describe recent developments in the services provided by Indian electronic thesis and dissertation (ETD) repositories. It seeks to explore the prospect of knowledge formation and diffusion in India and to discuss the potential of open access e-theses repositories for knowledge management. This study is based on a literature review and content analysis of Indian ETD repository websites. Institutional repositories and electronic thesis and dissertation projects in India were identified through a literature survey as well as internet searching and browsing. The study examines the tools, type of contents, coverage and aims of Indian ETD repositories. The paper acknowledges the need for knowledge management for national development. It highlights the significance of an integrated platform for preserving, searching and retrieving Indian theses. It describes the features and functions of Indian ETD repositories. The paper provides insights into the characteristics of the national repository of ETDs of India, which encourage and support open access to publicly funded research.