1 resultado para word boundaries
em Cochin University of Science
Filtro por publicador
- Repository Napier (1)
- Aberystwyth University Repository - Reino Unido (2)
- Academic Research Repository at Institute of Developing Economies (2)
- Acceda, el repositorio institucional de la Universidad de Las Palmas de Gran Canaria. España (2)
- Adam Mickiewicz University Repository (2)
- AMS Tesi di Dottorato - Alm@DL - Università di Bologna (1)
- AMS Tesi di Laurea - Alm@DL - Università di Bologna (4)
- Aquatic Commons (6)
- Archive of European Integration (3)
- Archivo Digital para la Docencia y la Investigación - Repositorio Institucional de la Universidad del País Vasco (2)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (4)
- Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP) (4)
- Biblioteca Digital de Teses e Dissertações Eletrônicas da UERJ (1)
- Blue Tiger Commons - Lincoln University - USA (2)
- BORIS: Bern Open Repository and Information System - Berna - Suiça (72)
- Brock University, Canada (9)
- Bucknell University Digital Commons - Pensilvania - USA (6)
- CaltechTHESIS (1)
- Cambridge University Engineering Department Publications Database (60)
- CentAUR: Central Archive University of Reading - UK (94)
- Central European University - Research Support Scheme (1)
- Chinese Academy of Sciences Institutional Repositories Grid Portal (21)
- Cochin University of Science & Technology (CUSAT), India (1)
- CORA - Cork Open Research Archive - University College Cork - Ireland (1)
- Dalarna University College Electronic Archive (2)
- Department of Computer Science E-Repository - King's College London, Strand, London (2)
- DI-fusion - The institutional repository of Université Libre de Bruxelles (1)
- Digital Archives@Colby (3)
- Digital Commons - Michigan Tech (3)
- Digital Commons - Montana Tech (1)
- Digital Commons @ DU | University of Denver Research (2)
- Digital Peer Publishing (1)
- DigitalCommons@The Texas Medical Center (2)
- DigitalCommons@University of Nebraska - Lincoln (6)
- Digitale Sammlungen - Goethe-Universität Frankfurt am Main (6)
- Diposit Digital de la UB - Universidade de Barcelona (2)
- Duke University (4)
- eResearch Archive - Queensland Department of Agriculture; Fisheries and Forestry (1)
- Greenwich Academic Literature Archive - UK (4)
- Helda - Digital Repository of University of Helsinki (17)
- Indian Institute of Science - Bangalore - Índia (33)
- Instituto Politécnico do Porto, Portugal (1)
- Massachusetts Institute of Technology (5)
- Memoria Académica - FaHCE, UNLP - Argentina (1)
- Ministerio de Cultura, Spain (26)
- National Center for Biotechnology Information - NCBI (7)
- Plymouth Marine Science Electronic Archive (PlyMSEA) (1)
- Publishing Network for Geoscientific & Environmental Data (77)
- QUB Research Portal - Research Directory and Institutional Repository for Queen's University Belfast (90)
- Queensland University of Technology - ePrints Archive (99)
- Repositório Científico do Instituto Politécnico de Lisboa - Portugal (2)
- Repositório digital da Fundação Getúlio Vargas - FGV (5)
- Repositório Digital da Universidade Municipal de São Caetano do Sul - USCS (1)
- Repositório Institucional da Universidade de Aveiro - Portugal (1)
- Repositório Institucional UNESP - Universidade Estadual Paulista "Julio de Mesquita Filho" (16)
- SAPIENTIA - Universidade do Algarve - Portugal (2)
- School of Medicine, Washington University, United States (19)
- The Scholarly Commons | School of Hotel Administration; Cornell University Research (1)
- Universidad Autónoma de Nuevo León, Mexico (3)
- Universidad de Alicante (4)
- Universidad del Rosario, Colombia (1)
- Universidad Politécnica de Madrid (4)
- Universidade Federal do Pará (1)
- Universidade Federal do Rio Grande do Norte (UFRN) (1)
- Universitätsbibliothek Kassel, Universität Kassel, Germany (3)
- Université de Lausanne, Switzerland (2)
- Université de Montréal, Canada (7)
- University of Connecticut - USA (1)
- University of Southampton, United Kingdom (78)
- WestminsterResearch - UK (1)
Resumo:
In Statistical Machine Translation from English to Malayalam, an unseen English sentence is translated into its equivalent Malayalam translation using statistical models like translation model, language model and a decoder. A parallel corpus of English-Malayalam is used in the training phase. Word to word alignments has to be set up among the sentence pairs of the source and target language before subjecting them for training. This paper is deals with the techniques which can be adopted for improving the alignment model of SMT. Incorporating the parts of speech information into the bilingual corpus has eliminated many of the insignificant alignments. Also identifying the name entities and cognates present in the sentence pairs has proved to be advantageous while setting up the alignments. Moreover, reduction of the unwanted alignments has brought in better training results. Experiments conducted on a sample corpus have generated reasonably good Malayalam translations and the results are verified with F measure, BLEU and WER evaluation metrics