Biblioteca Digital

23 resultados para alignment, corpora, translation technology, English as a Lingua Franca, academic course descriptions

em Cambridge University Engineering Department Publications Database

HMM word and phrase alignment for statistical machine translation

Relevância:

50.00% 50.00%

Publicador:

Veja mais

A weighted finite state transducer implementation of the alignment template model for statistical machine translation

Relevância:

50.00% 50.00%

Publicador:

Veja mais

Improving speech transcription for Mandarin-English translation

Relevância:

40.00% 40.00%

Publicador:

Veja mais

Improving speech transcription for Mandarin-english translation

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper describes the development of the CU-HTK Mandarin Speech-To-Text (STT) system and assesses its performance as part of a transcription-translation pipeline which converts broadcast Mandarin audio into English text. Recent improvements to the STT system are described and these give Character Error Rate (CER) gains of 14.3% absolute for a Broadcast Conversation (BC) task and 5.1% absolute for a Broadcast News (BN) task. The output of these STT systems is then post-processed, so that it consists of sentence-like segments, and translated into English text using a Statistical Machine Translation (SMT) system. The performance of the transcription-translation pipeline is evaluated using the Translation Edit Rate (TER) and BLEU metrics. It is shown that improving both the STT system and the post-STT segmentations can lower the TER scores by up to 5.3% absolute and increase the BLEU scores by up to 2.7% absolute. © 2007 IEEE.

Veja mais

Context-dependent alignment models for statistical machine translation

Relevância:

40.00% 40.00%

Publicador:

Veja mais

Improving speech transcription for Mandarin-English translation

Relevância:

40.00% 40.00%

Publicador:

Veja mais

The John-Hopkins University 2003 Chinese-English machine translation system

Relevância:

40.00% 40.00%

Publicador:

Veja mais

HMM word and phrase alignment for statistical machine translation

Relevância:

40.00% 40.00%

Publicador:

Veja mais

Segmentation and alignment of parralel text for statistical machine translation

Relevância:

40.00% 40.00%

Publicador:

Veja mais

Hierarchical Phrase-Based Translation Grammars Extracted from Alignment Posterior Probabilities

Relevância:

40.00% 40.00%

Publicador:

Veja mais

Improving statistical machine translation by classifying and generalizing inflected verb forms

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper introduces a rule-based classification of single-word and compound verbs into a statistical machine translation approach. By substituting verb forms by the lemma of their head verb, the data sparseness problem caused by highly-inflected languages can be successfully addressed. On the other hand, the information of seen verb forms can be used to generate new translations for unseen verb forms. Translation results for an English to Spanish task are reported, producing a significant performance improvement.

Veja mais

Phrase linguistic classification and generalization for improving statistical machine translation

Relevância:

40.00% 40.00%

Publicador:

Resumo:

In this paper a method to incorporate linguistic information regarding single-word and compound verbs is proposed, as a first step towards an SMT model based on linguistically-classified phrases. By substituting these verb structures by the base form of the head verb, we achieve a better statistical word alignment performance, and are able to better estimate the translation model and generalize to unseen verb forms during translation. Preliminary experiments for the English - Spanish language pair are performed, and future research lines are detailed. © 2005 Association for Computational Linguistics.

Veja mais