408 resultados para 570104 Lingüística informatizada


Relevância:

10.00% 10.00%

Publicador:

Resumo:

This article describes the developmentof an Open Source shallow-transfer machine translation system from Czech to Polish in theApertium platform. It gives details ofthe methods and resources used in contructingthe system. Although the resulting system has quite a high error rate, it is still competitive with other systems.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Extensible Dependency Grammar (XDG; Debusmann, 2007) is a flexible, modular dependency grammarframework in which sentence analyses consist of multigraphs and processing takes the form of constraint satisfaction. This paper shows how XDGlends itself to grammar-driven machine translation and introduces the machinery necessary for synchronous XDG. Since the approach relies on a shared semantics, it resembles interlingua MT.It differs in that there are no separateanalysis and generation phases. Rather, translation consists of the simultaneousanalysis and generation of a single source-target sentence.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper proposes to enrich RBMTdictionaries with Named Entities(NEs) automatically acquired fromWikipedia. The method is appliedto the Apertium English-Spanishsystem and its performance comparedto that of Apertium with and withouthandtagged NEs. The system withautomatic NEs outperforms the onewithout NEs, while results vary whencompared to a system with handtaggedNEs (results are comparable forSpanish to English but slightly worstfor English to Spanish). Apart fromthat, adding automatic NEs contributesto decreasing the amount of unknownterms by more than 10%.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

There are a number of morphological analysers for Polish. Most of these, however, are non-free resources. What is more, different analysers employ different tagsets and tokenisation strategies. This situation calls for a simpleand universal framework to join different sources of morphological information, including the existing resources as well as user-provided dictionaries. We present such a configurable framework that allows to write simple configuration files that define tokenisation strategies and the behaviour of morphologicalanalysers, including simple tagset conversion.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper discusses the qualitativecomparative evaluation performed on theresults of two machine translation systemswith different approaches to the processing ofmulti-word units. It proposes a solution forovercoming the difficulties multi-word unitspresent to machine translation by adopting amethodology that combines the lexicongrammar approach with OpenLogos ontologyand semantico-syntactic rules. The paper alsodiscusses the importance of a qualitativeevaluation metrics to correctly evaluate theperformance of machine translation engineswith regards to multi-word units.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

We describe a series of experiments in which we start with English to French and English to Japanese versions of an Open Source rule-based speech translation system for a medical domain, and bootstrap correspondign statistical systems. Comparative evaluation reveals that the rule-based systems are still significantly better than the statistical ones, despite the fact that considerable effort has been invested in tuning both the recognition and translation components; also, a hybrid system only marginally improved recall at the cost of a los in precision. The result suggests that rule-based architectures may still be preferable to statistical ones for safety-critical speech translation tasks.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Softcatalà is a non-profit associationcreated more than 10 years ago to fightthe marginalisation of the Catalan languagein information and communicationtechnologies. It has led the localisationof many applications and thecreation of a website which allows itsusers to translate texts between Spanishand Catalan using an external closed-sourcetranslation engine. Recently,the closed-source translation back-endhas been replaced by a free/open-sourcesolution completely managed by Softcatalà: the Apertium machine translationplatform and the ScaleMT web serviceframework. Thanks to the opennessof the new solution, it is possibleto take advantage of the huge amount ofusers of the Softcatalà translation serviceto improve it, using a series ofmethods presented in this paper. In addition,a study of the translations requestedby the users has been carriedout, and it shows that the translationback-end change has not affected theusage patterns.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper presents an Italian to CatalanRBMT system automatically built bycombining the linguistic data of theexisting pairs Spanish-Catalan andSpanish-Italian. A lightweight manualpostprocessing is carried out in order tofix inconsistencies in the automaticallyderived dictionaries and to add very frequentwords that are missing accordingto a corpus analysis. The system isevaluated on the KDE4 corpus and outperformsGoogle Translate by approximatelyten absolute points in terms ofboth TER and GTM.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

El objetivo de este artículo introductorio es esbozar las características comunes que sustentan lamayoría de esas propuestas, para poder entender uno de los caminos más relevantes que ha seguidola didáctica de la lengua en la segunda mitad del siglo XX. A continuación mencionamos lascircunstancias socio-históricas en que emergieron dichas propuestas, así como los principioslingüísticos y pedagógicos en que se fundamentan y una descripción esquemática de su dinámica enel aula. Un apartado final apunta algunas reflexiones personales sobre las perspectivas de futuro. Lanecesaria brevedad del artículo obliga a sintetizar los distintos apartados y a remitir a unos pocosmanuales específicos de cada aspecto.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

En la comprensió i la producció del llenguatge intervenen processos automàtics i processos controlats. La diferència entre uns i altres està determinada pel grau d'implicació del sistema cognitiu en el processament, el qual es pot inferir, en certa mesura, examinant el nivell de consciència i voluntarietat presents en realitzar una tasca. L'objectiu d'aquest article ha estat definir aquests processos i posar-se en relació amb un marc teòric de referència. L'automatisme s'explica a partir de la hipòtesi de la modularitat, mentre que els processos controlats, per la seva estreta relació amb el sistema cognitiu, són interpretats des de diferents marcs teòrics que abasten des de la perspectiva pragmàtica a la teoria de la ment.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This paper calls for greater attention from researchers into the nature of humor translation as an interdisciplinary area that should be of interest to translation and humor studies. It includes a brief review of the complexity of translation and the problems posed by traditional approaches. The paper introduces a number of parameters that may be of assistance in developing joke typologies for translators or translation scholars. A model is presented for structuring joke-types according to binary branching. An attempt is then made to combine the model with ideas and concepts put forward in Attardo (2002). The result is a binary branch tree for the 6 Knowledge Resources and the hierarchical structure that Attardo claims they have. One important conclusion is that sameness, or similarity, may have little to do with funniness, and, if this is so, it is going to create a dilemma for translators wishing to achieve equivalent effect.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Investigación sobre el fenómeno del multilingüismo en el medio audiovisual. El estudio reúne las recientes investigaciones que se han hecho sobre la presencia de varias lenguas en un mismo producto audiovisual y las posibilidades de traducción de este fenómeno, tanto en el doblaje como en la subtitulación. El trabajo incluye, a modo de ejemplo, el estudio de la traducción oficial de tres películas multiligües: Vicky Cristina Barcelona, Un prophète y L'auberge espagnole.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Aquest document conté el text Presentació, una introducció al CD del Corpus Oral Dialectal (COD). El COD és un component del Corpus de Català Contemporani de la Universitat de Barcelona (CCCUB), un arxiu de corpus de llengua catalana oral contemporània que ha estat confegit pel grup de recerca Grup d'Estudi de la Variació (GEV) amb la finalitat de contribuir a l'estudi de la variació dialectal, social i funcional en la llengua catalana. Una selecció de materials del CCCUB ha estat dipositada al RECERCAT (Dipòsit de la Recerca de Catalunya, www.recercat.net), i també és accessible a través del web del CCCUB: http://www.ub.edu/cccub.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Aquest document conté el text Presentation_English, una introducció en anglès al CD del Corpus Oral Dialectal (COD). El COD és un component del Corpus de Català Contemporani de la Universitat de Barcelona (CCCUB), un arxiu de corpus de llengua catalana oral contemporània que ha estat confegit pel grup de recerca Grup d'Estudi de la Variació (GEV) amb la finalitat de contribuir a l'estudi de la variació dialectal, social i funcional en la llengua catalana. Aquest i altres materials del CCCUB són accessibles directament al Dipòsit UB o a través del web del CCCUB (http://www.ub.edu/cccub).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Aquest document conté el text Presentation_English, una introducció en anglès al CD del Corpus Oral Dialectal (COD). El COD és un component del Corpus de Català Contemporani de la Universitat de Barcelona (CCCUB), un arxiu de corpus de llengua catalana oral contemporània que ha estat confegit pel grup de recerca Grup d'Estudi de la Variació (GEV) amb la finalitat de contribuir a l'estudi de la variació dialectal, social i funcional en la llengua catalana. Aquest i altres materials del CCCUB són accessibles directament al Dipòsit UB o a través del web del CCCUB (http://www.ub.edu/cccub).