25 resultados para Cognates
Resumo:
The identification of cognates between two distinct languages has recently start- ed to attract the attention of NLP re- search, but there has been little research into using semantic evidence to detect cognates. The approach presented in this paper aims to detect English-French cog- nates within monolingual texts (texts that are not accompanied by aligned translat- ed equivalents), by integrating word shape similarity approaches with word sense disambiguation techniques in order to account for context. Our implementa- tion is based on BabelNet, a semantic network that incorporates a multilingual encyclopedic dictionary. Our approach is evaluated on two manually annotated da- tasets. The first one shows that across different types of natural text, our method can identify the cognates with an overall accuracy of 80%. The second one, con- sisting of control sentences with semi- cognates acting as either true cognates or false friends, shows that our method can identify 80% of semi-cognates acting as cognates but also identifies 75% of the semi-cognates acting as false friends.
Resumo:
This paper presents an overview of the 6th ALTA shared task that ran in 2015. The task was to identify in English texts all the potential cognates from the perspective of the French language. In other words, identify all the words in the English text that would acceptably translate into a similar word in French. We present the motivations for the task, the description of the data and the results of the 4 participating teams. We discuss the results against a baseline and prior work.
Resumo:
Motivated by the need to statically balance the inherent elastic forces in linkages, this paper presents three techniques to statically balance a four-bar linkage loaded by a zero-free-length spring attached between its coupler point and an anchor point on the ground. The number of auxiliary links and balancing springs required for the three techniques is less than or equal to that of the only technique currently in the literature. One of the three techniques does not require auxiliary links. In these techniques, the set of values for the spring constants and the ground-anchor point of the balancing springs can vary over a one-parameter family. Thrice as many balancing choices are available when the cognates are considered. The ensuing numerous options enable a user to choose the most practical solution. To facilitate the evaluation of the balancing choices for all the cognates, Roberts-Chebyshev cognate theorem is extended to statically balanced four-bar linkages. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
This article explores one aspect of the processing perspective in L2 learning in an EST context: the processing of new content words, in English, of the type ‘cognates’ and ‘false friends’, by Spanish speaking engineering students. The paper does not try to offer a comprehensive overview of language acquisition mechanisms, but rather it is intended to review more narrowly how our conceptual systems, governed by intricately linked networks of neural connections in the brain, make language development possible, creating, at the same time, some L2 processing problems. The case of ‘cognates and false friends’ in specialised contexts is brought here to illustrate some of the processing problems that the L2 learner has to confront, and how mappings in the visual, phonological and semantic (conceptual) brain structures function in second language processing of new vocabulary. Resumen Este artículo pretende reflexionar sobre un aspecto de la perspectiva del procesamiento de segundas lenguas (L2) en el contexto del ICT: el procesamiento de palabras nuevas, en inglés, conocidas como “cognados” y “falsos amigos”, por parte de estudiantes de ingeniería españoles. No se pretende ofrecer una visión completa de los mecanismos de adquisición del lenguaje, más bien se intenta mostrar cómo nuestro sistema conceptual, gobernado por una complicada red de conexiones neuronales en el cerebro, hace posible el desarrollo del lenguaje, aunque ello conlleve ciertas dificultades en el procesamiento de segundas lenguas. El caso de los “cognados” y los “falsos amigos”, en los lenguajes de especialidad, se trae para ilustrar algunos de los problemas de procesamiento que el estudiante de una lengua extranjera tiene que afrontar y el funcionamiento de las correspondencias entre las estructuras visuales, fonológicas y semánticas (conceptuales) del cerebro en el procesamiento de nuevo vocabulario.
Resumo:
This research constructed a readability measurement for French speakers who view English as a second language. It identified the true cognates, which are the similar words from these two languages, as an indicator of the difficulty of an English text for French people. A multilingual lexical resource is used to detect true cognates in text, and Statistical Language Modelling to predict the predict the readability level. The proposed enhanced statistical language model is making a step in the right direction by improving the accuracy of readability predictions for French speakers by up to 10% compared to state of the art approaches. The outcome of this study could accelerate the learning process for French speakers who are studying English. More importantly, this study also benefits the readability estimation research community, presenting an approach and evaluation at sentence level as well as innovating with the use of cognates as a new text feature.
Resumo:
The cell cycle phase at starvation influences post-starvation differentiation and morphogenesis in Dictyostelium discoideum. We found that when expressed in Saccharomyces cerevisiae, a D. discoideum cDNA that encodes the ribosomal protein S4 (DdS4) rescues mutations in the cell cycle genes cdc24, cdc42 and bem1. The products of these genes affect morphogenesis in yeast via a coordinated moulding of the cytoskeleton during bud site selection. D. discoideum cells that over-or under-expressed DdS4 did not show detectable changes in protein synthesis but displayed similar developmental aberrations whose intensity was graded with the extent of over-or under-expression. This suggested that DdS4 might influence morphogenesis via a stoichiometric effect - specifically, by taking part in a multimeric complex similar to the one involving Cdc24p, Cdc42p and Bem1p in yeast. In support of the hypothesis, the S. cerevisiae proteins Cdc24p, Cdc42p and Bem1p as well as their D. discoideum cognates could be co-precipitated with antibodies to DdS4. Computational analysis and mutational studies explained these findings: a C-terminal domain of DdS4 is the functional equivalent of an SH3 domain in the yeast scaffold protein Bem1p that is central to constructing the bud site selection complex. Thus in addition to being part of the ribosome, DdS4 has a second function, also as part of a multi-protein complex. We speculate that the existence of the second role can act as a safeguard against perturbations to ribosome function caused by spontaneous variations in DdS4 levels.
Resumo:
Homenaje a Ignacio Barandiarán Maestu / coord. por Javier Fernández Eraso, Juan Santos Yanguas.
Resumo:
Lancelets (amphioxus), although showing the most similar anatomical features to vertebrates, never develop a vertebrate-like head but rather several structures specific to this animal. The lancelet anatomical specificity seems to be traceable to early developmental stages, such as the vertebrate dorsal and anterior-posterior determinations. The BMP and Wnt proteins play important roles in establishing the early basis of the dorsal structures and the head in vertebrates. The early behavior of BMP and Wnt may be also related to the specific body structures of lancelets. The expression patterns of a dpp-related gene, Bbbmp2/4, and two wnt-related genes, Bbwnt7 and Bbwnt8, have been studied in comparison with those of brachyury and Hnf-3 beta class genes The temporal expression patterns of these genes are similar to those of vertebrates; Bbbmp2/4 and Bbwnt8 are first expressed in the invaginating primitive gut and the equatorial region. respectively, at the initial gastrula stage. However, spatial expression pattern of Bbbmp2/4 differs significantly from the vertebrate cognates. It is expressed in the mid-dorsal inner layer of gastrulae and widely in the anterior region, in which vertebrates block BMP signaling, The present study suggests that the lancelet embryo may have two distinct developmental domains from the gastrula stage, the domains of which coincide later with the lateral diverticular and the somitocoelomic regions. The embryonic origin of the anterior-specific structures in lancelets corresponds to the anterior domain where Bbbmp2/4 is continuously expressed.
Resumo:
This article deals with one of the most common elements in names of Irish hills and mountains. The grammar, phonology, etymology, semantic range and chronology of the element are examined. Sliabh is particularly complex in terms of its semantic range, which includes the following senses: 1) a mountain or hill (standing alone or forming part of a range); 2) a range of hills or mountains; 3) an moor or area of upland. The word is present in the earliest attested stages of the Irish language, and there is some evidence for all three meanings in Old Irish, though senses 1 and 2 are best attested. It is suggested that the view advanced by MacBain and Thurneysen that sliabh is etymologically related to Eng. slope and that this reflects its original meaning is open to some doubt in view of the lack of evidence for this sense in early Irish and the lack of clear cognates in other branches of Celtic and Indo-European languages.
Resumo:
Le but de cette thèse est d'étudier les corrélats comportementaux et neuronaux du transfert inter-linguistique (TIL) dans l'apprentissage d’une langue seconde (L2). Compte tenu de nos connaissances sur l'influence de la distance linguistique sur le TIL (Paradis, 1987, 2004; Odlin, 1989, 2004, 2005; Gollan, 2005; Ringbom, 2007), nous avons examiné l'effet de facilitation de la similarité phonologique à l’aide de la résonance magnétique fonctionnelle entre des langues linguistiquement proches (espagnol-français) et des langues linguistiquement éloignées (persan-français). L'étude I rapporte les résultats obtenus pour des langues linguistiquement proches (espagnol-français), alors que l'étude II porte sur des langues linguistiquement éloignées (persan-français). Puis, les changements de connectivité fonctionnelle dans le réseau langagier (Price, 2010) et dans le réseau de contrôle supplémentaire impliqué dans le traitement d’une langue seconde (Abutalebi & Green, 2007) lors de l’apprentissage d’une langue linguistiquement éloignée (persan-français) sont rapportés dans l’étude III. Les résultats des analyses d’IRMF suivant le modèle linéaire général chez les bilingues de langues linguistiquement proches (français-espagnol) montrent que le traitement des mots phonologiquement similaires dans les deux langues (cognates et clangs) compte sur un réseau neuronal partagé par la langue maternelle (L1) et la L2, tandis que le traitement des mots phonologiquement éloignés (non-clang-non-cognates) active des structures impliquées dans le traitement de la mémoire de travail et d'attention. Toutefois, chez les personnes bilingues de L1-L2 linguistiquement éloignées (français-persan), même les mots phonologiquement similaires à travers les langues (cognates et clangs) activent des régions connues pour être impliquées dans l'attention et le contrôle cognitif. Par ailleurs, les mots phonologiquement éloignés (non-clang-non-cognates) activent des régions usuellement associées à la mémoire de travail et aux fonctions exécutives. Ainsi, le facteur de distance inter-linguistique entre L1 et L2 module la charge cognitive sur la base du degré de similarité phonologiques entres les items en L1 et L2. Des structures soutenant les processus impliqués dans le traitement exécutif sont recrutées afin de compenser pour des demandes cognitives. Lorsque la compétence linguistique en L2 augmente et que les tâches linguistiques exigent ainsi moins d’effort, la demande pour les ressources cognitives diminue. Tel que déjà rapporté (Majerus, et al, 2008; Prat, et al, 2007; Veroude, et al, 2010; Dodel, et al, 2005; Coynel, et al ., 2009), les résultats des analyses de connectivité fonctionnelle montrent qu’après l’entraînement la valeur d'intégration (connectivité fonctionnelle) diminue puisqu’il y a moins de circulation du flux d'information. Les résultats de cette recherche contribuent à une meilleure compréhension des aspects neurocognitifs et de plasticité cérébrale du TIL ainsi que l'impact de la distance linguistique dans l'apprentissage des langues. Ces résultats ont des implications dans les stratégies d'apprentissage d’une L2, les méthodes d’enseignement d’une L2 ainsi que le développement d'approches thérapeutiques chez des patients bilingues qui souffrent de troubles langagiers.
Resumo:
In Statistical Machine Translation from English to Malayalam, an unseen English sentence is translated into its equivalent Malayalam sentence using statistical models. A parallel corpus of English-Malayalam is used in the training phase. Word to word alignments has to be set among the sentence pairs of the source and target language before subjecting them for training. This paper deals with certain techniques which can be adopted for improving the alignment model of SMT. Methods to incorporate the parts of speech information into the bilingual corpus has resulted in eliminating many of the insignificant alignments. Also identifying the name entities and cognates present in the sentence pairs has proved to be advantageous while setting up the alignments. Presence of Malayalam words with predictable translations has also contributed in reducing the insignificant alignments. Moreover, reduction of the unwanted alignments has brought in better training results. Experiments conducted on a sample corpus have generated reasonably good Malayalam translations and the results are verified with F measure, BLEU and WER evaluation metrics.
Resumo:
In Statistical Machine Translation from English to Malayalam, an unseen English sentence is translated into its equivalent Malayalam translation using statistical models like translation model, language model and a decoder. A parallel corpus of English-Malayalam is used in the training phase. Word to word alignments has to be set up among the sentence pairs of the source and target language before subjecting them for training. This paper is deals with the techniques which can be adopted for improving the alignment model of SMT. Incorporating the parts of speech information into the bilingual corpus has eliminated many of the insignificant alignments. Also identifying the name entities and cognates present in the sentence pairs has proved to be advantageous while setting up the alignments. Moreover, reduction of the unwanted alignments has brought in better training results. Experiments conducted on a sample corpus have generated reasonably good Malayalam translations and the results are verified with F measure, BLEU and WER evaluation metrics
Resumo:
Este estudo faz considerações sobre a investigação histórico-comparativa da língua tapajúna (Jê), mais especificamente sobre as consoantes em seu aspecto fonético-fonológico, comparando cognatos dessa língua e da língua suyá (também da família Jê) com o objetivo de verificar suas semelhanças e diferenças no contexto das consoantes, tendo em vista contribuir para a reconstrução do tronco linguístico Macro-Jê. Para o estudo aqui desenvolvido, foi utilizado o método comparativo por meio do qual são verificadas as correspondências existentes entre as línguas envolvidas em um estudo dessa natureza, com o fim de observar como seria possivelmente constituída a forma original, que teria originado as formas das duas línguas. Os dados de tapajúna e de suyá também foram comparados aos dados do Proto-Jê, reconstruído por Davis (1966). De um modo geral, observa-se que há poucas diferenças entre tapajúna e suyá. No caso das variações em contextos fonético-fonológicos idênticos, há a questão da variação de faixa etária, como a ocorrência das consoantes pré-nasalizadas na fala dos mais jovens do povo tapajúna, enquanto os mais velhos usam as nasais.
Resumo:
The translation project of some short stories of the book Piccoli equivoci senza importanza, written by the Italian writer Antonio Tabucchi, was performed during the academic years. This thesis target is to analyze grammatical and stylistic issues of the short stories; to achieve this goal, parts of the original texts and their respective translations were used. The grammar section approaches important aspects as idioms, false cognates and different verb regencies between Portuguese and Italian. The stylistic part focuses on Tabucchi´s choices regarding the narrator, focalization, types of speech and interior monologues in his texts; these options characterize the Italian writer´s style and make his stories plots intense. As a conclusion it can be noticed that this work is relevant to give a forward step in the Italian literature studies field, because it contributes to an important author style study and provides the analysis of which choices compound the text and the effects they cause
Resumo:
In this article we address the search for equivalents in the elaboration of a model dictionary for sworn translators of terms used in bylaws, in the Portuguese-French translation direction. We present, specifically, some cases in which the morphological similarity between the two languages can lead to errors, for example false cognates.