872 results for Comparable Corpus


Relevance:

20.00%

Abstract:

Postprint

Relevance:

20.00%

Abstract:

This research analyzes the stressed front mid vowels [ɛ] and [e] and back mid vowels [ɔ] and [o] in nominal forms and in verbal forms of the 1st person singular and the 3rd person singular and plural of the present tense, specifically the umlaut process by which the mid vowels /e/ and /o/ assimilate to /ɛ/ and /ɔ/ in stressed position. The general objective is to describe and quantify the occurrence of umlaut and then to analyze in which words it applies regularly. The specific objectives are: i) to compile and label an oral, spontaneous, synchronic, regional corpus from radio programs produced in the city of Ituiutaba, Minas Gerais; ii) to describe the characteristics of the compiled corpus; iii) to investigate the alternation in timbre of mid vowels in stressed position; iv) to identify instances of nominal and verbal umlaut of mid vowels in stressed position; v) to describe the identified cases of nominal and verbal umlaut; vi) to analyze the probable causes of the variation in mid vowels. As a theoretical and methodological basis, we adopted multi-representational models, Phonology of Use (BYBEE, 2001) and Exemplar Theory (PIERREHUMBERT, 2001), combined with the precepts of Corpus Linguistics (BERBER SARDINHA, 2004). The corpus consists of 16 radio programs, eight political and eight religious, from the city of Ituiutaba-MG, with recordings of about 20 to 40 minutes each. The results generated with WordSmith Tools® software, version 6.0 (SCOTT, 2012), show that the analyzed forms vary little, which indicates that umlaut is already lexicalized among the participants of the radio programs analyzed. We conclude that the results converge with the proposal of the Phonology of Use (BYBEE, 2001; PHILLIPS, 1984) that less frequent words lacking a phonetic environment conducive to change are changed first.

Relevance:

20.00%

Abstract:

This research investigated vowel nasality in the spontaneous speech of inhabitants of the quilombola communities of Brejo dos Crioulos and Poções (MG). The theoretical framework draws on Phonetics and Phonology, in particular on established scholarship on nasality (CAGLIARI, 1977; CÂMARA JR., 1984, 2013; BISOL, 2013; ABAURRE; PAGOTTO, 1996; SILVA, 2015), with support from Corpus Linguistics. The general goal was to investigate the occurrence of nasality in the dialect of these quilombola communities and its linguistic behavior, considering the linguistic factors that can interfere with the phenomenon. The specific aims were to: a) detect nasalized vowels with the help of tools used in Corpus Linguistics (Praat and WordSmith Tools); b) distinguish the different contexts in which nasalized vowels occur; c) carry out quantitative and qualitative analyses of the nasalized vowels in the study corpus; d) describe and analyze the behavior of nasalized vowels; and e) contrast the F1 and F2 values of oral and nasalized vowels. It was hypothesized that nasality is conditioned by the nasal segment following the nasalized vowel (the phonological process of "assimilation"), by the vowel's position under primary stress, and by grammatical category. It was also assumed that the quilombola communities of Brejo dos Crioulos and Poções produce nasalized vowels in their speech and that this phenomenon is favored by adjacent nasal consonants or nasal vowels. Furthermore, it was hypothesized that the F1 and F2 values of oral and nasalized vowels in these communities are distinct. The following research questions were posed: (i) is the presence of nasalized vowels in the speech of these quilombola communities conditioned by the presence of a nasal segment? (ii) does the nasal segment following the nasalized vowel favor the occurrence of nasalization? (iii) is there a difference between the F1 and F2 values of oral and nasalized vowels in the two quilombola communities considered? The corpus comprised 24 interview recordings (12 female speakers and 12 male speakers), for a total of 24 participants. It was found that a following nasal segment tends to condition the nasalized vowel. In general, the vowel assimilates the soft-palate lowering of the immediately following nasal consonant, but there are also cases in which the following segment is a nasal vowel (regressive assimilation). The stressed syllable tends to favor nasality, but nasality also occurs in pretonic and post-tonic position. The F1 and F2 values of oral and nasalized vowels in the quilombola communities of Poções and Brejo dos Crioulos are distinct: the Brejo dos Crioulos group tends to produce oral and nasalized vowels with F1 more lowered than the Poções group and with F2 in a more anterior position. Nasality tends to occur in verbs and nouns, although it is not specific to a grammatical category. The research found cases of spurious nasalization, confirming previous studies. It also revealed lexical items with a context favorable to nasalization in which nasalization did not occur. In this last case, although soft-palate lowering is treated as uniform in Brazilian Portuguese (PB), the vowels were produced without soft-palate lowering. That is, variation was detected in the phenomenon of nasalization in PB. This work promotes the discussion of nasality in order to contribute to linguistic studies of how Brazilian Portuguese functions in this geographical context.

Relevance:

20.00%

Abstract:

Thesis digitized by the Direction des bibliothèques de l'Université de Montréal.

Relevance:

20.00%

Abstract:

Users seeking information may not find material relevant to their information need in a specific language. The information may be available in another language, but users who do not know that language will have difficulty accessing it. Because the retrieval process depends on translating the user query, obtaining the right translation raises many issues. For a language pair chosen by a user, the available resources, such as an incomplete dictionary or an inaccurate machine translation system, may be insufficient to map the query terms in one language to their equivalents in the other. A given query may also have multiple correct translations. Evidence from the underlying corpus may offer a clue for selecting a probable set of translations that could eventually yield better retrieval. In this paper, we present a cross-language information retrieval approach that uses corpus-driven query suggestion to effectively retrieve information in a language other than that of the user query. The idea is to use the corpus-based evidence of one language to improve the retrieval and re-ranking of news documents in the other language. We use the FIRE corpora (Tamil and English news collections) in our experiments and illustrate the effectiveness of the proposed approach.

Relevance:

20.00%

Abstract:

Thesis digitized by the Direction des bibliothèques de l'Université de Montréal.

Relevance:

20.00%

Abstract:

The dredges described in this report were taken on the PR II, CORPUS 4 Expedition in January 1969 by the USGS Woods Hole Coastal and Marine Science Center from the R/V Atlantic Twin. The dredge samples were recovered and are available at the USGS Woods Hole Coastal and Marine Science Center for sampling and study.

Relevance:

20.00%

Abstract:

In assessing Plutarch's contacts with other contemporary cultures, researchers have not yet reached a consensus on the relationship between the Chaeronean and early Christian literature. A good example of this arises when considering the motif of the creation of the human soul. The aim of the following pages is, after an analysis of the Plutarchan texts, to examine these possible contacts with the NHC, the heresiologists and the Corpus Hermeticum in order to elucidate their similarities and differences.

Relevance:

20.00%

Abstract:

The study of lexical combinations according to their degree of fixedness, and their classification into free combinations, collocations and idioms, has been carried out from a synchronic perspective. We raise the possibility of applying the guidelines for distinguishing these types of structures to diachronic materials. Specifically, we draw on the documents that make up the Corpus del Español del Reino de Granada (CORDEREGRA) to assess the materials of this historical-linguistic corpus and to test whether the synchronic criteria can be applied to the study of documents from other centuries.

Relevance:

20.00%

Abstract:

If one sets out to compile a dictionary of adjectives, whether monolingual or bilingual, the first task facing the lexicographer is to define what an adjective is, a question that has still not been satisfactorily resolved. German has a series of words that have traditionally been described as adjectives with an exclusively predicative function, whose status as adjectives is nevertheless questioned by some authors. This article seeks to determine whether these words can really appear only in predicative function, how they are described in dictionaries and grammars, and what their main Spanish equivalents are, in order to decide whether they should be included in a corpus intended for the compilation of a German-Spanish syntactic dictionary of adjectives.

Relevance:

20.00%

Abstract:

Phraseological units are complex structures that may be difficult to comprehend and transfer into other languages because of their idiomatic nature. The translator of English legal texts often comes across binomials, a type of phraseological unit characteristic of this specialized discourse. Based on a specialized comparable bilingual corpus composed of legal forms and agreements, this article identifies several occurrences of this phraseological structure and extracts the most frequent examples in English and Spanish. A contrastive analysis of the data obtained from the corpus helps to establish a series of equivalences among binomials in both languages and proposes a typology of equivalences for these phraseological structures.

Relevance:

20.00%

Abstract:

This paper introduces a novel, in-depth approach to analyzing the differences in writing style between two famous Romanian orators, based on automated textual complexity indices for the Romanian language. The authors considered are: (a) Mihai Eminescu, Romania’s national poet and a remarkable journalist of his time, and (b) Ion C. Brătianu, one of the most important Romanian politicians of the middle of the 19th century. Both orators share a journalistic interest, namely their desire to spread the word about political issues in Romania via the printing press, the most important public voice at that time. In addition, both authors exhibit particularities of writing style, and our aim is to explore these differences through our ReaderBench framework, which computes a wide range of lexical and semantic textual complexity indices for Romanian and other languages. The corpus contains two collections of speeches for each orator that cover the period 1857–1880. The results of this study highlight the lexical and cohesive textual complexity indices that reflect the differences in writing style very well, measures that rely on Latent Semantic Analysis (LSA) and Latent Dirichlet Allocation (LDA) semantic models.

Relevance:

20.00%

Abstract:

This study is a corpus-based comparison of student essays written in the subject areas of English linguistics and literature at undergraduate level. The corpus comprises 200 Bachelor's degree theses submitted at a variety of university departments (such as English, Language and Literature, Humanities, Social and Intercultural Studies) in Sweden. The comparison concerns the frequencies of core modal verbs and how often they occur together with the subject pronouns I, we and it and in the structures this/the [essay, study, project, thesis] when students attempt to communicate their personal claims. Quantitative and qualitative analyses of the essays show few similarities in the ways that core modal verbs appear in the two disciplines. The results indicate mainly distinct differences, especially in relation to clusters and variation of performative verbs. Specific patterns in the ways that students use core modal verbs as hedges have also been identified.

Relevance:

20.00%

Abstract:

Nowadays, American television series are an essential part of popular culture, to the point that several audiovisual translations coexist within the French-speaking world. In addition to the dubbing that allows them to be broadcast on television, they may be subtitled up to three times, in chronological order: by fans on the Internet; in Quebec, for DVD sale in North America; and in France, for DVD sale in Europe. Yet although these three sets of subtitles are subject to the same linguistic constraints (those of the French language) and technical constraints (small-screen viewing), they differ in their treatment of the original dialogue. We first establish the practices at work among professionals and amateurs. Then, analysis of the translations, together with a comparable corpus of French and Quebec television series, makes it possible to establish the linguistic norms (particularly with regard to variety) and cultural norms applied by the different translators and, secondarily, to define what lies behind the label "Canadian French". This thesis falls within the framework of descriptive and sociological studies. In it we describe the professional reality of audiovisual translators and the influence that fansubbers exert not only on professional practice but also on new methods for training the next generation of translators. Moreover, by studying several translations of the same work, we show that varieties of French cannot, on their own, justify the multiplication of subtitling options, given the low rate of purely linguistic differences.