949 resultados para corpus analysis


Relevância:

30.00% 30.00%

Publicador:

Resumo:

This research investigated the nasality of vowels in the spontaneous speech of inhabitants of the quilombola communities of Brejo dos Crioulos and Poções (MG). As a theoretical framework, we based on the assumptions of Phonetics and Phonology, in renowned scholars on the investigation of nasality (CAGLIARI, 1977; CÂMARA JR., 1984, 2013; BISOL, 2013; ABAURRE; PAGOTTO, 1996; SILVA, 2015), with subsidies of the Corpus Linguistics. Its general goal was to investigate the occurrence of nasality, in the dialect of these quilombola communities, and their linguistic behavior, considering the linguistic factors that can interfere in the phenomenon. Specifically it was aimed to a) detect the occurrence of nasalized vowels with the help of the resources that the Corpus Linguistics provides (Praat and WorldSmith Tolls); b) discriminate the different types of occurring contexts of nasalized vowels; c) make quantitative and qualitative analyzes of the nasalized vowels in the study corpus; d) describe and analyze the behavior of nasalized vowels and; e) contrast the values of F1 and F2 of the oral and nasalized vowels. It was hypothesized that the nasality happens because it is conditioned by the nasal segment following the nasalized vowel - phonological process of “assimilation” - its position as the primary stress and grammatical category. It was believed that the quilombolas communities of Brejo dos Crioulos and Poções produce nasalized vowels in their speech and this linguistic phenomenon is favored by the adjacent presence of consonants or nasal vowels. Furthermore, it was hypothesized that the values of F1 and F2 of oral and nasalized vowels in these communities are distinct. The following research questions were elaborated: (i) is the presence of nasalized vowels in the speech of these quilombola communities conditioned to the presence of a nasal sound segment? (ii) does the nasal sound segment following the nasalized vowel favor the occurrence of the nasality phenomenon? is there a difference between the values of F1 and F2 of the oral and nasalized vowels in both quilombola communities considered? To compose our corpus, 24 interviews recordings were used (12 female speakers and 12 male speakers), a total of 24 participants. It was found that the following nasal sound segment tends to condition the nasalized vowel. In general, it assimilates the lowering of the soft palate of nasal consonant segment immediately following, but there are cases of nasal vowel segment - regressive assimilation; the stressed syllable tends to favor the nasality, but it occurs in pretonic and postonic position as well; F1 and F2 values of oral and nasalized vowels in the quilombola communities of Poções and Brejo dos Crioulos are distinct: the group of Brejo dos Crioulos tends to produce the F1 of oral and nasalized vowels more lowered than the group of Poções and the F2, in a more anterior position. The nasality tends to occur in verbs and nouns, although it is not specific to a grammatical category. This research found cases of spurious nasalization, confirming previous studies. In turn, it revealed cases of lexical items with favorable context for nasalization, but with its non-occurrence. This last case, considered as the lowering of the uniform soft palate in PB, presented pronounced vowels without the soft palate lowering. That is, it was detected variation in the phenomenon of nasalization in PB. With this work, it was promoted the discussion about nasality, in order to contribute to the linguistic studies about the functioning of Brazilian Portuguese in this geographical context.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In this paper we compare the robustness of several types of stylistic markers to help discriminate authorship at sentence level. We train a SVM-based classifier using each set of features separately and perform sentence-level authorship analysis over corpus of editorials published in a Portuguese quality newspaper. Results show that features based on POS information, punctuation and word / sentence length contribute to a more robust sentence-level authorship analysis. © Springer-Verlag Berlin Heidelberg 2010.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Al evaluar los contactos de Plutarco con otras culturas contemporáneas, los investigadores todavía no han llegado a un consenso acerca de la relación entre el queronense y la literatura cristiano-primitiva. Un buen ejemplo de esto aparece al atender al motivo de la creación del alma humana. La intención de las próximas páginas es, tras un análisis de los textos plutarqueos, atender a estos posibles contactos con NHC, los heresiólogos y el Corpus Hermeticum a fin de dilucidar sus similitudes y diferencias.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Based on the concept of the triple basic structure of human communication by Poyatos (1994a, 1994b) and on the analytical and theoretical implications that derive from this, the present paper conceives the human communication as an indivisible whole in which verbal communication can not be separated from body behavior. This paper analyzes nonverbal categories used in oral communication. The corpus consists of an oral narration in Galician from which we highlighted certain kinemes (minimum units of body movement with meaning) by using the model proposed by Bouvet (2001), in order to explain the non-verbal categories with examples taken from said recordings.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

From ecological tourism to ecotourism: lexical analysis of an emerging tourism. This article deals with the lexicon created in connection with a recent form of tourism: the ecological tourism or ecotourism. The rise of this type of tourism encourages the creation of new concepts and products that are named with new words and expressions with different procedures of formation. From the name itself ecotourism, then expressed as the acronym ecotourism, we analyze the formation of other related words, as well as their formal variation and use. For this, we have worked with a specific corpus of electronic tourist texts and different digital sources and databases.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

With the development of information technology, the theory and methodology of complex network has been introduced to the language research, which transforms the system of language in a complex networks composed of nodes and edges for the quantitative analysis about the language structure. The development of dependency grammar provides theoretical support for the construction of a treebank corpus, making possible a statistic analysis of complex networks. This paper introduces the theory and methodology of the complex network and builds dependency syntactic networks based on the treebank of speeches from the EEE-4 oral test. According to the analysis of the overall characteristics of the networks, including the number of edges, the number of the nodes, the average degree, the average path length, the network centrality and the degree distribution, it aims to find in the networks potential difference and similarity between various grades of speaking performance. Through clustering analysis, this research intends to prove the network parameters’ discriminating feature and provide potential reference for scoring speaking performance.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Gun related violence is a complex issue and accounts for a large proportion of violent incidents. In the research reported in this paper, we set out to investigate the pro-gun and anti-gun sentiments expressed on a social media platform, namely Twitter, in response to the 2012 Sandy Hook Elementary School shooting in Connecticut, USA. Machine learning techniques are applied to classify a data corpus of over 700,000 tweets. The sentiments are captured using a public sentiment score that considers the volume of tweets as well as population. A web-based interactive tool is developed to visualise the sentiments and is available at this http://www.gunsontwitter.com. The key findings from this research are: (i) There are elevated rates of both pro-gun and anti-gun sentiments on the day of the shooting. Surprisingly, the pro-gun sentiment remains high for a number of days following the event but the anti-gun sentiment quickly falls to pre-event levels. (ii) There is a different public response from each state, with the highest pro-gun sentiment not coming from those with highest gun ownership levels but rather from California, Texas and New York.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis sets out to explore the place and agency of non-comital women in twelfth-century Anglo-Norman England. Until now, broad generalisations have been applied to all aristocratic women based on a long established scholarship on royal and comital women. Non-comital women have been overlooked, mainly because of an assumed lack of suitable sources from this time period. The first aim of this thesis is to demonstrate that there is a sufficient corpus of charters for a study of this social group of women. It is based on a database created from 5545 charters, of which 3046 were issued by non-comital women and men, taken from three case study counties, Oxfordshire, Suffolk and Yorkshire, and is also supported by other government records. This thesis demonstrates that non-comital women had significant social and economic agency in their own person. By means of a detailed analysis of charters and their clauses this thesis argues that scholarship on non-comital women must rethink the framework applied to the study of non-comital women to address the lifecycle as one of continuities and as active agents in a wider public society. Non-comital women’s agency and identity was not only based on land or in widowhood, which has been the one period in their life cycles where scholars have recognised some level of autonomy, and women had agency in all stages of their life cycle. Women’s agency and identity were drawn from and part of a wider framework that included their families, their kin, and broader local political, religious, and social networks. Natal families continued to be important sources of agency and identity to women long after they had married. Part A of the thesis applies modern charter diplomatic analysis methods to the corpus of charters to bring out and explore women’s presence therein. Part B contextualises these findings and explores women’s agency in their families, landholding, the gift-economy, and the wider religious and social networks of which they were a part.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This thesis investigates the standardisation of Modern Scottish Gaelic orthography from the mid-eighteenth century to the twenty-first. It presents the results of the first corpus-based analysis of Modern Scottish Gaelic orthographic development combined with an analytic approach that places orthographic choices in their sociolinguistic context. The theoretical framework behind the analysis centres on discussion of how the language ideologies of the phonographic ideal, historicism, autonomy, vernacularism and the ideology of the standard itself have shaped orthographic conventions and debates. It argues that current spelling norms reflect an orthography that is the result of compromise, historical factors and pragmatic function. The research uses a digital corpus to examine how three particular features have been used over time: the dialect variation between <eu> and <ia>; variation in s + stop consonant clusters (sd/st, sg/sc, sb/sp); and the use of the grave and acute accents. Evidence is drawn from the Corpas na Gàidhlig electronic corpus created at the University of Glasgow: the sub-corpus used in this study includes 117 published texts representing a period of over 250 years from 1750 to 2007, and a total size of over four and a quarter million words. The results confirm a key period of reform between 1750 and the early nineteenth century, and thereafter a settled norm being established in the early nineteenth century. Since then, some variation has been acceptable although changes and reform of some features have centred on increasing uniformity and regularisation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

In evaluating Plutarch’s contacts with other cultures of his era, scholars have not reached consensus so far regarding the relationship between the Chaironean and Early Christian writers. A good example of this lack of consensus rises when we come to the views of the creation of human soul. The aim of the following paper is to deal with those contacts by, after an analysis of Plutarch’s texts, taking into an account the sources of NHC, heresiologists, and also the contemporary Corpus Hermeticum in order to highlight their similitudes and/or differences about the motif of the soul’s birth.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

String searching within a large corpus of data is an important component of digital forensic (DF) analysis techniques such as file carving. The continuing increase in capacity of consumer storage devices requires corresponding im-provements to the performance of string searching techniques. As string search-ing is a trivially-parallelisable problem, GPGPU approaches are a natural fit – but previous studies have found that local storage presents an insurmountable performance bottleneck. We show that this need not be the case with modern hardware, and demonstrate substantial performance improvements from the use of single and multiple GPUs when searching for strings within a typical forensic disk image.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This article aimed to analyzed, from statistic results obtained after different rounds of the VARBRUL program, the overlapping factors in a joint analysis of the variables phonic projection, tonicity and verb tense in the study of the variation nós/a gente. The sample consisted of 24 interviews lasting 40-45 minutes each, made between 2007-2010 in Concordia – SC, and stratified according to gender, two age ranges (under 45 years and 50 years and over) and three levels of education (elementary school I, elementary school II and high school). By adopting the theoretical support of the Variation Linguistics we therefore sought to discuss methodological aspects related to overlapping factors of independent variables that are usually taken into account in the analysis of the pronominal variation nós/a gente in the subject position. Data were obtained through analysis of a corpus with 1553 occurrences of the surveyed pronouns: 770 of nós and 783 of a gente. Results showed that a joint analysis of the variables phonic projection, tonicity and verb tense significantly changes results, both with regard to the groups of selected factors and to the relative weight assigned to the analyzed variables. 

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This article intends to revisit the issue of national genesis of grammar, from the analysis of a corpus of five unpublished documents that was writing in the region of Diamantina (MG) in the second half of the eighteenth century. The data analyzed here according to the assumptions sociolinguistic endorse the hypothesis that destabilization of pronominal framework and the consequent weakening of the agreement of the Portuguese system were already established in this region in the late eighteenth century. From this result, we speculate about the socio-historical role of the Minas Gerais region in implementing linguistic changes determinants for the establishment of a national grammar.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Even nowadays, with the access to information through the internet or even by means of apps on the cellphone, the market of printed tourist guidebooks, as a way of advertising tourist destinations, continues to work. In a global scope, one of the most recognized publishing companies is Lonely Planet, with publications about a great number of destinations, in until 11 languages, commercialized in various parts of the world. One of its theme is Brazil. The titles Brazil (2013), in Portuguese, Brasil (2014) and in the Italian guidebook, Brasile (2014) were selected. The manuals were analyzed in a post-doctorate research entitled, “Analyzes of tourist guidebooks about Brazil in the light of the Translation Studies Based on Corpus”. However, the analyzed chapters are just the ones that make up the South of Brazil, that is, Paraná, Santa Catarina and Rio Grande do Sul. The aim was to analyze the cultural markers with the highest level of keyness, presented in them and its respective translations in the guidebooks in Portuguese and in Italian. Nevertheless, for this article just the chapters referring to Paraná were selected. The theoretical basis were the Corpus Linguistic (Berber Sardinha, 2004), the Translation Studies Based on Corpus (BAKER, 1993, 1995, 1996, 1999, 2000; CAMARGO, 2005, 2007a, 2007b), as well as the studies about cultural markers (AUBERT, 1981, 1995, 2006). It was noted that even on the source language corpus there were markers in Portuguese language, such as “Curitiba” and “Iguaçu”, whereas the toponyms are quite frequent in this textual genre.