969 resultados para linguistic corpora
Resumo:
International audience
Resumo:
Annotation Pro - a description of techniques, methods implemented in the tool, as well as the list of all built in functionalities and features of the user interface, and usage tips.
Resumo:
350 p.
Resumo:
Most second language researchers agree that there is a role for corrective feedback in second language writing classes. However, many unanswered questions remain concerning which linguistic features to target and the type and amount of feedback to offer. This study examined two new pieces of writing by 151 learners of English as a Second Language (ESL), in order to investigate the effect of direct and metalinguistic written feedback on errors with the simple past tense, the present perfect tense, dropped pronouns, and pronominal duplication. This inquiry also considered the extent to which learner differences in language-analytic ability (LAA), as measured by the LLAMA F, mediated the effects of these two types of explicit written corrective feedback. Learners in the feedback groups were provided with corrective feedback on two essays, after which learners in all three groups completed two additional writing tasks to determine whether or not the provision of corrective feedback led to greater gains in accuracy compared to no feedback. Both treatment groups, direct and metalinguistic, performed better than the comparison group on new pieces of writing immediately following the treatment sessions, yet direct feedback was more durable than metalinguistic feedback for one structure, the simple past tense. Participants with greater LAA proved more likely to achieve gains in the direct feedback group than in the metalinguistic group, whereas learners with lower LAA benefited more from metalinguistic feedback. Overall, the findings of the present study confirm the results of prior studies that have found a positive role for written corrective feedback in instructed second language acquisition.
Resumo:
Visualisation provides good support for software analysis. It copes with the intangible nature of software by providing concrete representations of it. By reducing the complexity of software, visualisations are especially useful when dealing with large amounts of code. One domain that usually deals with large amounts of source code data is empirical analysis. Although there are many tools for analysis and visualisation, they do not cope well software corpora. In this paper we present Explora, an infrastructure that is specifically targeted at visualising corpora. We report on early results when conducting a sample analysis on Smalltalk and Java corpora.
Resumo:
Many works deal with the study of Greek epigrams. These texts gather precious historical, religious and epigraphic content whose analysis has been addressed from a literary point of view. The degree of dependence of Hellenistic epigrams of earlier period epigrams are of great interest, as shown by recent and numerous papers on this matter (Harder et al . 1998; 2002; 2006; 2012; Bing and Bruss 2007; Baumbach et al. 2010). Another major line of research is focused on the origin of epigrams and their relation to elegy (Gentilli 1968; Passa 1998b) or to hexametric poetry and the oral-formulaic language inherited from Homeric tradition (di Tillio 1969; Moranti 1971, 1972; Gentili and Giannini 1977). Furthermore, other authors, such as Day (2010), have pointed out the significance of the performative and ritual nature of epigrams. Likewise, this field of study has undergone a renewal because of new papyrological findings enriching the existing epigram collection corpora. Compared to all these works, monographs and studies dedicated to the analysis of the language of the verse inscriptions are fewer. Many of them explore the linguistic differences between literary epigrams and those epigrams preserved by epigraphic means, as well as the degree of intervention of later tradition on such texts (Tiberi 1996, del Barrio Vega 2008; Kaczko 2009). The first exclusively linguistic reviews were published by the end of the 19th century (Wagner 1883; Fengler 1892), however, they are descriptive analyses lacking from an independent methodology. Kock(1910) was the first researcher who systematized and suggested a linguistic hypothesis and upheld the use of epichoric dialects by poets. His theory was supported by adepts such as Kretschmer (1913; 1915) although some discordant voices appeared soon, such as Buck (1923) who denied the existence of a linguistic standard and endorsed the importance of the Ionic model over the epichoric one. Traditionally, Greek language manuals point out the significance of the Ionic model and accept the adaptation of Homeric language to epichoric dialect. The study of verse inscription language was not systematically resumed until Mickey's publications (1981a; 1981b). According to this researcher, epigraphic poetry consists of a tempered version of the epichoric dialect where the dialect-characteristic features are avoided. Following the same line but with some differences, Alonso Déniz and Nieto Izquierdo (2009) conclude that the most distinctive features of Argolic are not avoided, at least at the metrical inscriptions from Argolid...
Resumo:
Dissertação de Mestrado, Ciências da Linguagem, Faculdade de Ciências Humanas e Sociais, Universidade do Algarve, 2014
Resumo:
Dissertação de Mestrado, Ciências da Linguagem, Faculdade de Ciências Humanas e Sociais, Universidade do Algarve, 2016
Resumo:
This thesis investigates how the strong verb system inherited from Old English evolved in the regional dialects of Middle English (ca. 1100-1500). Old English texts preserve a relatively complex system of strong verbs, in which traditionally seven different ablaut classes are distinguished. This system becomes seriously disrupted from the Late Old English and Early Middle English periods onwards. As a result, many strong verbs die out, or have their ablaut patterns affected by sound change and morphological analogy, or transfer to the weak conjugation. In my thesis, I study the beginnings of two of these developments in two strong verb classes to find out what the evidence from Middle English regional dialects can tell us about their origins and diffusion. Chapter 2 concentrates on the strong-to-weak shift in Class III verbs, and investigates to what extent strong, mixed and weak past tense and participle forms vary in Middle English dialects, and whether the variation is more pronounced in the paradigms of specific verbs or sub-classes. Chapter 3 analyses the regional distribution of ablaut levelling in strong Class IV verbs throughout the Middle English period. The Class III and IV data for the Early Middle English period are drawn from A Linguistic Atlas of Early Middle English, and the data for the Late Middle English period from a sub-corpus of files from The Penn-Helsinki Parsed Corpus of Middle English and The Middle English Grammar Corpus. Furthermore, The English Dialect Dictionary and Grammar are consulted as an additional reference point to find out to what extent the Middle English developments are reflected in Late Modern English dialects. Finally, referring to modern insights into language variation and change and linguistic interference, Chapter 4 discusses to what extent intra- and extra-linguistc factors, such as token and type frequency, stem structure and language contact, might correlate with the strong-to-weak shift and ablaut levelling in Class III and IV verbs in the Middle English period. The thesis is accompanied by six appendices that contain further information about my distinction of Middle English dialect areas (Appendix A), historical Class III and IV verbs (B and C) and the text samples and linguistic data from the Middle English text corpora (D, E and F).
Resumo:
This thesis focuses on the history of the inflexional subjunctive and its functional substitutes in Late Middle English. To explore why and how the inflexional subjunctive declined in the history of English language, I analysed 2653 examples of three adverbial clauses introduced by if (1882 examples), though (305 examples) and lest (466 examples). Using a corpus-based approach, this thesis argues that linguistic change in subjunctive constructions did not happen suddenly but rather gradually, and the way it changed was varied , and that different constructions changed at different speeds in different environments. It is well known that the inflexional subjunctive declined in the history of English, mainly because of inflexional loss. Strangely however this topic has been comparatively neglected in the scholarly literature, especially with regard to the Middle English period, probably due to the limitations of data and also because study of this development requires very cumbersome textual research. This thesis has derived and analysed the data from three large corpora in the public domain: the Middle English Grammar Corpus (MEG-C for short), the Innsbruck Computer Archive of Machine-Readable English Texts (ICAMET for short), and some selected texts from The Corpus of Middle English Prose and Verse, part of the Middle English Compendium that also includes the Middle English Dictionary. The data were analysed from three perspectives: 1) clausal type, 2) dialect, and 3) textual genre. The basic methodology for the research was to analyse the examples one by one, with special attention being paid to the peculiarities of each text. In addition, this thesis draw on some complementary – indeed overlapping -- linguistic theories for further discussion: 1) Biber’s multi-dimensional theory, 2) Ogura and Wang’s (1994) S-curve or ‘diffusion’ theory, 3) Kretzchmar’s (2009) linguistics of speech, and 4) Halliday’s (1987) notion of language as a dynamic open system. To summarise the outcomes of this thesis: 1) On variation between clausal types, it was shown that the distributional tendencies of verb types (sub, ind, mod) are different between the three adverbial clauses under consideration. 2) On variation between dialects, it has been shown that the northern area, i.e. the so-called Great Scandinavian Belt, displays an especially high comparative ratio of the inflexional subjunctive construction compared to the other areas. This thesis suggests that this result was caused by the influence of Norse, relating the finding to the argument of Samuels (1989) that the present tense -es ending in the northern dialect was introduced by the influence of the Scandinavians. 3) On variation between genres, those labelled Science, Documents and Religion display relatively high ratio of the inflexional subjunctive, while Letter, Romance and History show relatively low ratio of the inflexional subjunctive. This results are explained by Biber’s multi-dimensional theory, which shows that the inflexional subjunctive can be related to the factors ‘informational’, ‘non-narrative’, ‘persuasive’ and ‘abstract’. 4) Lastly, on the inflexional subjunctive in Late Middle English, this thesis concludes that 1) the change did not happen suddenly but gradually, and 2) the way language changes varies. Thus the inflexional subjunctive did not disappear suddenly from England, and there was a time lag among the clausal types, dialects and genres, which can be related to Ogura and Wang’s S-curve (“diffusion”) theory and Kretzchmars’s view of “linguistic continuum”. This thesis has shown that the issues with regard to the inflexional subjunctive are quite complex, so that research in this area requires not only textual analysis but also theoretical analysis, considering both intra- and extra- linguistic factors.
Resumo:
RESUMO No presente trabalho, realizamos um estudo sobre a sintaxe histórica da língua portuguesa, focalizando as construções com se apassivador/indeterminador. Partindo de uma concepção de língua histórica, considerada em sua dimensão sociolinguística (COSERIU, 1979a; LABOV, 1972, 1982), analisamos a situação de variação e mudança linguística por que passam tais construções na gramática do português arcaico. Para tanto, utilizamos quatro corpora, representativos da prosa literária e não-literária do português dos séculos XIII, XIV, XV e XVI. Paralelamente ao estudo linguístico deste sintaticismo no referido período, esboçamos também um estudo historiográfico recuperando as reflexões dedicadas ao tema das construções com se pelas tradições gramaticais portuguesa e brasileira, bem como pelos estudos filológicos e linguístico-históricos. ABSTRACT In this paper, we carry out a study on Portuguese historical syntax, focusing on the se constructions. Based on a conception of historical language, considered in its sociolinguistic dimension (COSERIU, 1979a; LABOV, 1972, 1982), we analyze linguistic variation and change which these constructions undergo in the grammar of Old Portuguese. We used four corpora, representative of literary and non-literary Portuguese prose of the of 13th, 14th, 15th and 16th centuries. Parallel to the syntactic study, we also outline a study recovering the reflections on the theme of the se constructions by Brazilian and Portuguese grammatical tradition, as well as by the philological and historical linguistic studies.
Resumo:
This paper describes various experiments done to investigate author profiling of tweets in 4 different languages – English, Dutch, Italian, and Spanish. Profiling consists of age and gender classification, as well as regression on 5 different person- ality dimensions – extroversion, stability, agreeableness, open- ness, and conscientiousness. Different sets of features were tested – bag-of-words, word ngrams, POS ngrams, and average of word embeddings. SVM was used as the classifier. Tfidf worked best for most English tasks while for most of the tasks from the other languages, the combination of the best features worked better.
Resumo:
This thesis aimed to investigate the cognitive underpinnings of math skills, with particular reference to cognitive, and linguistic markers, core mechanisms of number processing and environmental variables. In particular, the issue of intergenerational transmission of math skills has been deepened, comparing parents’ and children’s basic and formal math abilities. This pattern of relationships amongst these has been considered in two different age ranges, preschool and primary school children. In the first chapter, a general introduction on mathematical skills is offered, with a description of some seminal works up to recent studies and latest findings. The first chapter concludes with a review of studies about the influence of environmental variables. In particular, a review of studies about home numeracy and intergenerational transmission is examined. The first study analyzed the relationship between mathematical skills of children attending primary school and those of their mothers. The objective of this study was to understand the influence of mothers' math abilities on those of their children. In the second study, the relationship between parents’ and children numerical processing has been examined in a sample of preschool children. The goal was to understand how mathematical skills of parents were relevant for the development of the numerical skills of children, taking into account children’s cognitive and linguistic skills as well as the role of home numeracy. The third study had the objective of investigating whether the verbal and nonverbal cognitive skills presumed to underlie arithmetic are also related to reading. Primary school children were administered measures of reading and arithmetic to understand the relationships between these two abilities and testing for possible shared cognitive markers. Finally, in the general discussion a summary of main findings across the study is presented, together with clinical and theoretical implications.
Resumo:
Translanguaging is quite common both in bilingual conversation and in interpreted discourse and poses challenges that mediators need to address in their work, training, and in the theoretical study of this activity. The impact of translanguaging on communication is very significant and affects the nature and workings of conversational exchanges, the identity of multilinguals, and the concept of language. This paper will examine how this plays a role in interaction and, specifically, in interpreter-mediated communication, arguing that linguistic and cultural mediation should consider such multilingual practices.