999 resultados para linguistic variation
Resumo:
This paper introduces a method for the analysis of regional linguistic variation. The method identifies individual and common patterns of spatial clustering in a set of linguistic variables measured over a set of locations based on a combination of three statistical techniques: spatial autocorrelation, factor analysis, and cluster analysis. To demonstrate how to apply this method, it is used to analyze regional variation in the values of 40 continuously measured, high-frequency lexical alternation variables in a 26-million-word corpus of letters to the editor representing 206 cities from across the United States.
Resumo:
We analyze a Big Data set of geo-tagged tweets for a year (Oct. 2013–Oct. 2014) to understand the regional linguistic variation in the U.S. Prior work on regional linguistic variations usually took a long time to collect data and focused on either rural or urban areas. Geo-tagged Twitter data offers an unprecedented database with rich linguistic representation of fine spatiotemporal resolution and continuity. From the one-year Twitter corpus, we extract lexical characteristics for twitter users by summarizing the frequencies of a set of lexical alternations that each user has used. We spatially aggregate and smooth each lexical characteristic to derive county-based linguistic variables, from which orthogonal dimensions are extracted using the principal component analysis (PCA). Finally a regionalization method is used to discover hierarchical dialect regions using the PCA components. The regionalization results reveal interesting linguistic regional variations in the U.S. The discovered regions not only confirm past research findings in the literature but also provide new insights and a more detailed understanding of very recent linguistic patterns in the U.S.
Resumo:
Previous research with children learning a second language (L2) has reported errors with verb inflection and cross-linguistic variation in accuracy and error patterns. However, owing to the cross-linguistic complexity and diversity of different verbal paradigms, the cross-linguistic effects on the nature of default forms has not been directly addressed in L2 acquisition studies. In the present study, we compared accuracy and error patterns in verbal agreement inflections in L2 children acquiring Dutch and Greek, keeping the children’s L1 constant (Turkish). Results showed that inflectional defaults in Greek follow universal predictions regarding the morphological underspecification of paradigms. However, the same universal predictions do not apply to the same extent to Dutch. It is argued that phonological properties of inflected forms should be taken into account to explain cross-linguistic differences in the acquisition of inflection. By systematically comparing patterns in child L2 Dutch and Greek, this study shows how universal mechanisms and target language properties work in tandem in the acquisition of inflectional paradigms.
Resumo:
This paper investigates whether the position of adverb phrases in sentences is regionally patterned in written Standard American English, based on an analysis of a 25 million word corpus of letters to the editor representing the language of 200 cities from across the United States. Seven measures of adverb position were tested for regional patterns using the global spatial autocorrelation statistic Moran’s I and the local spatial autocorrelation statistic Getis-Ord Gi*. Three of these seven measures were indentified as exhibiting significant levels of spatial autocorrelation, contrasting the language of the Northeast with language of the Southeast and the South Central states. These results demonstrate that continuous regional grammatical variation exists in American English and that regional linguistic variation exists in written Standard English.
Resumo:
Relatively little research on dialect variation has been based on corpora of naturally occurring language. Instead, dialect variation has been studied based primarily on language elicited through questionnaires and interviews. Eliciting dialect data has several advantages, including allowing for dialectologists to select individual informants, control the communicative situation in which language is collected, elicit rare forms directly, and make high-quality audio recordings. Although far less common, a corpus-based approach to data collection also has several advantages, including allowing for dialectologists to collect large amounts of data from a large number of informants, observe dialect variation across a range of communicative situations, and analyze quantitative linguistic variation in large samples of natural language. Although both approaches allow for dialect variation to be observed, they provide different perspectives on language variation and change. The corpus- based approach to dialectology has therefore produced a number of new findings, many of which challenge traditional assumptions about the nature of dialect variation. Most important, this research has shown that dialect variation involves a wider range of linguistic variables and exists across a wider range of language varieties than has previously been assumed. The goal of this chapter is to introduce this emerging approach to dialectology. The first part of this chapter reviews the growing body of research that analyzes dialect variation in corpora, including research on variation across nations, regions, genders, ages, and classes, in both speech and writing, and from both a synchronic and diachronic perspective, with a focus on dialect variation in the English language. Although collections of language data elicited through interviews and questionnaires are now commonly referred to as corpora in sociolinguistics and dialectology (e.g. see Bauer 2002; Tagliamonte 2006; Kretzschmar et al. 2006; D'Arcy 2011), this review focuses on corpora of naturally occurring texts and discourse. The second part of this chapter presents the results of an analysis of variation in not contraction across region, gender, and time in a corpus of American English letters to the editor in order to exemplify a corpus-based approach to dialectology.
Resumo:
This article shows that the term functionalism, very often understood as a single or uniform approach in linguistics, has to be understood in its different perspectives. I start by presenting an opposing conception similar to the I-language vs E-language in Chomsky (1986). As in the latter conception , language can be understood as an abstract model of a mind internal mechanism responsible for language production and perception or, as in the former one, it can be the description of the external use of language. Also like with formalists , there are functionalists who look for cross-linguistic variation (and universals of language use) and functionalists who look for language internal variation. It is also shown that functionalists can differ in the extent to which social variables are considered in the explanation of linguistic form.
Resumo:
Phonological development was assessed in six alphabetic orthographies (English, French, Greek, Icelandic, Portuguese and Spanish) at the beginning and end of the first year of reading instruction. The aim was to explore contrasting theoretical views regarding: the question of the availability of phonology at the outset of learning to read (Study 1); the influence of orthographic depth on the pace of phonological development during the transition to literacy (Study 2); and the impact of literacy instruction (Study 3). Results from 242 children did not reveal a consistent sequence of development as performance varied according to task demands and language. Phonics instruction appeared more influential than orthographic depth in the emergence of an early meta-phonological capacity to manipulate phonemes, and preliminary indications were that cross-linguistic variation was associated with speech rhythm more than factors such as syllable complexity. The implications of the outcome for current models of phonological development are discussed.
Resumo:
On the variation in literary languages The article discusses the formation and development of several literary languages. The main objects of the study are the Finno-Ugric languages spoken in the Volga-Kama region (Russia), and Chuvash, but, for comparison, Norwegian, Arabic, Russian, Finnish and some other Finno-Ugric languages are treated, as well. A general classification of cases where linguistic variation tends to occur in standard languages is put forward. The author also discusses the question whether some kind of universal “normal” amount of variation can be defined for literary languages. The conclusion is that the degree of variation allowed in a literary norm is highly language-specific and depends on the characteristics of the linguistic situation and the history of the literary standard.
Resumo:
Dans cette étude nous faisons une description de l'état actuel de l'espagnol parlé par la communauté paisa de Medellin (Colombie) à Montréal, dont le dialecte ou variation de langue est aussi connu comme paisa. Pour mener cette recherche à bien un travail de champ a été réalisé au moyen d’un questionnaire écrit et d’entrevues orales mi-dirigées. Les données obtenues nous permettent d’établir des premières comparaisons entre le parler des paisas de Montréal et celui des habitants de Medellin, et d’identifier quelques-uns des principaux changements linguistiques chez cette communauté parlante, spécialement les interférences du français au moment de parler en espagnol, et certains facteurs qui en favorisent, de même que certains changements au niveau socioculturel, les attitudes linguistiques des paisas vers leur variété, vers l’espagnol, en général, et aussi vers le français des québécois de la Région Métropolitaine de Montréal, en particulier. Finalement, il est aussi possible de connaître quelques-unes des différences et similitudes par rapport à la situation linguistique actuelle à la ville de Medellin. Bien que la communauté étudiée représente un bas pourcentage des hispanophones de la ville, la recherche peut contribuer à l’avancement des connaissances de l’espagnol à Montréal, puisque ses résultats, ajoutés à ceux d’autres études, permettront de mieux connaître sa réalité linguistique.
Resumo:
Le présent travail décrit –pour la première fois– l’état actuel de la langue espagnole parlée par la communauté colombienne à Montréal sur les formes de s’adresser aux autres en langage pronominal ou nominal et la courtoisie verbale. Dans le but de réaliser cette étude, un travail de terrain a été effectué à l’aide d’un questionnaire et des entrevues orales semi-dirigées adressés à 30 informateurs. L’analyse des données, nous a permis d’établir quelques premières comparaisons entre la façon de parler des habitants de la Colombie et des Colombiens résidant à Montréal et d’identifier quelques-uns des changements linguistiques principaux dans cette communauté parlante, notamment, les variations reliées aux formes de s’adresser aux autres et aux actes de courtoisie affectés par l’influence du français et de l’anglais. L’analyse effectuée tient compte autant les aspects linguistiques, pragmatiques et sociaux que les attitudes linguistiques des interviewés. De cette façon, les résultats mettent en lumière une nouvelle description sur la dynamique de l’usage des formes de s’adresser aux autres de locuteurs originaires de trois zones dialectales représentatives de la Colombie : la région andine orientale, la région andine occidentale et la zone côtière du Caraïbe. Ensemble avec d’autres études précédentes sur la formes de s’adresser aux autres, ce travail constitue une meilleure compréhension de la réalité linguistique de l’espagnol des Colombiens unilingues, bilingues et trilingues.