88 resultados para Text linguistics
Resumo:
This paper describes the methodology used to compile a corpus called MorphoQuantics that contains a comprehensive set of 17,943 complex word types extracted from the spoken component of the British National Corpus (BNC). The categorisation of these complex words was derived primarily from the classification of Prefixes, Suffixes and Combining Forms proposed by Stein (2007). The MorphoQuantics corpus has been made available on a website of the same name; it lists 554 word-initial and 281 word-final morphemes in English, their etymology and meaning, and records the type and token frequencies of all the associated complex words containing these morphemes from the spoken element of the BNC, together with their Part of Speech. The results show that, although the number of word-initial affixes is nearly double that of word-final affixes, the relative number of each observed in the BNC is very similar; however, word-final affixes are more productive in that, on average, the frequency with which they attach to different bases is three times that of word-initial affixes. Finally, this paper considers how linguists, psycholinguists and psychologists may use MorphoQuantics to support their empirical work in first and second language acquisition, and clinical and educational research.
Resumo:
This study explores how the typographic layout of information influences readers' impressions of magazine contents pages. Thirteen descriptors were used in a paired comparison procedure that assessed whether participants' rhetorical impressions of a set of six controlled documents change in relation to variations in layout. The combinations of layout attributes tested were derived from the structural attributes associated with three patterns of typographic differentiation (high, moderate, and low) described in a previous study (see Moys, 2014). The content and the range of stylistic attributes applied to the test material were controlled in order to focus on layout attributes. Triangulation of the quantitative and qualitative data indicates that, even within the experimental confines of limited stylistic differentiation, the layout attributes associated with patterns of high, moderate, and low typographic differentiation do influence readers' rhetorical judgments. In addition, the findings emphasize the importance of considering inter-relationships between clusters of typographic attributes rather than testing isolated variables.
Resumo:
Whereas there is substantial scholarship on formulaic language in L1 and L2 English, there is less research on formulaicity in other languages. The aim of this paper is to contribute to learner corpus research into formulaic language in native and non-native German. To this effect, a corpus of argumentative essays written by advanced British students of German (WHiG) was compared with a corpus of argumentative essays written by German native speakers (Falko-L1). A corpus-driven analysis reveals a larger number of 3-grams in WHiG than in Falko-L1, which suggests that British advanced learners of German are more likely to use formulaic language in argumentative writing than their native-speaker counterparts. Secondly, by classifying the formulaic sequences according to their functions, this study finds that native speakers of German prefer discourse-structuring devices to stance expressions, whilst British advanced learners display the opposite preferences. Thirdly, the results show that learners of German make greater use of macro-discourse-structuring devices and cautious language, whereas native speakers favour micro-discourse structuring devices and tend to use more direct language. This study increases our understanding of formulaic language typical of British advanced learners of German and reveals how diverging cultural paradigms can shape written native speaker and learner output.
Resumo:
The main aim of this chapter is to offer an overview of research that has adopted the methodology of Corpus Linguistics to study aspects of language use in the media. The overview begins by introducing the key principles and analytical tools adopted in corpus research. To demonstrate the contribution of corpus approaches to media linguistics, a selection of recent corpus studies is subsequently discussed. The final section summarises the strengths and limitations of corpus approaches and discusses avenues for further research.
Resumo:
Explaining the diversity of languages across the world is one of the central aims of typological, historical, and evolutionary linguistics. We consider the effect of language contact-the number of non-native speakers a language has-on the way languages change and evolve. By analysing hundreds of languages within and across language families, regions, and text types, we show that languages with greater levels of contact typically employ fewer word forms to encode the same information content (a property we refer to as lexical diversity). Based on three types of statistical analyses, we demonstrate that this variance can in part be explained by the impact of non-native speakers on information encoding strategies. Finally, we argue that languages are information encoding systems shaped by the varying needs of their speakers. Language evolution and change should be modeled as the co-evolution of multiple intertwined adaptive systems: On one hand, the structure of human societies and human learning capabilities, and on the other, the structure of language.
Resumo:
Sorace (2000, 2005) has claimed that while L2 learners can easily acquire properties of L2 narrow syntax they have significant difficulty with regard to interpretation and the discourse distribution of related properties, resulting in so-called residual optionality. However, there is no consensus as to what this difficulty indicates. Is it related to an insurmountable grammatical representational deficit (in the sense of representation deficit approaches; e.g. Beck 1998, Franceschina 2001, Hawkins 2005), is it due to cross-linguistic interference, or is it just a delay due to a greater complexity involved in the acquisition of interface-conditioned properties? In this article, I explore the L2 distribution of null and overt subject pronouns of English speaking learners of L2 Spanish. While intermediate learners clearly have knowledge of the syntax of Spanish null subjects, they do not have target-like pragmatic knowledge of their distribution with overt subjects. The present data demonstrate, however, that this difficulty is overcome at highly advanced stages of L2 development, thus suggesting that properties at the syntax-pragmatics interface are not destined for inevitable fossilization.
Resumo:
According to the 2000 census, 35.3 million Hispanics live in the United States. This number comprises 12.5% of the overall population rendering the Latino community the largest minority in the United States. The Mexican community is not only the largest Hispanic group but also the fastest growing: from 1990 to 2000, the Mexican population grew 52.9% increasing from 13.5 million to 20.6 million (U.S. Department of Commerce News, 2001). The influx of Mexican immigrants coupled with the expansion of their community within the United States has created an unparalleled situation of language contact. Language is synonymous with identity (cf. Granger, 2004, and works cited within). To the extent that this is true, Spanish is synonymous with being Mexican and by extension, Chicano. With the advent of amnesty programs such as Immigration Reform and Control Act (IRCA), which naturalized millions of Mexican migrants, what was once a temporal migratory population has become increasingly permanent (Durand et al., 1999). In an effort to conserve Mexican traditions and identity, the struggle to preserve the mother tongue while at the same time acculturate to mainstream Americana has resulted in a variant of Spanglish that has received little attention. This paper will examine the variant of Spanglish seen in the greater Los Angeles area and liken it to the bi-national identity under which these Mexican Americans thrive.
Resumo:
Two varieties of Greek are spoken on the island of Cyprus: the local dialect, namely the Greek-Cypriot Dialect (GCD), and Standard Modern Greek (SMG). English is also influential, as Cyprus was an English colony until 1960. The dialect is rarely employed for everyday written purposes; however, it is now evident in computer-mediated communication (CMC). As a contribution to the field of code-switching in writing, this study examines how Greek-Cypriot internet users employ GCD, SMG, and English in their Facebook interactions. In particular, we investigate how identities (discursive and social) are performed and indexed through the linguistic choices of Greek-Cypriot internet users. The findings indicate that switches to GCD add a humorous tone and express solidarity and informality. SMG is mostly used for ‘official’ statements, and it is preferred by mature internet users, while English is used with expressions of affect and evaluative comments.
Resumo:
This paper investigates the attitudes of Greek-Cypriot internet users towards written Cypriot Greek (CG) in online chat. CG does not have a standard official orthography and it is only used in informal oral communication. With the emergence of computer-mediated communication (CMC), a novel Romanized form of CG is used instead of Standard Greek (SG) in online environments (Themistocleous 2005). To investigate language attitudes, an online questionnaire was distributed electronically to Greek-Cypriot internet users. The results show that the majority of the informants have positive attitudes towards written CG, a practice that goes against the results of previous attitudinal surveys. In this paper, I demonstrate how the internet can influence and change the attitudes of Greek-Cypriots towards their regional variety. It is argued that the unconventional and norm-free character of CMC allows internet users to use their non-standard variety in a domain where the standard would be expected to be used.
Resumo:
The present longitudinal study examines the interaction of learner variables (gender, motivation, self-efficacy and first language literacy) and their influence on second language learning outcomes. The study follows English learners of French from Year 5 in primary school (aged 9-10) to the first year in secondary school (Year 7 aged 11-12). Language outcomes were measured by two oral production tasks; a sentence repetition task and a photo description task both of which were administered at three time points. Longitudinal data on learner attitudes and motivation were collected via questionnaires. Teacher assessment data for general first language literacy attainment were also provided. The results show a great deal of variation in learner attitudes and outcomes and that there is a complex relationship between first language literacy, self-efficacy, gender and attainment. For example, in general, girls held more positive attitudes to boys and were more successful. However, the inclusion of first language ability, which explained 30-40% of variation, shows that gender differences in attitudes and outcomes are likely mediated by first language literacy and prior learning experience.