756 results for Language Analysis
Abstract:
This study investigates the use of unsupervised features derived from word embedding approaches and novel sequence representation approaches for improving clinical information extraction systems. Our results corroborate previous findings that the use of word embeddings significantly improves the effectiveness of concept extraction models; however, we further determine the influence of the corpora used to generate such features. We also demonstrate the promise of sequence-based unsupervised features for further improving concept extraction.
Abstract:
The study proposes a method for identifying the personal imprint of literary translators in translated works of fiction. The initial assumption was that the style of a target text is not determined solely by the literary style of the author but also by features of its translator's idiolect. A method was developed for identifying the idiolectal features of individual translators, which were then used to describe personal translation styles. The method is not restricted to a particular language pair. To test the method and to establish the nature of the proposed personal imprint empirically, extracts from four English-language literary source texts (two novels by James Joyce and two by Ernest Hemingway) were first compared with their translations into Finnish (by four different translators) in order to identify changes, or shifts, that had taken place at the formal linguistic level in the translation process. To allow individual propensities to manifest themselves, only optional shifts in which the translators had a range of choices available to them were included in the study. In the second phase, extracts by different authors rendered into Finnish by the same translator were compared in order to gauge the extent of the potential impact of the author's style on the translator's work. In-depth analysis of the types of shifts made most frequently by the individual translators revealed further intersubjective differences, and the shifts were used to construct translation profiles for each of the translators. In order to determine the potential effects of frequently occurring shifts on the target text, some central concepts of narratology were adapted and used to establish an intermediate link between microlevel choices and macrolevel effects. In this way the propensity of an individual translator to opt for certain types of shift could be linked with the overall artistic effect of the target text.
Abstract:
This study deals with language change and variation in the correspondence of the eighteenth-century Bluestocking circle, a social network which provided learned men and women with an informal environment for the pursuit of scholarly entertainment. Elizabeth Montagu (1718-1800), a notable social hostess and a Shakespearean scholar, was one of its key figures. The study presents the reconstruction of Elizabeth Montagu's social networks from her youth to her later years with a special focus on the Bluestocking circle, and linguistic research on private correspondence between Montagu and her Bluestocking friends and family members between the years 1738-1778. The epistolary language use is investigated using the methods and frameworks of corpus linguistics, historical sociolinguistics, and social network analysis. The approach is diachronic and concerns real-time language change. The research is based on a selection of manuscript letters which I have edited and compiled into an electronic corpus (Bluestocking Corpus). I have also devised a network strength scale in order to quantify the strength of network ties and to compare the results of the linguistic research with the network analysis. The studies range from the reconstruction and analysis of Elizabeth Montagu's most prominent social networks to the analysis of changing morphosyntactic features and spelling variation in Montagu's and her network members' correspondence. The linguistic studies look at the use of the progressive construction, preposition stranding and pied piping, and spelling variation in terms of preterite and past participle endings in the regular paradigm (-ed, -'d, -d, -'t, -t) and full/contracted spellings of auxiliary verbs. The results are analysed in terms of social network membership, sociolinguistic variables of the correspondents, and, when relevant, aspects of eighteenth-century linguistic prescriptivism.
The studies showed a slight diachronic increase in the use of the progressive, a significant decrease of the stigmatised preposition stranding and increase of pied piping, and relatively informal but socially controlled epistolary spelling. Certain significant changes in Elizabeth Montagu's language use over the years could be attributed to her increasingly prominent social standing and the changes in her social networks, and the strength of ties correlated strongly with the use of the progressive in the Bluestocking Corpus. Gender, social rank, and register in terms of kinship/friendship had a significant influence on language use, and an effect of prescriptivism could also be detected. Elizabeth Montagu's network ties resulted in language variation in terms of network membership, her own position in a given network, and the social factors that controlled eighteenth-century interaction. When all the network ties are strong, linguistic variation seems to be essentially linked to the social variables of the informants.
Abstract:
The purpose of this research was to analyse the phonological system of the Limi dialect of Humla Bhotia. Humla Bhotia is a Tibeto-Burman language that is spoken by approximately 4000-5000 people in the far northwestern Humla province of the Kingdom of Nepal. The language has not previously been the subject of analysis. The database for this thesis was collected on two different dialects of Humla Bhotia in Kathmandu, the capital of Nepal, from February to May 2000. I had three language informants who speak Humla Bhotia as their mother tongue. One of the informants speaks the Upper Humla dialect and the other two informants speak the Limi dialect. In this thesis I have concentrated on the phonology of the dialect of Limi but occasionally I also make reference to the Upper Humla dialect. The Limi database consists of 600 words elicited in isolation, sentences where words have been checked for consonantal and pitch variation, and five texts comprising 117 sentences. Firstly, I have studied the geographical location, population and dialects of Humla Bhotia. Five dialects were identified: Limi, Upper Humla, La Yakba, Nyinba and Humli Khyampa. Information on the dialect areas is based on the accounts of seven mother tongue speakers of the language and on Nancy Levine's (1988) anthropological research of the ethnic group Nyinba. Secondly, I have analysed the phonological system of Limi from the viewpoint of American structuralism much along the lines followed by Pike 1966 [1947] and 1967 [1948]. In defining the prosodic elements I have also used acoustic analysis. In the Limi dialect there are 7 vowel phonemes. No vowel clusters occur within the same syllable. In this preliminary analysis 29 contrastive plosives, 8 affricates and 5-6 fricatives were found. The data also revealed 4 nasal phonemes, two rhotic phonemes, one lateral phoneme and two central approximants. Further research is, however, called for to check the phonemic status of these segments.
Four contrastive prosodic elements were encountered: nasalisation, length, phonation type and pitch movement. There are two contrastive types of phonation: tense and lax. Many words were found with a third type of phonation, modal phonation. How modal phonation relates to the prosodic system is unclear at this stage and is therefore left for further research to determine. There are two contrastive pitch movement tonemes: a rising toneme and a falling toneme. The falling toneme occurs in free variation with a level pitch contour. Rising appears to be linked with lax phonation and falling with tense phonation.
Abstract:
This thesis is a preliminary phonological description of the Tibetan-related Denjongka language of Sikkim, India. Because the language has not been much researched and the previous studies have focused on issues other than phonology, the present paper is the first of its kind. The data for this thesis was gathered in Gangtok, the capital of Sikkim, from March to May 2004. I had four language informants from four different locations in Sikkim who spoke different dialects of Denjongka. One of the informants, from whom I recorded c. 900 words and 530 sentences, was used as the main data source for the analysis. First, I will give some ethnographic background information on the people who speak Denjongka. Next, I will discuss first the segmental and then the suprasegmental phonology of the language, which were analysed much in line with American structuralism. I also used acoustic analysis enabled by the Praat program. Eight vowel phonemes were found. The phonemic status of /E/, however, is still suspect. I present some preliminary evidence for roundedness, frontness and height assimilation among the vowels. In the interpretation adopted in this analysis, there are no diphthongs in Denjongka. Forty consonant phonemes were found: 17 plosives, 7 affricates, 5 fricatives, 5 nasals, 4 liquids and 2 approximants. Denjongka plosives and affricates have a four-way aspiration/voicing distinction: voiceless aspirated, voiceless unaspirated, voiceless slightly aspirated (devoiced), and voiced unaspirated. Two voiceless nasals and two voiceless liquids were found. Two phonation types were found to be contrastive, lax/breathy and tense/creaky. Nasalisation and length in vowels are phonemic. Denjongka is an incipient tone language. Tonal phenomena, which involve mainly pitch and phonation type, are complex. Pitch is most of the time predictable from the initial consonant and the phonation type. In some cases, however, pitch is the only contrastive feature between words.
The description of Denjongka in this paper differs from the traditional four-tone system, which has been used in many descriptions of Tibetan-related languages. In the four-tone system, pitch is contrastive both in the high and low register, whereas in the present analysis pitch has been established to contrast only in the high register. Lastly, the appendices include a comparative word list of the four Denjongka dialects studied in this thesis.
Abstract:
A 26-hour English reading comprehension course was taught to two groups of second-year Finnish Pharmacy students: a virtual group (33 students) and a teacher-taught group (25 students). The aims of the teaching experiment were to find out: 1. What has to be taken into account when teaching English reading comprehension to students of pharmacy via the Internet and using TopClass? 2. How will the learning outcomes of the virtual group and the control group differ? 3. How will the students and the Department of Pharmacy respond to the different and new method, i.e. the virtual teaching method? 4. Will it be possible to test English reading comprehension learning material using the groupware tool TopClass? The virtual exercises were written within the Internet authoring environment, TopClass. The virtual group was given the reading material and grammar booklet on paper, but they did the reading comprehension tasks (written by the teacher) autonomously via the Internet. The control group was taught by the same teacher in 12 2-hour sessions, while the virtual group could work independently within the given six weeks. Both groups studied the same material: ten pharmaceutical articles with reading comprehension tasks as well as grammar and vocabulary exercises. Both groups took the same final test. Students in both groups were asked to evaluate the course using a 1 to 5 rating scale and they were also asked to assess their respective courses verbally. A detailed analysis of the different aspects of the student evaluation is given. Conclusions: 1. The virtual students learned pharmaceutical English relatively well but not significantly better than the classroom students. 2. The overall student satisfaction in the virtual pharmacy English reading comprehension group was found to be higher than that in the teacher-taught control group. 3. Virtual learning is easier for linguistically more able students; less able students need more time with the teacher. 4.
The sample in this study is rather small, but it is a pioneering study. 5. The Department of Pharmacy at the University of Helsinki wishes to incorporate virtual English reading comprehension teaching in its curriculum. 6. The sophisticated and versatile TopClass system is relatively easy for a traditional teacher and quite easy for the students to learn. It can be used e.g. for automatic checking of routine answers and document transfer, both of which lighten the workload of both parties. It is especially convenient for teaching reading comprehension. Key words: English reading comprehension, teacher-taught class, virtual class, attitudes of students, learning outcomes
Abstract:
This thesis explores melodic and harmonic features of heavy metal and, while doing so, explores various methods of music analysis and their applicability and limitations in the study of heavy metal music. The study is built on three general hypotheses according to which 1) acoustic characteristics play a significant role in chord construction in heavy metal, 2) heavy metal has strong ties and similarities with other Western musical styles, and 3) theories and analytical methods of Western art music may be applied to heavy metal. It seems evident that in heavy metal some chord structures appear far more frequently than others. It is suggested here that the fundamental reason for this is the use of the guitar distortion effect. Subsequently, theories as to how and under what principles heavy metal is constructed need to be put under discussion; analytical models regarding the classification of consonance and dissonance and chord categorization are here revised to meet the common practices of this music. It is evident that heavy metal is not an isolated style of music; it is seen here as a cultural fusion of various musical styles. Moreover, it is suggested that the theoretical background to the construction of Western music and its analysis can offer invaluable insights into heavy metal. However, the analytical methods need to be reformed to some extent to meet the characteristics of the music. This reformation includes an accommodation of linear and functional theories that has been found rather rarely in music theory and musicology.
Abstract:
The aim was to analyse the growth and compositional development of the receptive and expressive lexicons between the ages 0;9 and 2;0 in the full-term (FT) and the very-low-birth-weight (VLBW) children who are acquiring Finnish. The associations between the expressive lexicon and grammar at 1;6 and 2;0 in the FT children were also studied. In addition, the language skills of the VLBW children at 2;0 were analysed, as well as the predictive value of early lexicon for later language performance. Four groups took part in the studies: the longitudinal (N = 35) and cross-sectional (N = 146) samples of the FT children, and the longitudinal (N = 32) and cross-sectional (N = 66) samples of VLBW children. The data was gathered by applying a structured parental rating method (the Finnish version of the Communicative Development Inventory), through analysis of the children's spontaneous speech and by administering a formal test (Reynell Developmental Language Scales). The FT children acquired their receptive lexicons earlier, at a faster rate and with larger individual variation than their expressive lexicons. The acquisition rate of the expressive lexicon increased from slow to faster in most children (91%). Highly parallel developmental paths for lexical semantic categories were detected in the receptive and expressive lexicons of the Finnish children when they were analysed in relation to the growth of the lexicon size, as described in the literature for children acquiring other languages. The emergence of grammar was closely associated with expressive lexical growth. The VLBW children acquired their receptive lexicons at a slower rate and had weaker language skills at 2;0 than the full-term children. The compositional development of both lexicons happened at a slower rate in the VLBW children when compared to the FT controls.
However, when the compositional development was analysed in relation to the growth of lexicon size, this development occurred qualitatively in a nearly parallel manner in the VLBW children as in the FT children. Early receptive and expressive lexicon sizes were significantly associated with later language skills in both groups. The effect of the background variables (gender, length of the mother's basic education, birth weight) on the language development in the FT and the VLBW children differed. The results provide new information on early language acquisition by the Finnish FT and VLBW children. The results support the view that the early acquisition of the semantic lexical categories is related to lexicon growth. The current findings also suggest that the early grammatical acquisition is closely related to the growth of expressive vocabulary size. The language development of the VLBW children should be followed in clinical work.
Abstract:
In this paper, we present the results of an exploratory study that examined the problem of automating content analysis of student online discussion transcripts. We looked at the problem of coding discussion transcripts for the levels of cognitive presence, one of the three main constructs in the Community of Inquiry (CoI) model of distance education. Using Coh-Metrix and LIWC features, together with a set of custom features developed to capture discussion context, we developed a random forest classification system that achieved 70.3% classification accuracy and 0.63 Cohen's kappa, which is significantly higher than values reported in the previous studies. Besides improvement in classification accuracy, the developed system is also less prone to overfitting as it uses only 205 classification features, which is around 100 times fewer features than in similar systems based on bag-of-words features. We also provide an overview of the classification features most indicative of the different phases of cognitive presence, which gives additional insight into the nature of the cognitive presence learning cycle. Overall, our results show great potential of the proposed approach, with an added benefit of providing further characterization of the cognitive presence coding scheme.
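As a reading aid for the two figures reported above (accuracy and Cohen's kappa), the following is a minimal sketch of how both can be computed from a set of human codes and classifier predictions. It is not the authors' code: the function names, the label strings and the toy data are all hypothetical, and the handling of the degenerate case (chance agreement equal to 1) is a convention chosen here.

```python
from collections import Counter


def accuracy(human, predicted):
    """Share of items on which the two label sequences agree."""
    if len(human) != len(predicted) or not human:
        raise ValueError("label sequences must be non-empty and of equal length")
    return sum(h == p for h, p in zip(human, predicted)) / len(human)


def cohen_kappa(human, predicted):
    """Cohen's kappa: agreement corrected for agreement expected by chance."""
    n = len(human)
    observed = accuracy(human, predicted)
    human_counts = Counter(human)
    predicted_counts = Counter(predicted)
    # Chance agreement: sum over labels of the product of marginal proportions.
    expected = sum(
        human_counts[label] * predicted_counts[label] for label in human_counts
    ) / (n * n)
    if expected == 1.0:
        # Both sequences use one identical label throughout; kappa is
        # undefined here, and returning 1.0 is an assumption of this sketch.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy data: four messages coded with two made-up phase labels.
human_codes = ["exploration", "exploration", "integration", "integration"]
model_codes = ["exploration", "exploration", "integration", "exploration"]

print(accuracy(human_codes, model_codes))     # 0.75
print(cohen_kappa(human_codes, model_codes))  # 0.5
```

In the toy data the raw agreement is 0.75, but half of that agreement would be expected by chance given the label distributions, so kappa is 0.5. This is why kappa is usually reported alongside accuracy when the coding categories are unevenly distributed.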
Abstract:
This study investigated curriculum practices in Queensland community language schools and how these practices are supported by government policy. The conceptual framework drew on theories of ethnolinguistic vitality and curriculum dimensions. The research design involved case studies of two community language schools of different sizes, using classroom observation and interviews. Cross-case analysis revealed contrasting curriculum practices determined by student enrolments, and different capacities to access and benefit from what policy support was available. This study offers some implications and possibilities to better support quality curriculum practices in community language schools.
Abstract:
Much of the physical education curriculum in the developed world, and specifically in Australia, tends to be guided in principle by syllabus documents that represent, in varying degrees, some form of government education priorities. Through the use of critical discourse analysis we analyze one such syllabus example (an official syllabus document of one of the Australian States) to explore the relationships between the emancipatory/social justice expectations presented in the rubric of and introduction to the official syllabus document, and the language details of learning outcomes that indicate how the expectations might be satisfied. Given the complexity and multilevel pathways of message systems/ideologies, we question the efficacy of such documents oriented around social justice principles to genuinely deliver more radical agendas which promote social change and encourage a preparedness to engage in social action leading to a betterment of society.
Abstract:
The impact of Greek-Egyptian bilingualism on language use and linguistic competence is the key issue in this dissertation. The language use in a corpus of 148 Greek notarial contracts is analyzed on phonological, morphological and syntactic levels. The texts were written by bilingual notaries (agoranomoi) in Upper Egypt in the later Hellenistic period. They present, for the most part, very good administrative Greek. On the other hand, their language contains variation and idiosyncrasies that were earlier condemned as ungrammatical and bad Greek, and were not subjected to closer analysis. In order to reach plausible explanations for those phenomena, thorough research into the sociohistorical and linguistic context was needed before the linguistic analysis. The general linguistic landscape, the population pattern and the status and frequency of Greek literacy in Ptolemaic Egypt in general, and in Upper Egypt in particular, are presented. Through a detailed examination of the notaries themselves (their names, families and handwriting), it became evident that there were one to three persons at the notarial office writing under the signature of one notary. Often the documents under one notary's name were written in the same hand. We get, therefore, exceptionally close to studying idiolects in written material from antiquity. The qualitative linguistic analysis revealed that the notaries made relatively few orthographic mistakes that reflect the ongoing phonological changes and they mastered the morphological forms. The problems arose at the syntactic level, for example, with the pattern of agreement between the noun groups or a noun with its modifiers. The significant structural differences between Greek and Egyptian may lie behind the innovative strategies used by some of the notaries. Moreover, certain syntactic structures were clearly transferred from the notaries' first language, Egyptian. This is obvious in the relative clause structure.
Transfer can be found in other structures as well, although we must not forget the influence of parallel Greek structures. Sometimes these can act simultaneously. The interesting linguistic strategies and transfer features come mostly from the hand of one notary, Hermias. Some other notaries show similar patterns, for example, Hermias' cousin, Ammonios. Hermias' texts reveal that he probably spoke Greek more than his predecessors. It is possible to conclude, then, that the notaries of the later generations were more fluently bilingual; their two languages were partly integrated in their minds as an interlanguage combining elements from both languages. The earlier notaries had the two languages functionally separated and they followed the standardized contract formulae more rigidly.
Abstract:
We argue in this paper that corporate language policies have significant power implications that are easily overlooked. By drawing on previous work on power in organizations (Clegg, 1989), we examine the complex power implications of language policy decisions by looking at three levels of analysis: episodic social interaction, identity/subjectivity construction, and reconstruction of structures of domination. In our empirical analysis, we focus on the power implications of the choice of Swedish as the corporate language in the case of the recent banking sector merger between the Finnish Merita and the Swedish Nordbanken. Our findings show how language skills become empowering or disempowering resources in organizational communication, how these skills are associated with professional competence, and how this leads to the creation of new social networks. The case also illustrates how language skills are an essential element in the construction of international confrontation, lead to a construction of superiority and inferiority, and also reproduce post-colonial identities in the merging bank. Finally, we also point out how such policies ultimately lead to the reification of post-colonial and neo-colonial structures of domination in multinational corporations.