4 resultados para Unstructured data
em Helda - Digital Repository of University of Helsinki
Resumo:
In this paper, I look into a grammatical phenomenon found among speakers of the Cambridgeshire dialect of English. According to my hypothesis, the phenomenon is a new entry into the past BE verb paradigm in the English language. In my paper, I claim that the structure I have found complements the existing two verb forms, was and were, with a third verb form that I have labelled ‘intermediate past BE’. The paper is divided into two parts. In the first section, I introduce the theoretical ground for the study of variation, which is founded on empiricist principles. In variationist linguistics, the main claim is that heterogeneous language use is structured and ordered. In the last 50 years of history in modern linguistics, this claim is controversial. In the 1960s, the generativist movement spearheaded by Noam Chomsky diverted attention away from grammatical theories that are based on empirical observations. The generativists steered away from language diversity, variation and change in favour of generalisations, abstractions and universalist claims. The theoretical part of my paper goes through the main points of the variationist agenda and concludes that abandoning the concept of language variation in linguistics is harmful for both theory and methodology. In the method part of the paper, I present the Helsinki Archive of Regional English Speech (HARES) corpus. It is an audio archive that contains interviews conducted in England in the 1970s and 1980s. The interviews were done in accordance to methods used generally in traditional dialectology. The informants are mostly elderly male people who have lived in the same region throughout their lives and who have left school at an early age. The interviews are actually conversations: the interviewer allowed the informant to pick the topic of conversation to induce a maximally relaxed and comfortable atmosphere and thus allow the most natural dialect variant to emerge in the informant’s speech. In the paper, the corpus chapter introduces some of the transcription and annotation problems associated with spoken language corpora (especially those containing dialectal speech). Questions surrounding the concept of variation are present in this part of the paper too, as especially transcription work is troubled by the fundamental problem of having to describe the fluctuations of everyday speech in text. In the empirical section of the paper, I use HARES to analyse the speech of four informants, with special focus on the emergence of the intermediate past BE variant. My observations and the subsequent analysis permit me to claim that my hypothesis seems to hold. The intermediate variant occupies almost all contexts where one would expect was or were in the informants’ speech. This means that the new variant is integrated into the speakers’ grammars and exemplifies the kind of variation that is at the heart of this paper.
Resumo:
The aim of this study was to measure seasonal variation in mood and behaviour. The dual vulnerability and latitude effect hypothesis, the risk of increased appetite, weight and other seasonal symptoms to develop metabolic syndrome, and perception of low illumination in quality of life and mental well-being were assessed. These variations are prevalent in persons who live in high latitudes and need balancing of metabolic processes to adapt to environmental changes due to seasons. A randomized sample of 8028 adults aged 30 and over (55% women) participated in an epidemiological health examination study, The Health 2000, applying the probability proportional to population size method for a range of socio-demographic characteristics. They were present in a face-to-face interview at home and health status examination. The questionnaires included the modified versions of the Seasonal Pattern Assessment Questionnaire (SPAQ) and Beck Depression Inventory (BDI), the Health Related Quality of Life (HRQoL) instrument 15D, and the General Health Questionnaire (GHQ). The structured and computerized Munich Composite International Diagnostic Interview (M-CIDI) as part of the interview was used to assess diagnoses of mental disorders, and, the National Cholesterol Education Program Adult Treatment Panel III (NCEP-ATPIII) criteria were assessed using all the available information to detect metabolic syndrome. A key finding was that 85% of this nationwide representative sample had seasonal variation in mood and behaviour. Approximately 9% of the study population presented combined seasonal and depressive symptoms with a significant association between their scores, and 2.6% had symptoms that corresponded to Seasonal Affective Disorder (SAD) in severity. Seasonal variations in weight and appetite are two important components that increase the risk of metabolic syndrome. Other factors such as waist circumference and major depressive disorder contributed to the metabolic syndrome as well. Persons reported of having seasonal symptoms were associated with a poorer quality of life and compromised mental well-being, especially if indoors illumination at home and/or at work was experienced as being low. Seasonal and circadian misalignments are suggested to associate with metabolic disorders, and could be remarked if individuals perceive low illumination levels at home and/or at work that affect the health-related quality of life and mental well-being. Keywords: depression, health-related quality of life, illumination, latitude, mental well-being, metabolic syndrome, seasonal variation, winter.
Resumo:
In genetic epidemiology, population-based disease registries are commonly used to collect genotype or other risk factor information concerning affected subjects and their relatives. This work presents two new approaches for the statistical inference of ascertained data: a conditional and full likelihood approaches for the disease with variable age at onset phenotype using familial data obtained from population-based registry of incident cases. The aim is to obtain statistically reliable estimates of the general population parameters. The statistical analysis of familial data with variable age at onset becomes more complicated when some of the study subjects are non-susceptible, that is to say these subjects never get the disease. A statistical model for a variable age at onset with long-term survivors is proposed for studies of familial aggregation, using latent variable approach, as well as for prospective studies of genetic association studies with candidate genes. In addition, we explore the possibility of a genetic explanation of the observed increase in the incidence of Type 1 diabetes (T1D) in Finland in recent decades and the hypothesis of non-Mendelian transmission of T1D associated genes. Both classical and Bayesian statistical inference were used in the modelling and estimation. Despite the fact that this work contains five studies with different statistical models, they all concern data obtained from nationwide registries of T1D and genetics of T1D. In the analyses of T1D data, non-Mendelian transmission of T1D susceptibility alleles was not observed. In addition, non-Mendelian transmission of T1D susceptibility genes did not make a plausible explanation for the increase in T1D incidence in Finland. Instead, the Human Leucocyte Antigen associations with T1D were confirmed in the population-based analysis, which combines T1D registry information, reference sample of healthy subjects and birth cohort information of the Finnish population. Finally, a substantial familial variation in the susceptibility of T1D nephropathy was observed. The presented studies show the benefits of sophisticated statistical modelling to explore risk factors for complex diseases.