9 resultados para Data cleaning

em Helda - Digital Repository of University of Helsinki


Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper, I look into a grammatical phenomenon found among speakers of the Cambridgeshire dialect of English. According to my hypothesis, the phenomenon is a new entry into the past BE verb paradigm in the English language. In my paper, I claim that the structure I have found complements the existing two verb forms, was and were, with a third verb form that I have labelled ‘intermediate past BE’. The paper is divided into two parts. In the first section, I introduce the theoretical ground for the study of variation, which is founded on empiricist principles. In variationist linguistics, the main claim is that heterogeneous language use is structured and ordered. In the last 50 years of history in modern linguistics, this claim is controversial. In the 1960s, the generativist movement spearheaded by Noam Chomsky diverted attention away from grammatical theories that are based on empirical observations. The generativists steered away from language diversity, variation and change in favour of generalisations, abstractions and universalist claims. The theoretical part of my paper goes through the main points of the variationist agenda and concludes that abandoning the concept of language variation in linguistics is harmful for both theory and methodology. In the method part of the paper, I present the Helsinki Archive of Regional English Speech (HARES) corpus. It is an audio archive that contains interviews conducted in England in the 1970s and 1980s. The interviews were done in accordance to methods used generally in traditional dialectology. The informants are mostly elderly male people who have lived in the same region throughout their lives and who have left school at an early age. The interviews are actually conversations: the interviewer allowed the informant to pick the topic of conversation to induce a maximally relaxed and comfortable atmosphere and thus allow the most natural dialect variant to emerge in the informant’s speech. In the paper, the corpus chapter introduces some of the transcription and annotation problems associated with spoken language corpora (especially those containing dialectal speech). Questions surrounding the concept of variation are present in this part of the paper too, as especially transcription work is troubled by the fundamental problem of having to describe the fluctuations of everyday speech in text. In the empirical section of the paper, I use HARES to analyse the speech of four informants, with special focus on the emergence of the intermediate past BE variant. My observations and the subsequent analysis permit me to claim that my hypothesis seems to hold. The intermediate variant occupies almost all contexts where one would expect was or were in the informants’ speech. This means that the new variant is integrated into the speakers’ grammars and exemplifies the kind of variation that is at the heart of this paper.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The research is related to the Finnish Jabal Harun Project (FJHP), which is part of the research unit directed by Professor Jaakko Frösén. The project consists of two interrelated parts: the excavation of a Byzantine monastery/pilgrimage centre on Jabal Harun, and a multiperiod archaeological survey of the surrounding landscape. It is generally held that the Near Eastern landscape has been modified by millennia of human habitation and activity. Past climatic changes and human activities could be expected to have significantly changed also the landscape of the Jabal Harun area. Therefore it was considered that a study of erosion in the Jabal Harun area could shed light on the environmental and human history of the area. It was hoped that it would be possible to connect the results of the sedimentological studies either to wider climatic changes in the Near East, or to archaeologically observable periods of human activity and land use. As evidence of some archaeological periods is completely missing from the Jabal Harun area, it was also of interest whether catastrophic erosion or unfavourable environmental change, caused either by natural forces or by human agency, could explain the gaps in the archaeological record. Changes in climate and/or land-use were expected to be reflected in the sedimentary record. The field research, carried out as part of the FJHP survey fieldwork, included the mapping of wadi terraces and cleaning of sediment profiles which were recorded and sampled for laboratory analyses of facies and lithology. To obtain a chronology for the sedimentation and erosion phases also OSL (optically stimulated luminescence) dating samples were collected. The results were compared to the record of the Near Eastern palaeoclimate, and to data from geoarchaeological studies in central and southern Jordan. The picture of the environmental development was then compared to the human history in the area, based on archaeological evidence from the FJHP survey and the published archaeological research in the Petra region, and the question of the relationship between human activity and environmental change was critically discussed. Using the palaeoclimatic data and the results from geoarchaeological studies it was possible to outline the environmental development in the Jabal Harun area from the Pleistocene to the present.It is appears that there was a phase of accumulation of sediment before the Middle Palaeolithic period, possibly related to tectonic movement. This phase was later followed by erosion, tentatively suggested to have taken place during the Upper Palaeolithic. A period of wadi aggradation probably occurred during the Late Glacial and continued until the end of the Pleistocene, followed by significant channel degradation, attributed to increased rainfall during the Early Holocene. It seems that during the later Holocene channel incision has been dominant in the Jabal Harûn area although there have been also small-scale channel aggradation phases, two of which were OSL-dated to around 4000-3000 BP and 2400-2000 BP. As there is no evidence of tectonic movements in the Jabal Harun area after the early Pleistocene, it is suggested that climate change and human activity have been the major causes of environmental change in the area. At a brief glance it seems that many of the changes in the settlement and land use in the Jabal Harun area can be explained by climatic and environmental conditions. However, the responses of human societies to environmental change are dependent on many factors. Therefore an evaluation of the significance of environmental, cultural, socio-economic and political factors is needed to decide whether certain phenomena are environmentally induced. Comparison with the wider Petra region is also needed to judge whether the phenomena are characteristic of the Jabal Harun area only, or can they be connected to social, political and economic development over a wider area.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The present cross-sectional study aimed to assess oral health behaviour, dental and periodontal conditions, dental care, and their relationships among elderly dentate patients in Lithuania. The target population in the study were dentate patients aged 60 and older attending public dental services in Kedainiai, Lithuania. The data collection took place between the autumn of 1999 and the winter of 2001. Data were collected by means of a self-administered questionnaire for all (n=174) and a clinical examination targeting about half of the subjects (n=100). The questionnaire inquired about oral health behaviour, the life-first and also the most recent dental treatments, sources on and self-assessed knowledge of oral self-care, a self-reported number of teeth, and socio-demographic information. The clinical examination included basic dental and periodontal conditions. A total of 82 women and 92 men completed the questionnaire; their mean age was 69.2 and their average number of teeth was 16.2 (CI 95% 15.4-17.1). In all, 25% had 21 or more teeth and 32% indicated wearing removable dentures. The oral health behaviour, the participants reported, was poor: 30% reported twice daily toothbrushing, 57% responded that they always use fluoride toothpaste, 19% indicated daily interdental cleaning, nearly all said they take sugar in their coffee and tea, and 30% indicated going for check-ups. As the main source of information on oral self-care, the subjects indicated health professionals (82%), followed by social contacts (72%), broadcasted media (58%), and printed media (42%). A total of 34% assessed their knowledge of oral self-care as good, and their self-assessed knowledge correlated (r=0.52) with professional guidance they had received about oral self-care. In their most recent treatment, conservative (39%) and non-conservative (34%) treatments dominated, and preventive ones were the least reported (7%). Regarding guidance in oral self-care, 54% reported having received such about toothbrushing, 32% about interdental cleaning, and 33% had been given visual information. Clinical examinations revealed the presence of plaque, calculus, bleeding on probing and deepened pockets in all of the subjects; 70% of the subjects were diagnosed with pockets of 6mm and deeper, 94% with caries, and 73% with overhangs of restorations. Those subjects assessing their knowledge of oral self-care as good and reporting a higher intensity of guidance in oral self-care as received, indicated practicing the recommended oral self-care more frequently. Twice daily toothbrushing was associated with good self-assessed knowledge of oral self-care (OR 4.1, p<0.001) and a university education (OR 5.6, p<0.001). Those subjects with better oral health behaviour had a greater number of teeth. Having 21 or more teeth was associated with good self-assessed knowledge of oral self-care (OR 4.1, p=0.03). Better periodontal conditions were associated with a higher frequency of toothbrushing. The presence of periodontal pockets of 6mm and deeper was associated with the level of self-assessed knowledge of oral self-care being below good (OR=3.0, p=0.04) and the level of dental cleanliness being poor (OR=2.7, p=0.02). To conclude, oral health behaviour and conditions call for improvement in elderly subjects in Lithuania. To improve the oral health of their elderly dentate patients, dentists should apply all the available tools of chair-side prevention and active guidance. The latter would be an effective means of updating the knowledge of oral self-care and supporting recommended oral health behaviour. A preventive approach should be strongly emphasized in countries with limited resources for oral health care, such as Lithuania. Author’s address: Sonata Vyšniauskaite, Department of Oral Public Health, Institute of Dentistry, University of Helsinki, P.O.Box 41, FI-00014 Helsinki, Finland. E-mail: sonata.vysniauskaite@helsinki.fi

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Objective and background. Tobacco smoking, pancreatitis and diabetes mellitus are the only known causes of pancreatic cancer, leaving ample room for yet unidentified determinants. This is an empirical study on a Finnish data on occupational exposures and pancreatic cancer risk, and a non-Bayesian and a hierarchical Bayesian meta-analysis of data on occupational factors and pancreatic cancer. Methods. The case-control study analyzed 595 incident cases of pancreatic cancer and 1,622 controls of stomach, colon, and rectum cancer, diagnosed 1984-1987 and known to be dead by 1990 in Finland. The next-of-kin responded to a mail questionnaire on job and medical histories and lifestyles. Meta-analysis of occupational risk factors of pancreatic cancer started off with 1,903 identified studies. The analyses were based on different subsets of that database. Five epidemiologists examined the reports and extracted the pertinent data using a standardized extraction form that covered 20 study descriptors and the relevant relative risk estimates. Random effects meta-analyses were applied for 23 chemical agents. In addition, hierarchical Bayesian models for meta-analysis were applied to the occupational data of 27 job titles using job exposure matrix as a link matrix and estimating the relative risks of pancreatic cancer associated with nine occupational agents. Results. In the case-control study, logistic regressions revealed excess risks of pancreatic cancer associated with occupational exposures to ionizing radiation, nonchlorinated solvents, and pesticides. Chlorinated hydrocarbon solvents and related compounds, used mainly in metal degreasing and dry cleaning, are emerging as likely risk factors of pancreatic cancer in the non-Bayesian and the hierarchical Bayesian meta-analysis. Consistent excess risk was found for insecticides, and a high excess for nickel and nickel compounds in the random effects meta-analysis but not in the hierarchical Bayesian meta-analysis. Conclusions. In this study occupational exposure to chlorinated hydrocarbon solvents and related compounds and insecticides increase risk of pancreatic cancer. Hierarchical Bayesian meta-analysis is applicable when studies addressing the agent(s) under study are lacking or very few, but several studies address job titles with potential exposure to these agents. A job-exposure matrix or a formal expert assessment system is necessary in this situation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The aim of this study was to measure seasonal variation in mood and behaviour. The dual vulnerability and latitude effect hypothesis, the risk of increased appetite, weight and other seasonal symptoms to develop metabolic syndrome, and perception of low illumination in quality of life and mental well-being were assessed. These variations are prevalent in persons who live in high latitudes and need balancing of metabolic processes to adapt to environmental changes due to seasons. A randomized sample of 8028 adults aged 30 and over (55% women) participated in an epidemiological health examination study, The Health 2000, applying the probability proportional to population size method for a range of socio-demographic characteristics. They were present in a face-to-face interview at home and health status examination. The questionnaires included the modified versions of the Seasonal Pattern Assessment Questionnaire (SPAQ) and Beck Depression Inventory (BDI), the Health Related Quality of Life (HRQoL) instrument 15D, and the General Health Questionnaire (GHQ). The structured and computerized Munich Composite International Diagnostic Interview (M-CIDI) as part of the interview was used to assess diagnoses of mental disorders, and, the National Cholesterol Education Program Adult Treatment Panel III (NCEP-ATPIII) criteria were assessed using all the available information to detect metabolic syndrome. A key finding was that 85% of this nationwide representative sample had seasonal variation in mood and behaviour. Approximately 9% of the study population presented combined seasonal and depressive symptoms with a significant association between their scores, and 2.6% had symptoms that corresponded to Seasonal Affective Disorder (SAD) in severity. Seasonal variations in weight and appetite are two important components that increase the risk of metabolic syndrome. Other factors such as waist circumference and major depressive disorder contributed to the metabolic syndrome as well. Persons reported of having seasonal symptoms were associated with a poorer quality of life and compromised mental well-being, especially if indoors illumination at home and/or at work was experienced as being low. Seasonal and circadian misalignments are suggested to associate with metabolic disorders, and could be remarked if individuals perceive low illumination levels at home and/or at work that affect the health-related quality of life and mental well-being. Keywords: depression, health-related quality of life, illumination, latitude, mental well-being, metabolic syndrome, seasonal variation, winter.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In genetic epidemiology, population-based disease registries are commonly used to collect genotype or other risk factor information concerning affected subjects and their relatives. This work presents two new approaches for the statistical inference of ascertained data: a conditional and full likelihood approaches for the disease with variable age at onset phenotype using familial data obtained from population-based registry of incident cases. The aim is to obtain statistically reliable estimates of the general population parameters. The statistical analysis of familial data with variable age at onset becomes more complicated when some of the study subjects are non-susceptible, that is to say these subjects never get the disease. A statistical model for a variable age at onset with long-term survivors is proposed for studies of familial aggregation, using latent variable approach, as well as for prospective studies of genetic association studies with candidate genes. In addition, we explore the possibility of a genetic explanation of the observed increase in the incidence of Type 1 diabetes (T1D) in Finland in recent decades and the hypothesis of non-Mendelian transmission of T1D associated genes. Both classical and Bayesian statistical inference were used in the modelling and estimation. Despite the fact that this work contains five studies with different statistical models, they all concern data obtained from nationwide registries of T1D and genetics of T1D. In the analyses of T1D data, non-Mendelian transmission of T1D susceptibility alleles was not observed. In addition, non-Mendelian transmission of T1D susceptibility genes did not make a plausible explanation for the increase in T1D incidence in Finland. Instead, the Human Leucocyte Antigen associations with T1D were confirmed in the population-based analysis, which combines T1D registry information, reference sample of healthy subjects and birth cohort information of the Finnish population. Finally, a substantial familial variation in the susceptibility of T1D nephropathy was observed. The presented studies show the benefits of sophisticated statistical modelling to explore risk factors for complex diseases.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

During the last decades there has been a global shift in forest management from a focus solely on timber management to ecosystem management that endorses all aspects of forest functions: ecological, economic and social. This has resulted in a shift in paradigm from sustained yield to sustained diversity of values, goods and benefits obtained at the same time, introducing new temporal and spatial scales into forest resource management. The purpose of the present dissertation was to develop methods that would enable spatial and temporal scales to be introduced into the storage, processing, access and utilization of forest resource data. The methods developed are based on a conceptual view of a forest as a hierarchically nested collection of objects that can have a dynamically changing set of attributes. The temporal aspect of the methods consists of lifetime management for the objects and their attributes and of a temporal succession linking the objects together. Development of the forest resource data processing method concentrated on the extensibility and configurability of the data content and model calculations, allowing for a diverse set of processing operations to be executed using the same framework. The contribution of this dissertation to the utilisation of multi-scale forest resource data lies in the development of a reference data generation method to support forest inventory methods in approaching single-tree resolution.