996 resultados para Data cleaning


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Objective and background. Tobacco smoking, pancreatitis and diabetes mellitus are the only known causes of pancreatic cancer, leaving ample room for yet unidentified determinants. This is an empirical study on a Finnish data on occupational exposures and pancreatic cancer risk, and a non-Bayesian and a hierarchical Bayesian meta-analysis of data on occupational factors and pancreatic cancer. Methods. The case-control study analyzed 595 incident cases of pancreatic cancer and 1,622 controls of stomach, colon, and rectum cancer, diagnosed 1984-1987 and known to be dead by 1990 in Finland. The next-of-kin responded to a mail questionnaire on job and medical histories and lifestyles. Meta-analysis of occupational risk factors of pancreatic cancer started off with 1,903 identified studies. The analyses were based on different subsets of that database. Five epidemiologists examined the reports and extracted the pertinent data using a standardized extraction form that covered 20 study descriptors and the relevant relative risk estimates. Random effects meta-analyses were applied for 23 chemical agents. In addition, hierarchical Bayesian models for meta-analysis were applied to the occupational data of 27 job titles using job exposure matrix as a link matrix and estimating the relative risks of pancreatic cancer associated with nine occupational agents. Results. In the case-control study, logistic regressions revealed excess risks of pancreatic cancer associated with occupational exposures to ionizing radiation, nonchlorinated solvents, and pesticides. Chlorinated hydrocarbon solvents and related compounds, used mainly in metal degreasing and dry cleaning, are emerging as likely risk factors of pancreatic cancer in the non-Bayesian and the hierarchical Bayesian meta-analysis. Consistent excess risk was found for insecticides, and a high excess for nickel and nickel compounds in the random effects meta-analysis but not in the hierarchical Bayesian meta-analysis. Conclusions. In this study occupational exposure to chlorinated hydrocarbon solvents and related compounds and insecticides increase risk of pancreatic cancer. Hierarchical Bayesian meta-analysis is applicable when studies addressing the agent(s) under study are lacking or very few, but several studies address job titles with potential exposure to these agents. A job-exposure matrix or a formal expert assessment system is necessary in this situation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The aim of this study was to measure seasonal variation in mood and behaviour. The dual vulnerability and latitude effect hypothesis, the risk of increased appetite, weight and other seasonal symptoms to develop metabolic syndrome, and perception of low illumination in quality of life and mental well-being were assessed. These variations are prevalent in persons who live in high latitudes and need balancing of metabolic processes to adapt to environmental changes due to seasons. A randomized sample of 8028 adults aged 30 and over (55% women) participated in an epidemiological health examination study, The Health 2000, applying the probability proportional to population size method for a range of socio-demographic characteristics. They were present in a face-to-face interview at home and health status examination. The questionnaires included the modified versions of the Seasonal Pattern Assessment Questionnaire (SPAQ) and Beck Depression Inventory (BDI), the Health Related Quality of Life (HRQoL) instrument 15D, and the General Health Questionnaire (GHQ). The structured and computerized Munich Composite International Diagnostic Interview (M-CIDI) as part of the interview was used to assess diagnoses of mental disorders, and, the National Cholesterol Education Program Adult Treatment Panel III (NCEP-ATPIII) criteria were assessed using all the available information to detect metabolic syndrome. A key finding was that 85% of this nationwide representative sample had seasonal variation in mood and behaviour. Approximately 9% of the study population presented combined seasonal and depressive symptoms with a significant association between their scores, and 2.6% had symptoms that corresponded to Seasonal Affective Disorder (SAD) in severity. Seasonal variations in weight and appetite are two important components that increase the risk of metabolic syndrome. Other factors such as waist circumference and major depressive disorder contributed to the metabolic syndrome as well. Persons reported of having seasonal symptoms were associated with a poorer quality of life and compromised mental well-being, especially if indoors illumination at home and/or at work was experienced as being low. Seasonal and circadian misalignments are suggested to associate with metabolic disorders, and could be remarked if individuals perceive low illumination levels at home and/or at work that affect the health-related quality of life and mental well-being. Keywords: depression, health-related quality of life, illumination, latitude, mental well-being, metabolic syndrome, seasonal variation, winter.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In genetic epidemiology, population-based disease registries are commonly used to collect genotype or other risk factor information concerning affected subjects and their relatives. This work presents two new approaches for the statistical inference of ascertained data: a conditional and full likelihood approaches for the disease with variable age at onset phenotype using familial data obtained from population-based registry of incident cases. The aim is to obtain statistically reliable estimates of the general population parameters. The statistical analysis of familial data with variable age at onset becomes more complicated when some of the study subjects are non-susceptible, that is to say these subjects never get the disease. A statistical model for a variable age at onset with long-term survivors is proposed for studies of familial aggregation, using latent variable approach, as well as for prospective studies of genetic association studies with candidate genes. In addition, we explore the possibility of a genetic explanation of the observed increase in the incidence of Type 1 diabetes (T1D) in Finland in recent decades and the hypothesis of non-Mendelian transmission of T1D associated genes. Both classical and Bayesian statistical inference were used in the modelling and estimation. Despite the fact that this work contains five studies with different statistical models, they all concern data obtained from nationwide registries of T1D and genetics of T1D. In the analyses of T1D data, non-Mendelian transmission of T1D susceptibility alleles was not observed. In addition, non-Mendelian transmission of T1D susceptibility genes did not make a plausible explanation for the increase in T1D incidence in Finland. Instead, the Human Leucocyte Antigen associations with T1D were confirmed in the population-based analysis, which combines T1D registry information, reference sample of healthy subjects and birth cohort information of the Finnish population. Finally, a substantial familial variation in the susceptibility of T1D nephropathy was observed. The presented studies show the benefits of sophisticated statistical modelling to explore risk factors for complex diseases.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Screen-less oscillation photography is the method of choice for recording three-dimensional X-ray diffraction data for crystals of biological macromolecules. The geometry of an oscillation camera is extremely simple. However, the manner in which the reciprocal lattice is recorded in any experiment is fairly complex. This depends on the Laue symmetry of the reciprocal lattice, the lattice type, the orientation of the crystal on the camera and to a lesser extent on the unit-cell dimensions. Exploring the relative efficiency of collecting X-ray diffraction data for different crystal orientations prior to data collection might reduce the number of films required to record most of the unique data and the consequent amount of time required for processing these films. Here algorithms are presented suitable for this purpose and results are reported for the 11 Laue groups, different lattice types and crystal orientations often employed in data collection.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Establish an internet platform where spatially referenced data can be viewed, entered and stored.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Development of an internet based spatial data delivery and reporting system for the Australian Cotton Industry.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The relationship between major depressive disorder (MDD) and bipolar disorder (BD) remains controversial. Previous research has reported differences and similarities in risk factors for MDD and BD, such as predisposing personality traits. For example, high neuroticism is related to both disorders, whereas openness to experience is specific for BD. This study examined the genetic association between personality and MDD and BD by applying polygenic scores for neuroticism, extraversion, openness to experience, agreeableness and conscientiousness to both disorders. Polygenic scores reflect the weighted sum of multiple single-nucleotide polymorphism alleles associated with the trait for an individual and were based on a meta-analysis of genome-wide association studies for personality traits including 13,835 subjects. Polygenic scores were tested for MDD in the combined Genetic Association Information Network (GAIN-MDD) and MDD2000+ samples (N=8921) and for BD in the combined Systematic Treatment Enhancement Program for Bipolar Disorder and Wellcome Trust Case-Control Consortium samples (N=6329) using logistic regression analyses. At the phenotypic level, personality dimensions were associated with MDD and BD. Polygenic neuroticism scores were significantly positively associated with MDD, whereas polygenic extraversion scores were significantly positively associated with BD. The explained variance of MDD and BD, approximately 0.1%, was highly comparable to the variance explained by the polygenic personality scores in the corresponding personality traits themselves (between 0.1 and 0.4%). This indicates that the proportions of variance explained in mood disorders are at the upper limit of what could have been expected. This study suggests shared genetic risk factors for neuroticism and MDD on the one hand and for extraversion and BD on the other.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Many novel computer architectures like array and multiprocessors which achieve high performance through the use of concurrency exploit variations of the von Neumann model of computation. The effective utilization of the machines makes special demands on programmers and their programming languages, such as the structuring of data into vectors or the partitioning of programs into concurrent processes. In comparison, the data flow model of computation demands only that the principle of structured programming be followed. A data flow program, often represented as a data flow graph, is a program that expresses a computation by indicating the data dependencies among operators. A data flow computer is a machine designed to take advantage of concurrency in data flow graphs by executing data independent operations in parallel. In this paper, we discuss the design of a high level language (DFL: Data Flow Language) suitable for data flow computers. Some sample procedures in DFL are presented. The implementation aspects have not been discussed in detail since there are no new problems encountered. The language DFL embodies the concepts of functional programming, but in appearance closely resembles Pascal. The language is a better vehicle than the data flow graph for expressing a parallel algorithm. The compiler has been implemented on a DEC 1090 system in Pascal.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Digital Image

Relevância:

20.00% 20.00%

Publicador:

Resumo:

postwar version of F 38358

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Handedness refers to a consistent asymmetry in skill or preferential use between the hands and is related to lateralization within the brain of other functions such as language. Previous twin studies of handedness have yielded inconsistent results resulting from a general lack of statistical power to find significant effects. Here we present analyses from a large international collaborative study of handedness (assessed by writing/drawing or self report) in Australian and Dutch twins and their siblings (54,270 individuals from 25,732 families). Maximum likelihood analyses incorporating the effects of known covariates (sex, year of birth and birth weight) revealed no evidence of hormonal transfer, mirror imaging or twin specific effects. There were also no differences in prevalence between zygosity groups or between twins and their singleton siblings. Consistent with previous meta-analyses, additive genetic effects accounted for about a quarter (23.64%) of the variance (95%CI 20.17, 27.09%) with the remainder accounted for by non-shared environmental influences. The implications of these findings for handedness both as a primary phenotype and as a covariate in linkage and association analyses are discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Many fisheries worldwide have adopted vessel monitoring systems (VMS) for compliance purposes. An added benefit of these systems is that they collect a large amount of data on vessel locations at very fine spatial and temporal scales. This data can provide a wealth of information for stock assessment, research, and management. However, since most VMS implementations record vessel location at set time intervals with no regard to vessel activity, some methodology is required to determine which data records correspond to fishing activity. This paper describes a probabilistic approach, based on hidden Markov models (HMMs), to determine vessel activity. A HMM provides a natural framework for the problem and, by definition, models the intrinsic temporal correlation of the data. The paper describes the general approach that was developed and presents an example of this approach applied to the Queensland trawl fishery off the coast of eastern Australia. Finally, a simulation experiment is presented that compares the misallocation rates of the HMM approach with other approaches.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Standardised time series of fishery catch rates require collations of fishing power data on vessel characteristics. Linear mixed models were used to quantify fishing power trends and study the effect of missing data encountered when relying on commercial logbooks. For this, Australian eastern king prawn (Melicertus plebejus) harvests were analysed with historical (from vessel surveys) and current (from commercial logbooks) vessel data. Between 1989 and 2010, fishing power increased up to 76%. To date, both forward-filling and, alternatively, omitting records with missing vessel information from commercial logbooks produce broadly similar fishing power increases and standardised catch rates, due to the strong influence of years with complete vessel data (16 out of 23 years of data). However, if gaps in vessel information had not originated randomly and skippers from the most efficient vessels were the most diligent at filling in logbooks, considerable errors would be introduced. Also, the buffering effect of complete years would be short lived as years with missing data accumulate. Given ongoing changes in fleet profile with high-catching vessels fishing proportionately more of the fleet’s effort, compliance with logbook completion, or alternatively ongoing vessel gear surveys, is required for generating accurate estimates of fishing power and standardised catch rates.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents a Chance-constraint Programming approach for constructing maximum-margin classifiers which are robust to interval-valued uncertainty in training examples. The methodology ensures that uncertain examples are classified correctly with high probability by employing chance-constraints. The main contribution of the paper is to pose the resultant optimization problem as a Second Order Cone Program by using large deviation inequalities, due to Bernstein. Apart from support and mean of the uncertain examples these Bernstein based relaxations make no further assumptions on the underlying uncertainty. Classifiers built using the proposed approach are less conservative, yield higher margins and hence are expected to generalize better than existing methods. Experimental results on synthetic and real-world datasets show that the proposed classifiers are better equipped to handle interval-valued uncertainty than state-of-the-art.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A compilation of crystal structure data on deoxyribo- and ribonucleosides and their higher derivatives is presented. The aim of this paper is to highlight the flexibility of deoxyribose and ribose rings. So far, the conformational parameters of nucleic acids constituents of ribose and deoxyribose have not been analysed separately. This paper aims to correlate the conformational parameters with the nature and puckering of the sugar. Deoxyribose puckering occurs in the C2′ endo region while ribose puckering is observed both in the C3′ endo and C2′ endo regions. A few endocyclic and exocyclic bond angles depend on the puckering and the nature of the sugar. The majority of structures have an anti conformation about the glycosyl bond. There appears to be a puckering dependence on the torsion angle about the C4′---C5′ bonds. Such stereochemical information is useful in model building studies of polynucleotides and nucleic acids.