227 resultados para Sectional Twin Data
Resumo:
In many occupational safety interventions, the objective is to reduce the injury incidence as well as the mean claims cost once injury has occurred. The claims cost data within a period typically contain a large proportion of zero observations (no claim). The distribution thus comprises a point mass at 0 mixed with a non-degenerate parametric component. Essentially, the likelihood function can be factorized into two orthogonal components. These two components relate respectively to the effect of covariates on the incidence of claims and the magnitude of claims, given that claims are made. Furthermore, the longitudinal nature of the intervention inherently imposes some correlation among the observations. This paper introduces a zero-augmented gamma random effects model for analysing longitudinal data with many zeros. Adopting the generalized linear mixed model (GLMM) approach reduces the original problem to the fitting of two independent GLMMs. The method is applied to evaluate the effectiveness of a workplace risk assessment teams program, trialled within the cleaning services of a Western Australian public hospital.
Resumo:
Binning and truncation of data are common in data analysis and machine learning. This paper addresses the problem of fitting mixture densities to multivariate binned and truncated data. The EM approach proposed by McLachlan and Jones (Biometrics, 44: 2, 571-578, 1988) for the univariate case is generalized to multivariate measurements. The multivariate solution requires the evaluation of multidimensional integrals over each bin at each iteration of the EM procedure. Naive implementation of the procedure can lead to computationally inefficient results. To reduce the computational cost a number of straightforward numerical techniques are proposed. Results on simulated data indicate that the proposed methods can achieve significant computational gains with no loss in the accuracy of the final parameter estimates. Furthermore, experimental results suggest that with a sufficient number of bins and data points it is possible to estimate the true underlying density almost as well as if the data were not binned. The paper concludes with a brief description of an application of this approach to diagnosis of iron deficiency anemia, in the context of binned and truncated bivariate measurements of volume and hemoglobin concentration from an individual's red blood cells.
Resumo:
Motivation: This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets.
Resumo:
There is a pressing need to address productivity analysis in the hospitality industry if hotels are to exist as sustainable business entities in rapidly maturing markets. Unfortunately, productivity ratios commonly used by managers are narrowly defined. This study illustrates data envelopment analysis of cross-sectional data that benchmark hotels on observed best performances. Data envelopment analysis enables management to integrate unlike multiple inputs and outputs to make simultaneous comparisons. Findings from the cross-sectional data suggest that some of the hotels have the potential to reduce number of beds and number of part-time staff while increasing revenue.
Resumo:
Background: Several studies have shown that variation in serum gamma-glutamyltransferase (GGT) in the population is associated with risk of death or development of cardiovascular disease, type 2 diabetes, stroke, or hypertension. This association is only partly explained by associations between GGT and recognized risk factors. Our aim was to estimate the relative importance of genetic and environmental sources of variation in GGT as well as genetic and environmental sources of covariation between GGT and other liver enzymes and markers of cardiovascular risk in adult twin pairs. Methods: We recruited 1134 men and 2241 women through the Australian Twin Registry. Data were collected through mailed questionnaires, telephone interviews, and by analysis of blood samples. Sources of variation in GGT, alanine aminotransferase (ALT), and aspartate aminotransferase (AST) and of covariation between GGT and cardiovascular risk factors were assessed by maximum-likelihood model-fitting. Results: Serum GGT, ALT, and AST were affected by additive genetic and nonshared environmental factors, with heritabilities estimated at 0.52, 0.48, and 0.32, respectively. One-half of the genetic variance in GGT was shared with ALT, AST, or both. There were highly significant correlations between GGT and body mass index; serum lipids, lipoproteins, glucose, and insulin; and blood pressure. These correlations were more attributable to genes that affect both GGT and known cardiovascular risk factors than to environmental factors. Conclusions: Variation in serum enzymes that reflect liver function showed significant genetic effects, and there was evidence that both genetic and environmental factors that affect these enzymes can also affect cardiovascular risk. (C) 2002 American Association for Clinical Chemistry.
Resumo:
In order to investigate the genetic and environmental antecedents of osteoarthritis (CA), self-report measures of joint pain, stiffness and swelling were obtained from a population-based sample of 1242 twin pairs over 50 years of age. In order to provide validation for these self-report measures, a subsample of 118 twin pairs were examined according to the American College of Rheumatology clinical and radiographic criteria for the classification of osteoarthritis. A variety of statistical methods were employed to identify the model derived from self-report variables which would provide optimal prediction of these standardised assessments, and structural equation modelling was used to determine the relative influences of genetic and environmental influences on the development of osteoarthritis. Significant genetic effects were found to contribute to osteoarthritis of the hands, hips and knees in women, with heritability estimates ranging from 30-46% depending on the site. In addition, the additive genetic effects contributing to osteoarthritis in various parts of the body were confirmed to be the same. Statistically significant familial aggregation of osteoarthritis in men was also observed, but it was not possible to determine whether this was due to genetic or shared environmental effects.
Resumo:
There have been few replicated examples of genotype x environment interaction effects on behavioral variation or risk of psychiatric disorder. We review some of the factors that have made detection of genotype x environment interaction effects difficult, and show how genotype x shared environment interaction (GxSE) effects are commonly confounded with genetic parameters in data from twin pairs reared together. Historic data on twin pairs reared apart can in principle be used to estimate such GxSE effects, but have rarely been used for this purpose. We illustrate this using previously published data from the Swedish Adoption Twin Study of Aging (SATSA), which suggest that GxSE effects could account for as much as 25% of the total variance in risk of becoming a regular smoker. Since few separated twin pairs will be available for study in the future, we also consider methods for modifying variance components linkage analysis to allow for environmental interactions with linked loci.
Resumo:
Murray Valley encephalitis (MVE) virus is a mosquito-borne flavivirus causing severe encephalitis with a resultant high morbidity and mortality. In the period 1989-1993. we undertook a cross-sectional and longitudinal studs by annually screening members of a small remote Aboriginal community in northwestern Australia for MVE virus antibodies. Of the estimated 250-300 people in the community. 249 were tested, and 52.6% had positive serology to MVE. The proportion testing positive increased with increasing age group. and males were slightly more likely to be positive than females. During the study period. a high proportion of the population seroconverted to MVE: the clinical/subclinical ratio seems to be lower than previously reported. Although MVE is mostly asymptomatic, the devastating consequences of clinical illness indicate that advice should be provided regarding the avoidance of mosquito bites. Our longitudinal study showed that the risk of seroconversion was similar for each age group. not just the young.
Resumo:
We study the process of photodissociation of a molecular Bose-Einstein condensate as a potential source of strongly correlated twin atomic beams. We show that the two beams can possess nearly perfect quantum squeezing in their relative numbers.
Resumo:
Observations of an insect's movement lead to theory on the insect's flight behaviour and the role of movement in the species' population dynamics. This theory leads to predictions of the way the population changes in time under different conditions. If a hypothesis on movement predicts a specific change in the population, then the hypothesis can be tested against observations of population change. Routine pest monitoring of agricultural crops provides a convenient source of data for studying movement into a region and among fields within a region. Examples of the use of statistical and computational methods for testing hypotheses with such data are presented. The types of questions that can be addressed with these methods and the limitations of pest monitoring data when used for this purpose are discussed. (C) 2002 Elsevier Science B.V. All rights reserved.