192 results for Data matrix
Abstract:
In many occupational safety interventions, the objective is to reduce the injury incidence as well as the mean claims cost once injury has occurred. The claims cost data within a period typically contain a large proportion of zero observations (no claim). The distribution thus comprises a point mass at 0 mixed with a non-degenerate parametric component. Essentially, the likelihood function can be factorized into two orthogonal components. These two components relate respectively to the effect of covariates on the incidence of claims and the magnitude of claims, given that claims are made. Furthermore, the longitudinal nature of the intervention inherently imposes some correlation among the observations. This paper introduces a zero-augmented gamma random effects model for analysing longitudinal data with many zeros. Adopting the generalized linear mixed model (GLMM) approach reduces the original problem to the fitting of two independent GLMMs. The method is applied to evaluate the effectiveness of a workplace risk assessment teams program, trialled within the cleaning services of a Western Australian public hospital.
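The factorization described above can be illustrated with a minimal sketch. It assumes a single claim probability and a single gamma shape and scale, with no covariates and no random effects, so it is a simplification of the paper's model rather than a reproduction of it. The function name, parameter names, and the example costs are hypothetical.

```python
import math

def zero_augmented_gamma_loglik(costs, p_claim, shape, scale):
    """Return (incidence part, magnitude part) of the log-likelihood.

    Assumption: one common claim probability and one common gamma
    distribution for all observations (no covariates, no random effects).
    """
    incidence = 0.0  # Bernoulli part: was a claim made at all?
    magnitude = 0.0  # gamma part: size of the claim, given a claim
    for y in costs:
        if y == 0:
            incidence += math.log(1.0 - p_claim)
        else:
            incidence += math.log(p_claim)
            # log of the gamma density with the given shape and scale
            magnitude += ((shape - 1.0) * math.log(y) - y / scale
                          - shape * math.log(scale) - math.lgamma(shape))
    return incidence, magnitude

# Hypothetical claims costs for one period: three zeros, two positive claims.
costs = [0.0, 0.0, 120.0, 0.0, 45.5]
inc, mag = zero_augmented_gamma_loglik(costs, p_claim=0.4, shape=2.0, scale=50.0)
total_loglik = inc + mag
```

Because the incidence part depends only on `p_claim` and the magnitude part only on `shape` and `scale`, the two parts can be maximized separately, which is the sense in which the problem reduces to two independent fits.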
Abstract:
Binning and truncation of data are common in data analysis and machine learning. This paper addresses the problem of fitting mixture densities to multivariate binned and truncated data. The EM approach proposed by McLachlan and Jones (Biometrics, 44: 2, 571-578, 1988) for the univariate case is generalized to multivariate measurements. The multivariate solution requires the evaluation of multidimensional integrals over each bin at each iteration of the EM procedure. Naive implementation of the procedure can lead to computationally inefficient results. To reduce the computational cost a number of straightforward numerical techniques are proposed. Results on simulated data indicate that the proposed methods can achieve significant computational gains with no loss in the accuracy of the final parameter estimates. Furthermore, experimental results suggest that with a sufficient number of bins and data points it is possible to estimate the true underlying density almost as well as if the data were not binned. The paper concludes with a brief description of an application of this approach to diagnosis of iron deficiency anemia, in the context of binned and truncated bivariate measurements of volume and hemoglobin concentration from an individual's red blood cells.
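To make the role of the per-bin integrals concrete, the following is a univariate sketch of the quantity an EM iteration has to evaluate: the mass of a normal mixture in each bin, renormalized over the observed (truncated) range, and the resulting multinomial log-likelihood of the bin counts. It is not the paper's multivariate procedure; the function names, bin edges, counts, and mixture parameters are all hypothetical.

```python
import math

def normal_cdf(x, mean, sd):
    """Cumulative distribution function of a normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

def mixture_bin_probs(edges, weights, means, sds):
    """Mass of a normal mixture in each bin, renormalized over the
    truncated range [edges[0], edges[-1]]."""
    masses = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mass = sum(w * (normal_cdf(hi, mu, sd) - normal_cdf(lo, mu, sd))
                   for w, mu, sd in zip(weights, means, sds))
        masses.append(mass)
    total = sum(masses)  # mass inside the observed range
    return [m / total for m in masses]

def binned_loglik(counts, probs):
    """Multinomial log-likelihood of bin counts (constant term omitted)."""
    return sum(n * math.log(p) for n, p in zip(counts, probs) if n > 0)

# Hypothetical data: six bins on [-2, 4], nothing observed outside that range.
edges = [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0, 4.0]
counts = [5, 18, 27, 26, 17, 7]
probs = mixture_bin_probs(edges, weights=[0.5, 0.5], means=[0.0, 2.0], sds=[1.0, 1.0])
loglik = binned_loglik(counts, probs)
```

In one dimension each bin mass is a difference of two CDF values. In several dimensions the same quantity becomes an integral over a hyper-rectangle for every bin and every component, which is where the computational cost discussed in the abstract arises.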
Abstract:
Motivation: This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets.
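The gene-selection step can be sketched as a simple filter. This is not the EMMIX-GENE interface: it assumes the per-gene likelihood ratio statistic and the size of the smaller cluster have already been computed by fitting one- and two-component t mixtures, and the function name, threshold values, and gene labels are hypothetical.

```python
def select_genes(gene_stats, lrt_threshold, size_threshold):
    """Keep genes whose likelihood ratio statistic and smallest cluster
    size both reach their thresholds.

    gene_stats maps a gene name to (lrt_statistic, smallest_cluster_size).
    The result is ordered with the largest statistic first, which is a
    presentation choice made for this sketch.
    """
    kept = [(lrt, name)
            for name, (lrt, size) in gene_stats.items()
            if lrt >= lrt_threshold and size >= size_threshold]
    kept.sort(reverse=True)
    return [name for _, name in kept]

# Hypothetical precomputed statistics for four genes.
gene_stats = {
    "g1": (12.5, 8),
    "g2": (3.1, 20),   # weak evidence for two components
    "g3": (9.0, 2),    # two components, but one cluster is tiny
    "g4": (15.2, 6),
}
selected = select_genes(gene_stats, lrt_threshold=8.0, size_threshold=5)
```

The cluster-size threshold guards against genes whose apparent second component is driven by only a few tissues.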
Abstract:
Genetic research on risk of alcohol, tobacco or drug dependence must make allowance for the partial overlap of risk-factors for initiation of use, and risk-factors for dependence or other outcomes in users. Except in the extreme cases where genetic and environmental risk-factors for initiation and dependence overlap completely or are uncorrelated, there is no consensus about how best to estimate the magnitude of genetic or environmental correlations between Initiation and Dependence in twin and family data. We explore by computer simulation the biases to estimates of genetic and environmental parameters caused by model misspecification when Initiation can only be defined as a binary variable. For plausible simulated parameter values, the two-stage genetic models that we consider yield estimates of genetic and environmental variances for Dependence that, although biased, are not very discrepant from the true values. However, estimates of genetic (or environmental) correlations between Initiation and Dependence may be seriously biased, and may differ markedly under different two-stage models. Such estimates may have little credibility unless external data favor selection of one particular model. These problems can be avoided if Initiation can be assessed as a multiple-category variable (e.g. never versus early-onset versus later-onset user), with at least two categories measurable in users at risk for dependence. Under these conditions, and under certain distributional assumptions, recovery of simulated genetic and environmental correlations becomes possible. Illustrative application of the model to Australian twin data on smoking confirmed substantial heritability of smoking persistence (42%) with minimal overlap with genetic influences on initiation.
Abstract:
B3LYP/6-31G(d) calculations of structures, energies, and infrared spectra of several rearrangement products of (hetero)aromatic nitrenes and carbenes are reported. 3-Isoquinolylnitrene 36 ring closes to the azirine 37 prior to ring expansion to the potentially stable but unobserved seven-membered-ring carbodiimide 38 and diazacycloheptatrienylidene C-s-39S. A new, stable cycloheptatrienylidene, C-s-19S, is located on the naphthylcarbene energy surface. 4-Quinolylnitrene undergoes reaction via the azirine 50 in solution, but ring expansion to the stable seven-membered-ring ketenimine 47 under Ar matrix photolysis conditions. There is excellent agreement between calculated infrared spectra of 1,5-diazacyclohepta-1,2,4,6-tetraene 54 (obtained by photolysis of 4-pyridyl azide), 1-azacyclohepta-1,2,4,6-tetraene 5, 1-azacyclohepta-1,3,5,6-tetraene 55, and 1-azacyclohepta-1,3,4,6-tetraene 56 and the available experimental data.
Abstract:
Background. Nursing codes of ethics bind nurses to the role of patient advocate and compel them to take action when the rights or safety of a patient are jeopardized. Reporting misconduct is known as whistleblowing and studies indicate that there are personal and professional risks involved in blowing the whistle. Aim. The aim of this study was to explore the beliefs of nurses who wrestled with this ethical dilemma. Design. A descriptive survey design was used to examine the beliefs of nurses in Western Australia who reported misconduct (whistleblowers) and of those who did not report misconduct (nonwhistleblowers). Methods. The instrument listed statements from current ethical codes, statements from traditional views on nursing and statements of beliefs related to the participant's whistleblowing experience. Respondents were asked to rate each item on a five-point Likert format which ranged from strongly agree to strongly disagree. Data were analysed using a Pearson's correlation matrix and one-way ANOVA. To further explore the data, a factor analysis was run with varimax rotation. Results. Results indicated that whistleblowers supported the beliefs inherent in patient advocacy, while nonwhistleblowers retained a belief in the traditional role of nursing. Participants who reported misconduct (whistleblowers) supported the belief that nurses were primarily responsible to the patient and should protect a patient from incompetent or unethical people. Participants who did not report misconduct (nonwhistleblowers) supported the belief that nurses are obligated to follow a physician's order at all times and that nurses are equally responsible to the patient, the physician and the employer. Conclusion. These findings indicate that nurses may respond to ethical dilemmas based on different belief systems.
Abstract:
Observations of an insect's movement lead to theory on the insect's flight behaviour and the role of movement in the species' population dynamics. This theory leads to predictions of the way the population changes in time under different conditions. If a hypothesis on movement predicts a specific change in the population, then the hypothesis can be tested against observations of population change. Routine pest monitoring of agricultural crops provides a convenient source of data for studying movement into a region and among fields within a region. Examples of the use of statistical and computational methods for testing hypotheses with such data are presented. The types of questions that can be addressed with these methods and the limitations of pest monitoring data when used for this purpose are discussed. (C) 2002 Elsevier Science B.V. All rights reserved.
Abstract:
Accurate habitat mapping is critical to landscape ecological studies such as those required for developing and testing Montreal Process indicator 1.1e, fragmentation of forest types. This task poses a major challenge to remote sensing, especially in mixed-species, variable-age forests such as dry eucalypt forests of subtropical eastern Australia. In this paper, we apply an innovative approach that uses a small section of one-metre resolution airborne data to calibrate a moderate spatial resolution model (30 m resolution; scale 1:50 000) based on Landsat Thematic Mapper data to estimate canopy structural properties in St Marys State Forest, near Maryborough, south-eastern Queensland. The approach applies an image-processing model that assumes each image pixel is significantly larger than individual tree crowns and gaps to estimate crown-cover percentage, stem density and mean crown diameter. These parameters were classified into three discrete habitat classes to match the ecology of four exudivorous arboreal species (yellow-bellied glider Petaurus australis, sugar glider P. breviceps, squirrel glider P. norfolcensis, and feathertail glider Acrobates pygmaeus), and one folivorous arboreal marsupial, the greater glider Petauroides volans. These species were targeted due to their known ecological preference for old trees with hollows, and differences in their home range requirements. The overall mapping accuracy, visually assessed against transects (n = 93) interpreted from a digital orthophoto and validated in the field, was 79% (KHAT statistic = 0.72). The KHAT statistic serves as an indicator of the extent to which the percentage correct values of the error matrix are due to ‘true’ agreement versus ‘chance’ agreement. This means that we are able to reliably report on the effect of habitat loss on target species, especially those with a large home range size (e.g. yellow-bellied glider). However, the classified habitat map failed to accurately capture the spatial patterning (e.g. patch size and shape) of stands with a trace or sub-dominance of senescent trees. This outcome makes the reporting of the effects of habitat fragmentation more problematic, especially for species with a small home range size (e.g. feathertail glider). With further model refinement and validation, however, this moderate-resolution approach offers an important, cost-effective advancement in mapping the age of dry eucalypt forests in the region.
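For readers unfamiliar with the KHAT statistic, a minimal sketch of how it is computed from an error (confusion) matrix is given here. The formula is the standard kappa estimate, (observed agreement minus chance agreement) divided by (1 minus chance agreement). The function name and the 2 x 2 matrix are hypothetical and are not the study's data.

```python
def khat(error_matrix):
    """KHAT (kappa) estimate from a square error matrix in which rows are
    mapped classes and columns are reference classes."""
    n = sum(sum(row) for row in error_matrix)
    # Proportion of samples on the diagonal: overall accuracy.
    observed = sum(error_matrix[i][i] for i in range(len(error_matrix))) / n
    row_totals = [sum(row) for row in error_matrix]
    col_totals = [sum(col) for col in zip(*error_matrix)]
    # Agreement expected by chance, from the row and column marginals.
    chance = sum(r * c for r, c in zip(row_totals, col_totals)) / n ** 2
    return (observed - chance) / (1.0 - chance)

# Hypothetical two-class error matrix with 50 samples.
example = [[20, 5],
           [10, 15]]
kappa = khat(example)  # overall accuracy 0.70, chance agreement 0.50
```

In this example the overall accuracy is 70%, but half of the agreement would be expected by chance alone, so KHAT is noticeably lower than the raw accuracy. The same relationship explains why the abstract reports a KHAT value below its overall accuracy.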
Abstract:
Phase-equilibrium data and the liquidus for the system "MnO"-CaO-(Al2O3-SiO2) at manganese-rich alloy saturation have been determined in the temperature range from 1423 to 1723 K. The results are presented in the form of a pseudoternary section "MnO"-CaO-(Al2O3 + SiO2) with an Al2O3/SiO2 weight ratio of 0.41. The following primary phases are present in the range of conditions investigated: 3Al2O3.2SiO2; SiO2; MnO.Al2O3.2SiO2; (Mn,Ca)O.SiO2; 2(Mn,Ca)O.SiO2; MnO.Al2O3; (Mn,Ca)O; alpha-2CaO.SiO2; alpha'-2CaO.SiO2; 2CaO.Al2O3.SiO2; CaO.SiO2; and CaO.Al2O3.2SiO2. The presence of alumina in this system is shown to have a significant effect on the liquidus compared to the system "MnO"-CaO-SiO2, leading to the stabilization of the anorthite and gehlenite phases.