970 results for Data exploration
Abstract:
Motivation: This paper introduces the software EMMIX-GENE, which has been developed specifically for a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic, used in conjunction with a threshold on the size of a cluster, allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so mixtures of factor analyzers are used to effectively reduce the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes can be selected that reveal interesting clusterings of the tissues that are consistent either with the external classification of the tissues or with background and biological knowledge of these sets.
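The gene-selection step described in this abstract (score each gene by the likelihood ratio statistic for one versus two mixture components, then rank) can be illustrated with a minimal sketch. It is not EMMIX-GENE's implementation: it swaps the t mixtures for normal mixtures with a common variance, fits them with a plain EM loop, and omits the thresholds on the statistic and on cluster size. All function names and the toy data are hypothetical.

```python
import math

def normal_logpdf(x, mu, var):
    """Log density of a normal distribution with mean mu and variance var."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)

def log_terms(x, pi1, mu1, mu2, var):
    """Log of each weighted component density at x."""
    return (math.log(pi1) + normal_logpdf(x, mu1, var),
            math.log(1.0 - pi1) + normal_logpdf(x, mu2, var))

def loglik_one(xs):
    """Maximised log-likelihood of a single normal component."""
    n = len(xs)
    mu = sum(xs) / n
    var = max(sum((x - mu) ** 2 for x in xs) / n, 1e-6)
    return sum(normal_logpdf(x, mu, var) for x in xs)

def loglik_two(xs, n_iter=200):
    """Log-likelihood of a two-component normal mixture after EM.

    Assumption: both components share one variance, which keeps the
    likelihood bounded; EMMIX-GENE itself fits mixtures of t distributions.
    """
    xs = sorted(xs)
    n, half = len(xs), len(xs) // 2
    lo, hi = xs[:half], xs[half:]
    # Initialise by splitting the sorted values at the median.
    mu1, mu2, pi1 = sum(lo) / len(lo), sum(hi) / len(hi), 0.5
    var = max((sum((x - mu1) ** 2 for x in lo)
               + sum((x - mu2) ** 2 for x in hi)) / n, 1e-6)
    for _ in range(n_iter):
        # E-step: posterior probability that each value belongs to component 1.
        resp = []
        for x in xs:
            a, b = log_terms(x, pi1, mu1, mu2, var)
            m = max(a, b)
            ea, eb = math.exp(a - m), math.exp(b - m)
            resp.append(ea / (ea + eb))
        # M-step: update mixing proportion, means and common variance.
        n1 = min(max(sum(resp), 1e-6), n - 1e-6)
        mu1 = sum(r * x for r, x in zip(resp, xs)) / n1
        mu2 = sum((1.0 - r) * x for r, x in zip(resp, xs)) / (n - n1)
        var = max(sum(r * (x - mu1) ** 2 + (1.0 - r) * (x - mu2) ** 2
                      for r, x in zip(resp, xs)) / n, 1e-6)
        pi1 = n1 / n
    total = 0.0
    for x in xs:
        a, b = log_terms(x, pi1, mu1, mu2, var)
        m = max(a, b)
        total += m + math.log(math.exp(a - m) + math.exp(b - m))
    return total

def lrt_statistic(xs):
    """Likelihood ratio statistic for one versus two components."""
    return max(0.0, 2.0 * (loglik_two(xs) - loglik_one(xs)))

def rank_genes(expression):
    """Return (gene, statistic) pairs, largest statistic first."""
    scored = [(gene, lrt_statistic(values)) for gene, values in expression.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Toy data (made up): one clearly bimodal gene, one evenly spread gene.
toy = {
    "gene_unimodal": [-0.9, -0.7, -0.5, -0.3, -0.1, 0.1, 0.3, 0.5, 0.7, 0.9],
    "gene_bimodal": [0.0, 0.1, -0.1, 0.2, -0.2, 5.0, 5.1, 4.9, 5.2, 4.8],
}
ranked = rank_genes(toy)
```

Here the list is sorted with the largest statistic first for convenience; genes near the top show the strongest evidence of two groups among the tissues and would be the candidates retained for clustering.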
Abstract:
Genetic research on risk of alcohol, tobacco or drug dependence must make allowance for the partial overlap of risk-factors for initiation of use, and risk-factors for dependence or other outcomes in users. Except in the extreme cases where genetic and environmental risk-factors for initiation and dependence overlap completely or are uncorrelated, there is no consensus about how best to estimate the magnitude of genetic or environmental correlations between Initiation and Dependence in twin and family data. We explore by computer simulation the biases to estimates of genetic and environmental parameters caused by model misspecification when Initiation can only be defined as a binary variable. For plausible simulated parameter values, the two-stage genetic models that we consider yield estimates of genetic and environmental variances for Dependence that, although biased, are not very discrepant from the true values. However, estimates of genetic (or environmental) correlations between Initiation and Dependence may be seriously biased, and may differ markedly under different two-stage models. Such estimates may have little credibility unless external data favor selection of one particular model. These problems can be avoided if Initiation can be assessed as a multiple-category variable (e.g. never versus early-onset versus later-onset user), with at least two categories measurable in users at risk for dependence. Under these conditions, and under certain distributional assumptions, recovery of simulated genetic and environmental correlations becomes possible. Illustrative application of the model to Australian twin data on smoking confirmed substantial heritability of smoking persistence (42%) with minimal overlap with genetic influences on initiation.
Abstract:
Allozyme analysis was used to address the question of the source of the Australian populations of the monarch butterfly Danaus plexippus (L.). The study had three major aims: (1) To compare the levels of diversity of Australian and Hawaiian populations with potential source populations. (2) To determine whether eastern and western North American populations were sufficiently divergent for the Australian populations to be aligned to a source population. (3) To compare the differentiation among regions in Australia and North America to test the prediction of greater genetic structure in Australia, as a consequence of reduced migratory behaviour. The reverse was found, with F-ST values an order of magnitude lower in Australia than in North America. Predictably, Australian and Hawaiian populations had lower allelic diversity, but unexpectedly higher heterozygosity values than North American populations. It was not possible to assign the Australian populations to a definitive source, although the high levels of similarity of Australian populations to each other suggest a single colonization event. The possibility that the Australian populations have not been here long enough to reach equilibrium is discussed. (C) 2002 The Linnean Society of London, Biological Journal of the Linnean Society, 2002, 75, 437-452.
Abstract:
Observations of an insect's movement lead to theory on the insect's flight behaviour and the role of movement in the species' population dynamics. This theory leads to predictions of the way the population changes in time under different conditions. If a hypothesis on movement predicts a specific change in the population, then the hypothesis can be tested against observations of population change. Routine pest monitoring of agricultural crops provides a convenient source of data for studying movement into a region and among fields within a region. Examples of the use of statistical and computational methods for testing hypotheses with such data are presented. The types of questions that can be addressed with these methods and the limitations of pest monitoring data when used for this purpose are discussed. (C) 2002 Elsevier Science B.V. All rights reserved.
Abstract:
Aims To identify influences on the development of alcohol use disorders in a Thai population, particularly parental drinking and childhood environment. Design Case-control study. Setting A university hospital, a regional hospital and a community hospital in southern Thailand. Participants Ninety-one alcohol-dependents and 177 hazardous/harmful drinkers were recruited as cases and 144 non- or infrequent drinkers as controls. Measurements Data on parental drinking, family demographic characteristics, family activities, parental disciplinary practice, early religious life and conduct disorder were obtained using a structured interview questionnaire. The main outcome measure was the subject's classification as alcohol-dependent, hazardous/harmful drinker or non-/infrequent drinker. Findings A significant relationship was found between having a drinking father and the occurrence of hazardous/harmful drinking or alcohol dependence in the subjects. Childhood factors (conduct disorder and having been a temple boy, relative probability ratios, RPRs and 95% CIs: 6.39, 2.81-14.55 and 2.21, 1.19-4.08, respectively) also significantly predicted alcohol dependence, while perceived poverty and ethnic alienation were reported less frequently by hazardous/harmful drinkers and alcohol-dependents (RPRs and 95% CIs = 0.34, 0.19-0.62 and 0.59, 0.38-0.93, respectively) than by the controls. The relative probability ratio for the effect of the father's infrequent drinking on the son's alcohol dependence was 2.92 (95% CI = 1.42-6.02) and for the father's heavy or dependent drinking 2.84 (95% CI = 1.31-6.15). Conclusions Being exposed to a light-drinking father increases the risk of a son's alcohol use disorders exhibited either as hazardous/harmful or dependent drinking. However, exposure to a heavy- or dependent-drinking father is associated more uniquely with an increased risk of his son being alcohol-dependent. The extent to which this is seen in other cultures is worthy of exploration.
Abstract:
Phase-equilibrium data and the liquidus for the system "MnO"-CaO-(Al2O3-SiO2) at a manganese-rich alloy saturation have been determined in the temperature range from 1423 to 1723 K. The results are presented in the form of a pseudoternary section "MnO"-CaO-(Al2O3 + SiO2) with an Al2O3/SiO2 weight ratio of 0.41. The following primary phases are present in the range of conditions investigated: 3Al2O3.2SiO2; SiO2; MnO.Al2O3.2SiO2; (Mn,Ca)O.SiO2; 2(Mn,Ca)O.SiO2; MnO.Al2O3; (Mn,Ca)O; alpha-2CaO.SiO2; alpha'-2CaO.SiO2; 2CaO.Al2O3.SiO2; CaO.SiO2; and CaO.Al2O3.2SiO2. The presence of alumina in this system is shown to have a significant effect on the liquidus compared to the system "MnO"-CaO-SiO2, leading to the stabilization of the anorthite and gehlenite phases.
Mineral chemistry, whole-rock compositions, and petrogenesis of Leg 176 gabbros: Data and discussion
Abstract:
We report mineral chemistry, whole-rock major element compositions, and trace element analyses on Hole 735B samples drilled and selected during Leg 176. We discuss these data, together with Leg 176 shipboard data and Leg 118 sample data from the literature, in terms of primary igneous petrogenesis. Despite mineral compositional variation in a given sample, major constituent minerals in Hole 735B gabbroic rocks display good chemical equilibrium as shown by significant correlations among Mg# (= Mg/[Mg+Fe2+]) of olivine, clinopyroxene, and orthopyroxene and An (= Ca/[Ca+Na]) of plagioclase. This indicates that the mineral assemblages olivine + plagioclase in troctolite, plagioclase + clinopyroxene in gabbro, plagioclase + clinopyroxene + olivine in olivine gabbro, and plagioclase + clinopyroxene + olivine + orthopyroxene in gabbronorite, and so on, have all coprecipitated from their respective parental melts. Fe-Ti oxides (ilmenite and titanomagnetite), which are ubiquitous in most of these rocks, are not in chemical equilibrium with olivine, clinopyroxene, and plagioclase, but precipitated later at lower temperatures. Disseminated oxides in some samples may have precipitated from trapped Fe-Ti–rich melts. Oxides that concentrate along shear bands/zones may mark zones of melt coalescence/transport expelled from the cumulate sequence as a result of compaction or filter pressing. Bulk Hole 735B is of cumulate composition. The most primitive olivine, with Fo = 0.842, in Hole 735B suggests that the most primitive melt parental to Hole 735B lithologies must have Mg# ≤ 0.637, which is significantly less than Mg# = 0.714 of bulk Hole 735B.
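The step from the most primitive olivine (Fo = 0.842) to a parental-melt Mg# can be sketched with the standard olivine-melt Fe-Mg exchange relation. The abstract does not state which exchange coefficient was used; the value Kd = 0.33 below is an assumption, chosen because it gives a figure close to the quoted 0.637. The function names and the example cation amounts are hypothetical.

```python
def mg_number(mg, fe2):
    """Mg# = Mg / (Mg + Fe2+), from molar (cation) amounts."""
    return mg / (mg + fe2)

def melt_mg_number_from_olivine(fo, kd=0.33):
    """Mg# of a melt in Fe-Mg exchange equilibrium with olivine of forsterite content fo.

    Uses Kd = (Fe/Mg)_olivine / (Fe/Mg)_melt.
    kd = 0.33 is an assumed value; it is not given in the abstract.
    """
    fe_mg_olivine = (1.0 - fo) / fo      # Fe/Mg ratio in the olivine
    fe_mg_melt = fe_mg_olivine / kd      # Fe/Mg ratio in the coexisting melt
    return 1.0 / (1.0 + fe_mg_melt)

# Hypothetical cation amounts that reproduce Fo = 0.842.
fo_example = mg_number(84.2, 15.8)

melt_mg = melt_mg_number_from_olivine(0.842)
print(round(melt_mg, 3))  # 0.637 with the assumed kd = 0.33
```

A smaller assumed Kd (for example 0.30) would give a lower melt Mg#, which is in line with the abstract stating the result as an upper bound.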
Abstract:
Alcohol and tobacco consumption are closely correlated and published results on their association with breast cancer have not always allowed adequately for confounding between these exposures. Over 80% of the relevant information worldwide on alcohol and tobacco consumption and breast cancer was collated, checked and analysed centrally. Analyses included 58515 women with invasive breast cancer and 95067 controls from 53 studies. Relative risks of breast cancer were estimated, after stratifying by study, age, parity and, where appropriate, women's age when their first child was born and consumption of alcohol and tobacco. The average consumption of alcohol reported by controls from developed countries was 6.0 g per day, i.e. about half a unit/drink of alcohol per day, and was greater in ever-smokers than never-smokers (8.4 g per day and 5.0 g per day, respectively). Compared with women who reported drinking no alcohol, the relative risk of breast cancer was 1.32 (1.19 - 1.45, P < 0.00001) for an intake of 35 - 44 g per day alcohol, and 1.46 (1.33 - 1.61, P < 0.00001) for ≥ 45 g per day alcohol. The relative risk of breast cancer increased by 7.1% (95% CI 5.5-8.7%; P