923 resultados para data publishing


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Motivation: This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Genetic research on risk of alcohol, tobacco or drug dependence must make allowance for the partial overlap of risk-factors for initiation of use, and risk-factors for dependence or other outcomes in users. Except in the extreme cases where genetic and environmental risk-factors for initiation and dependence overlap completely or are uncorrelated, there is no consensus about how best to estimate the magnitude of genetic or environmental correlations between Initiation and Dependence in twin and family data. We explore by computer simulation the biases to estimates of genetic and environmental parameters caused by model misspecification when Initiation can only be defined as a binary variable. For plausible simulated parameter values, the two-stage genetic models that we consider yield estimates of genetic and environmental variances for Dependence that, although biased, are not very discrepant from the true values. However, estimates of genetic (or environmental) correlations between Initiation and Dependence may be seriously biased, and may differ markedly under different two-stage models. Such estimates may have little credibility unless external data favor selection of one particular model. These problems can be avoided if Initiation can be assessed as a multiple-category variable (e.g. never versus early-onset versus later onset user), with at least two categories measurable in users at risk for dependence. Under these conditions, under certain distributional assumptions., recovery of simulated genetic and environmental correlations becomes possible, Illustrative application of the model to Australian twin data on smoking confirmed substantial heritability of smoking persistence (42%) with minimal overlap with genetic influences on initiation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Members of the flightless genus Apterotheca Gebien (Coleoptera : Tenebrionidae) are mostly restricted to the high elevation rainforests of the Wet Tropics World Heritage Area of north-eastern Australia. This region has been recognised as an 'epicentre of evolution for low vagility animals'. The genus Apterotheca is the most diverse low vagility insect taxon known in this region. Forty-four species are included here in a revision of the genus. Three of these species were previously included in Apterotheca (A. antaroides (Pascoe), A. besti (Blackburn) and A. punctipennis Carter), four were previously included in other genera (A. australis (Kulzer), comb. nov. and A. punctifrons (Gebien), comb. nov. in Apterophenus Gebien, A. costata (Buck), comb. nov. in Caxtonana Buck and A. pustulosa (Carter), comb. nov. in Austropeus Carter) and 37 are new. The monotypic genera Austropeus Carter, syn. nov. and Caxtonana Buck, syn. nov. are proposed as new synonyms of Apterotheca. A lectotype for A. punctipennis and A. besti are designated. A key to the species of Apterotheca and a phylogenetic analysis based on the morphological features of adults, as well as a discussion of character evolution, are also included. Data presented here represent the framework for future studies on the determinants of the patterns of diversity found in the Wet Tropics.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We extend the earlier model of condensate growth of Davis et at (Davis M J, Gardiner C W and Ballagh R J 2000 Phys. Rev. A 62 063608) to include the effect of gravity in a magnetic trap. We carry out calculations to model the experiment reported by Kohl et al (Kohl M, Davis M J, Gardiner C W, Hansch T and Esslinger T 2001 Preprint cond-mat/0106642) who study the formation of a rubidium Bose-Einstein condensate for a range of evaporative cooling parameters. We find that, in the regime where our model is valid, the theoretical curves agree with all the experimental data with no fitting parameters. However, for the slowest cooling of the gas the theoretical curve deviates significantly from the experimental curves. It is possible that this discrepancy may be related to the formation of a quasicondensate.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Third-instar nymphs of the Australian assassin bug, Pristhesancus plagipennis (Walker), were released into cotton plots at two release densities and two crop growth stages to test their biological control potential. Release rates of 2 and 5 nymphs per metre row resulted in field populations of 0.51 and 1.38 nymphs per metre row, respectively, indicating that over 70% of nymphs died or emigrated within two weeks of release. Effective release rates of 1.38 nymphs per metre row reduced the number of Helicoverpa spp. larvae in the plots for a 7-week period. Crop yields were significantly greater in the plots to which P. plagipennis nymphs were released, with the effective release rate of 1.38 nymphs per metre row providing equivalent yields as insecticide treated plots. The data suggest that P. plagipennis has the capacity to reduce Helicoverpa spp. larvae densities in cotton crops when augmented through inundative release.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A total of 2071 individual prey items were identified from 34 active and 55 inactive wedge-tailed eagle nests following the 1995, 1996 and 1997 breeding seasons. Overall, the eagle's diet was comparable to that reported in other studies within semi-arid regions, with rabbits, reptiles and macropods accounting for 47.8, 22.6 and 13.7% of prey items, respectively. In spring 1996 rabbit calicivirus moved into the study area, resulting in a 44-78% reduction in rabbit abundance (Sharp et al. 2001). An index was developed to enable the time since death for individual prey items to be approximated and a historical perspective of the eagle's diet to be constructed. Rabbits constituted 56-69% of dietary items collected during the pre-rabbit calicivirus disease (RCD) samples, but declined to 31% and 16% in the two post-RCD samples. A reciprocal trend was observed for the proportion of reptiles in the diet, which increased from 8-21% of pre-RCD dietary items to 49-54% after the advent of RCD. Similarly, the proportion of avian prey items was observed to increase in the post-RCD samples. These data suggested that prey switching may have occurred following the RCD epizootic. However, a lack of data on the relative abundances of reptiles and birds prevented an understanding of the eagle's functional responses to be developed and definitive conclusions to be drawn. Nevertheless, the eagles were observed to modify their diet to the change in rabbit densities by consuming larger quantities of native prey species.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Partitioned Bremer support (PBS) is a valuable means of assessing congruence in combined data sets, but some aspects require clarification. When more than one equally parsimonious tree is found during the constrained search for trees lacking the node of interest, averaging PBS for each data set across these trees can conceal conflict, and PBS should ideally be examined for each constrained tree. Similarly, when multiple most parsimonious trees (MPTs) are generated during analysis of the combined data, PBS is usually calculated on the consensus tree. However, extra information can be obtained if PBS is calculated on each of the MPTs or even suboptimal trees. (C) 2002 The Willi Hennig Society. Published by Elsevier Science (USA). All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Observations of an insect's movement lead to theory on the insect's flight behaviour and the role of movement in the species' population dynamics. This theory leads to predictions of the way the population changes in time under different conditions. If a hypothesis on movement predicts a specific change in the population, then the hypothesis can be tested against observations of population change. Routine pest monitoring of agricultural crops provides a convenient source of data for studying movement into a region and among fields within a region. Examples of the use of statistical and computational methods for testing hypotheses with such data are presented. The types of questions that can be addressed with these methods and the limitations of pest monitoring data when used for this purpose are discussed. (C) 2002 Elsevier Science B.V. All rights reserved.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Fault detection and isolation (FDI) are important steps in the monitoring and supervision of industrial processes. Biological wastewater treatment (WWT) plants are difficult to model, and hence to monitor, because of the complexity of the biological reactions and because plant influent and disturbances are highly variable and/or unmeasured. Multivariate statistical models have been developed for a wide variety of situations over the past few decades, proving successful in many applications. In this paper we develop a new monitoring algorithm based on Principal Components Analysis (PCA). It can be seen equivalently as making Multiscale PCA (MSPCA) adaptive, or as a multiscale decomposition of adaptive PCA. Adaptive Multiscale PCA (AdMSPCA) exploits the changing multivariate relationships between variables at different time-scales. Adaptation of scale PCA models over time permits them to follow the evolution of the process, inputs or disturbances. Performance of AdMSPCA and adaptive PCA on a real WWT data set is compared and contrasted. The most significant difference observed was the ability of AdMSPCA to adapt to a much wider range of changes. This was mainly due to the flexibility afforded by allowing each scale model to adapt whenever it did not signal an abnormal event at that scale. Relative detection speeds were examined only summarily, but seemed to depend on the characteristics of the faults/disturbances. The results of the algorithms were similar for sudden changes, but AdMSPCA appeared more sensitive to slower changes.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We are witnessing an enormous growth in biological nitrogen removal from wastewater. It presents specific challenges beyond traditional COD (carbon) removal. A possibility for optimised process design is the use of biomass-supporting media. In this paper, attached growth processes (AGP) are evaluated using dynamic simulations. The advantages of these systems that were qualitatively described elsewhere, are validated quantitatively based on a simulation benchmark for activated sludge treatment systems. This simulation benchmark is extended with a biofilm model that allows for fast and accurate simulation of the conversion of different substrates in a biofilm. The economic feasibility of this system is evaluated using the data generated with the benchmark simulations. Capital savings due to volume reduction and reduced sludge production are weighed out against increased aeration costs. In this evaluation, effluent quality is integrated as well.