921 resultados para Heckman selection model
Resumo:
Motivation: This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets.
Resumo:
Theory predicts that in small isolated populations random genetic drift can lead to phenotypic divergence; however this prediction has rarely been tested quantitatively in natural populations. Here we utilize natural repeated island colonization events by members of the avian species complex, Zosterops lateralis, to assess whether or not genetic drift alone is an adequate explanation for the observed patterns of microevolutionary divergence in morphology. Morphological and molecular genetic characteristics of island and mainland populations are compared to test three predictions of drift theory: (1) that the pattern of morphological change is idiosyncratic to each island; (2) that there is concordance between morphological and neutral genetic shifts across island populations; and (3) for populations whose time of colonization is known, that the rate of morphological change is sufficiently slow to be accounted for solely by genetic drift. Our results are not consistent with these predictions. First, the direction of size shifts was consistently towards larger size, suggesting the action of a nonrandom process. Second, patterns of morphological divergence among recently colonized populations showed little concordance with divergence in neutral genetic characters. Third, rate tests of morphological change showed that effective population sizes were not small enough for random processes alone to account for the magnitude of microevolutionary change. Altogether, these three lines of evidence suggest that drift alone is not an adequate explanation of morphological differentiation in recently colonized island Zosterops and therefore we suggest that the observed microevolutionary changes are largely a result of directional natural selection.
Resumo:
Most sugarcane breeding programs in Australia use large unreplicated trials to evaluate clones in the early stages of selection. Commercial varieties that are replicated provide a method of local control of soil fertility. Although such methods may be useful in detecting broad trends in the field, variation often occurs on a much smaller scale. Methods such as spatial analysis adjust a plot for variability by using information from immediate neighbours. These techniques are routinely used to analyse cereal data in Australia and have resulted in increased accuracy and precision in the estimates of variety effects. In this paper, spatial analyses in which the variability is decomposed into local, natural, and extraneous components are applied to early selection trials in sugarcane. Interplot competition in cane yield and trend in sugar content were substantial in many of the trials and there were often large differences in the selections between the spatial and current method used by the Bureau of Sugar Experiment Stations. A joint modelling approach for tonnes sugar per hectare in response to fertility trends and interplot competition is recommended.
Resumo:
The thin-layer drying behaviour of bananas in a beat pump dehumidifier dryer was examined. Four pre-treatments (blanching, chilling, freezing and combined blanching and freezing) were applied to the bananas, which were dried at 50 degreesC with an air velocity of 3.1 m s(-1) and with the relative humidity of the inlet air of 10-35%. Three drying models, the simple model, the two-term exponential model and the Page model were examined. All models were evaluated using three statistical measures, correlation coefficient, root means square error, and mean absolute percent error. Moisture diffusivity was calculated based on the diffusion equation for an infinite cylindrical shape using the slope method. The rate of drying was higher for the pre-treatments involving freezing. The sample which was blanched only did not show any improvement in drying rate. In fact, a longer drying time resulted due to water absorption during blanching. There was no change in the rate for the chilled sample compared with the control. While all models closely fitted the drying data, the simple model showed greatest deviation from the experimental results. The two-term exponential model was found to be the best model for describing the drying curves of bananas because its parameters represent better the physical characteristics of the drying process. Moisture diffusivities of bananas were in the range 4.3-13.2 x 10(-10) m(2)s(-1). (C) 2002 Published by Elsevier Science Ltd.
Resumo:
The haploid NK model developed by Kauffman can be extended to diploid genomes and to incorporate gene-by-environment interaction effects in combination with epistasis. To provide the flexibility to include a wide range of forms of gene-by-environment interactions, a target population of environment types (TPE) is defined. The TPE consists of a set of E different environment types, each with their own frequency of occurrence. Each environment type conditions a different NK gene network structure or series of gene effects for a given network structure, providing the framework for defining gene-by-environment interactions. Thus, different NK models can be partially or completely nested within the E environment types of a TPE, giving rise to the E(NK) model for a biological system. With this model it is possible to examine how populations of genotypes evolve in context with properties of the environment that influence the contributions of genes to the fitness values of genotypes. We are using the E(NK) model to investigate how both epistasis and gene-by-environment interactions influence the genetic improvement of quantitative traits by plant breeding strategies applied to agricultural systems. © 2002 Wiley Periodicals, Inc.
Resumo:
Why does species richness vary so greatly across lineages? Traditionally, variation in species richness has been attributed to deterministic processes, although it is equally plausible that it may result from purely stochastic processes. We show that, based on the best available phylogenetic hypothesis, the pattern of cladogenesis among agamid lizards is not consistent with a random model, with some lineages having more species, and others fewer species, than expected by chance. We then use phylogenetic comparative methods to test six types of deterministic explanation for variation in species richness: body size, life history, sexual selection, ecological generalism, range size and latitude. Of eight variables we tested, only sexual size dimorphism and sexual dichromatism predicted species richness. Increases in species richness are associated with increases in sexual dichromatism but reductions in sexual size dimorphism. Consistent with recent comparative studies, we find no evidence that species richness is associated with small body size or high fecundity. Equally, we find no evidence that species richness covaries with ecological generalism, latitude or range size.
Resumo:
Purpose Achieving sustainability by rethinking products, services and strategies is an enormous challenge currently laid upon the economic sector, in which materials selection plays a critical role. In this context, the present work describes an environmental and economic life cycle analysis of a structural product, comparing two possible material alternatives. The product chosen is a storage tank, presently manufactured in stainless steel (SST) or in a glass fibre reinforced polymer composite (CST). The overall goal of the study is to identify environmental and economic strong and weak points related to the life cycle of the two material alternatives. The consequential win-win or trade-off situations will be identified via a Life Cycle Assessment/Life Cycle Costing (LCA/LCC) integrated model. Methods The LCA/LCC integrated model used consists in applying the LCA methodology to the product system, incorporating, in parallel, its results into the LCC study, namely those of the Life Cycle Inventory (LCI) and the Life Cycle Impact Assessment (LCIA). Results In both the SST and CST systems the most significant life cycle phase is the raw materials production, in which the most significant environmental burdens correspond to the Fossil fuels and Respiratory inorganics categories. The LCA/LCC integrated analysis shows that the CST has globally a preferable environmental and economic profile, as its impacts are lower than those of the SST in all life cycle stages. Both the internal and external costs are lower, the former resulting mainly from the composite material being significantly less expensive than stainless steel. This therefore represents a full win-win situation. As a consequence, the study clearly indicates that using a thermoset composite material to manufacture storage tanks is environmentally and economically desirable. However, it was also evident that the environmental performance of the CST could be improved by altering its End-of-Life stage. Conclusions The results of the present work provide enlightening insights into the synergies between the environmental and the economic performance of a structural product made with alternative materials. Further, they provide conclusive evidence to support the integration of environmental and economic life cycle analysis in the product development processes of a manufacturing company, or in some cases even in its procurement practices.
Resumo:
Toxic amides, such as acrylamide, are potentially harmful to Human health, so there is great interest in the fabrication of compact and economical devices to measure their concentration in food products and effluents. The CHEmically Modified Field Effect Transistor (CHEMFET) based onamorphous silicon technology is a candidate for this type of application due to its low fabrication cost. In this article we have used a semi-empirical modelof the device to predict its performance in a solution of interfering ions. The actual semiconductor unit of the sensor was fabricated by the PECVD technique in the top gate configuration. The CHEMFET simulation was performed based on the experimental current voltage curves of the semiconductor unit and on an empirical model of the polymeric membrane. Results presented here are useful for selection and design of CHEMFET membranes and provide an idea of the limitations of the amorphous CHEMFET device. In addition to the economical advantage, the small size of this prototype means it is appropriate for in situ operation and integration in a sensor array.
Resumo:
Copyright © 2013 Springer Netherlands.
Resumo:
Research on the problem of feature selection for clustering continues to develop. This is a challenging task, mainly due to the absence of class labels to guide the search for relevant features. Categorical feature selection for clustering has rarely been addressed in the literature, with most of the proposed approaches having focused on numerical data. In this work, we propose an approach to simultaneously cluster categorical data and select a subset of relevant features. Our approach is based on a modification of a finite mixture model (of multinomial distributions), where a set of latent variables indicate the relevance of each feature. To estimate the model parameters, we implement a variant of the expectation-maximization algorithm that simultaneously selects the subset of relevant features, using a minimum message length criterion. The proposed approach compares favourably with two baseline methods: a filter based on an entropy measure and a wrapper based on mutual information. The results obtained on synthetic data illustrate the ability of the proposed expectation-maximization method to recover ground truth. An application to real data, referred to official statistics, shows its usefulness.
Resumo:
Moving towards autonomous operation and management of increasingly complex open distributed real-time systems poses very significant challenges. This is particularly true when reaction to events must be done in a timely and predictable manner while guaranteeing Quality of Service (QoS) constraints imposed by users, the environment, or applications. In these scenarios, the system should be able to maintain a global feasible QoS level while allowing individual nodes to autonomously adapt under different constraints of resource availability and input quality. This paper shows how decentralised coordination of a group of autonomous interdependent nodes can emerge with little communication, based on the robust self-organising principles of feedback. Positive feedback is used to reinforce the selection of the new desired global service solution, while negative feedback discourages nodes to act in a greedy fashion as this adversely impacts on the provided service levels at neighbouring nodes. The proposed protocol is general enough to be used in a wide range of scenarios characterised by a high degree of openness and dynamism where coordination tasks need to be time dependent. As the reported results demonstrate, it requires less messages to be exchanged and it is faster to achieve a globally acceptable near-optimal solution than other available approaches.
Resumo:
The choice of an information systems is a critical factor of success in an organization's performance, since, by involving multiple decision-makers, with often conflicting objectives, several alternatives with aggressive marketing, makes it particularly complex by the scope of a consensus. The main objective of this work is to make the analysis and selection of a information system to support the school management, pedagogical and administrative components, using a multicriteria decision aid system – MMASSITI – Multicriteria Method- ology to Support the Selection of Information Systems/Information Technologies – integrates a multicriteria model that seeks to provide a systematic approach in the process of choice of Information Systems, able to produce sustained recommendations concerning the decision scope. Its application to a case study has identi- fied the relevant factors in the selection process of school educational and management information system and get a solution that allows the decision maker’ to compare the quality of the various alternatives.
Resumo:
Journal of Proteome Research (2006)5: 2720-2726
Resumo:
Dissertação para a obtenção de Grau de Mestre em Engenharia e Gestão Industrial
Resumo:
During the last years, several studies have been made aiming to assess the out-of-plane seismic response of unreinforced stone masonry structures. This fact led to the development of a wide variety of models and approaches, ranging from simple kinematic based analytical models up to complex numerical simulations. Nevertheless, for the sake of simplicity, the out-of-plane seismic response of a masonry wall pier may be obtained by means of a simple single-degree-of-freedom system while still providing good results. In fact, despite the assumptions associated with such a simple formulation, it is also true that the epistemic uncertainty inherent with the selection of appropriate input parameters in more complex models may render them truly ineffective. In this framework, this paper focuses on the study of the out-of-plane bending of unreinforced stone masonry walls (cantilevers) by proposing a simplified analytical approach based on the construction of a linearized four-branch model, which is used to characterize the linear and nonlinear response of such structural elements through an overturning moment-rotation relationship. The formulation of the four-branch model is presented and described in detail and the meaningful parameters used for its construction are obtained from a set of experimental laboratory tests performed on six full-scale unreinforced regular sacco stone masonry specimens. Moreover, a parametric analysis aiming to evaluate the effect of these parameters’ variation on the final configuration of the model is presented and critically discussed. Finally, the results obtained from the application of the developed four-branch model on real unreinforced regular sacco stone masonry walls are thoroughly analysed and the main conclusions obtained from its application are summarized.