953 resultados para binary mixture
Resumo:
When the data consist of certain attributes measured on the same set of items in different situations, they would be described as a three-mode three-way array. A mixture likelihood approach can be implemented to cluster the items (i.e., one of the modes) on the basis of both of the other modes simultaneously (i.e,, the attributes measured in different situations). In this paper, it is shown that this approach can be extended to handle three-mode three-way arrays where some of the data values are missing at random in the sense of Little and Rubin (1987). The methodology is illustrated by clustering the genotypes in a three-way soybean data set where various attributes were measured on genotypes grown in several environments.
Resumo:
Binning and truncation of data are common in data analysis and machine learning. This paper addresses the problem of fitting mixture densities to multivariate binned and truncated data. The EM approach proposed by McLachlan and Jones (Biometrics, 44: 2, 571-578, 1988) for the univariate case is generalized to multivariate measurements. The multivariate solution requires the evaluation of multidimensional integrals over each bin at each iteration of the EM procedure. Naive implementation of the procedure can lead to computationally inefficient results. To reduce the computational cost a number of straightforward numerical techniques are proposed. Results on simulated data indicate that the proposed methods can achieve significant computational gains with no loss in the accuracy of the final parameter estimates. Furthermore, experimental results suggest that with a sufficient number of bins and data points it is possible to estimate the true underlying density almost as well as if the data were not binned. The paper concludes with a brief description of an application of this approach to diagnosis of iron deficiency anemia, in the context of binned and truncated bivariate measurements of volume and hemoglobin concentration from an individual's red blood cells.
Resumo:
Motivation: This paper introduces the software EMMIX-GENE that has been developed for the specific purpose of a model-based approach to the clustering of microarray expression data, in particular, of tissue samples on a very large number of genes. The latter is a nonstandard problem in parametric cluster analysis because the dimension of the feature space (the number of genes) is typically much greater than the number of tissues. A feasible approach is provided by first selecting a subset of the genes relevant for the clustering of the tissue samples by fitting mixtures of t distributions to rank the genes in order of increasing size of the likelihood ratio statistic for the test of one versus two components in the mixture model. The imposition of a threshold on the likelihood ratio statistic used in conjunction with a threshold on the size of a cluster allows the selection of a relevant set of genes. However, even this reduced set of genes will usually be too large for a normal mixture model to be fitted directly to the tissues, and so the use of mixtures of factor analyzers is exploited to reduce effectively the dimension of the feature space of genes. Results: The usefulness of the EMMIX-GENE approach for the clustering of tissue samples is demonstrated on two well-known data sets on colon and leukaemia tissues. For both data sets, relevant subsets of the genes are able to be selected that reveal interesting clusterings of the tissues that are either consistent with the external classification of the tissues or with background and biological knowledge of these sets.
Resumo:
A heterogeneous modified vacancy solution model of adsorption developed is evaluated. The new model considers the adsorption process through a mass-action law and is thermodynamically consistent, while maintaining the simplicity in calculation of multicomponent adsorption equilibria, as in the original vacancy solution theory. It incorporates the adsorbent heterogeneity through a pore-width-related potential energy, represented by Steele's 10-4-3 potential expression. The experimental data of various hydrocarbons, CO2 and SO2 on four different activated carbons - Ajax, Norit, Nuxit, and BPL - at multiple temperatures over a wide range of pressures were studied by the heterogeneous modified VST model to obtain the isotherm parameters and micropore-size distribution of carbons. The model successfully correlates the single-component adsorption equilibrium data for all compounds studied on various carbons. The fitting results for the vacancy occupancy parameter are consistent with the pressure change on different carbons, and the effect of pore heterogeneity is important in adsorption at elevated pressure. It predicts binary adsorption equilibria better than the IAST scheme, reflecting the significance of molecular size nonideality.
Resumo:
A theoretical analysis of adsorption of mixtures containing subcritical adsorbates into activated carbon is presented as an extension to the theory for pure component developed earlier by Do and coworkers. In this theory, adsorption of mixtures in a pore follows a two-stage process, similar to that for pure component systems. The first stage is the layering of molecules on the surface, with the behavior of the second and higher layers resembling to that of vapor-liquid equilibrium. The second stage is the pore-filling process when the remaining pore width is small enough and the pressure is high enough to promote the pore filling with liquid mixture having the same compositions as those of the outermost molecular layer just prior to pore filling. The Kelvin equation is applied for mixtures, with the vapor pressure term being replaced by the equilibrium pressure at the compositions of the outermost layer of the liquid film. Simulations are detailed to illustrate the effects of various parameters, and the theory is tested with a number of experimental data on mixture. The predictions were very satisfactory.
Resumo:
We compare Bayesian methodology utilizing free-ware BUGS (Bayesian Inference Using Gibbs Sampling) with the traditional structural equation modelling approach based on another free-ware package, Mx. Dichotomous and ordinal (three category) twin data were simulated according to different additive genetic and common environment models for phenotypic variation. Practical issues are discussed in using Gibbs sampling as implemented by BUGS to fit subject-specific Bayesian generalized linear models, where the components of variation may be estimated directly. The simulation study (based on 2000 twin pairs) indicated that there is a consistent advantage in using the Bayesian method to detect a correct model under certain specifications of additive genetics and common environmental effects. For binary data, both methods had difficulty in detecting the correct model when the additive genetic effect was low (between 10 and 20%) or of moderate range (between 20 and 40%). Furthermore, neither method could adequately detect a correct model that included a modest common environmental effect (20%) even when the additive genetic effect was large (50%). Power was significantly improved with ordinal data for most scenarios, except for the case of low heritability under a true ACE model. We illustrate and compare both methods using data from 1239 twin pairs over the age of 50 years, who were registered with the Australian National Health and Medical Research Council Twin Registry (ATR) and presented symptoms associated with osteoarthritis occurring in joints of the hand.
Resumo:
We consider a mixture model approach to the regression analysis of competing-risks data. Attention is focused on inference concerning the effects of factors on both the probability of occurrence and the hazard rate conditional on each of the failure types. These two quantities are specified in the mixture model using the logistic model and the proportional hazards model, respectively. We propose a semi-parametric mixture method to estimate the logistic and regression coefficients jointly, whereby the component-baseline hazard functions are completely unspecified. Estimation is based on maximum likelihood on the basis of the full likelihood, implemented via an expectation-conditional maximization (ECM) algorithm. Simulation studies are performed to compare the performance of the proposed semi-parametric method with a fully parametric mixture approach. The results show that when the component-baseline hazard is monotonic increasing, the semi-parametric and fully parametric mixture approaches are comparable for mildly and moderately censored samples. When the component-baseline hazard is not monotonic increasing, the semi-parametric method consistently provides less biased estimates than a fully parametric approach and is comparable in efficiency in the estimation of the parameters for all levels of censoring. The methods are illustrated using a real data set of prostate cancer patients treated with different dosages of the drug diethylstilbestrol. Copyright (C) 2003 John Wiley Sons, Ltd.
Resumo:
The effect of pore-network connectivity on binary liquid-phase adsorption equilibria using the ideal adsorbed solution theory (LAST) was studied. The liquid-phase binary adsorption experiments used ethyl propionate, ethyl butyrate, and ethyl isovalerate as the adsorbates and commercial activated carbons Filtrasorb-400 and Norit ROW 0.8 as adsorbents. As the single-component isotherm, a modified Dubinin-Radushkevich equation was used. A comparison with experimental data shows that incorporating the connectivity of the pore network and considering percolation processes associated with different molecular sizes of the adsorptives in the mixture, as well as their different corresponding accessibility, can improve the prediction of binary adsorption equilibria using the LAST Selectivity of adsorption for the larger molecule in binary systems increases with an increase in the pore-network coordination number, as well with an increase in the mean pore width and in the spread of the pore-size distribution.
Resumo:
The use of cover crops is important for the agricultural crop and soil management in order to improve the system and, consequently, to increase yield. Therefore, the present study analyzed the effect of crop residues of black oat (Avena strigosa Schreb.) (BO) and a cocktail (CO) of BO, forage turnip (Raphanus sativus L.) (FT) and common vetch (Vicia sativa L.) (V) on the emergence speed index (ESI), seedling emergence speed (SES) plant height and soybean yield in different intervals between cover crop desiccation with glyphosate 480 (3 L ha-1) and BRS 232 cultivar sowing. Plots of 5 x 2.5 m with 1 m of border received four treatments with BO cover crops and four with CO as well as a control for each cover crop, at random, with five replications. The plots were desiccated in intervals of 1, 10, 20 and 30 days before soybean seeding. The harvest was manual while yield was adjusted to 13% of moisture content. The experimental design was completely randomized with splitplots and means compared by the Scott and Knott test at 5% of significance. The results showed that CO of cover crops can be recommended for soybean to obtain a more vigorous seedling emergence, from 10 days after cover crop desiccation.
Resumo:
This paper presents the results from an experimental study of the technical viability of two mixture designs for self-consolidating concrete (SCC) proposed by two Portuguese researchers in a previous work. The objective was to find the best method to provide the required characteristics of SCC in fresh and hardened states without having to experiment with a large number of mixtures. Five SCC mixtures, each with a volume of 25 L (6.61 gal.) were prepared using a forced mixer with a vertical axis for each of three compressive strength targets: 40, 55, and 70 MPa (5.80, 7.98, and 10.15 ksi). The mixtures' fresh state properties of fluidity, segregation resistance ability, and bleeding and blockage tendency, and their hardened state property of compressive strength were compared. For this study, the following tests were performed. slump-flow, V-funnel, L-box, box, and compressive strength. The results of this study made it possible to identify the most influential factors in the design of the SCC mixtures.
Resumo:
We investigate the phase behaviour of 2D mixtures of bi-functional and three-functional patchy particles and 3D mixtures of bi-functional and tetra-functional patchy particles by means of Monte Carlo simulations and Wertheim theory. We start by computing the critical points of the pure systems and then we investigate how the critical parameters change upon lowering the temperature. We extend the successive umbrella sampling method to mixtures to make it possible to extract information about the phase behaviour of the system at a fixed temperature for the whole range of densities and compositions of interest. (C) 2013 AIP Publishing LLC.