329 resultados para mixture model
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
In a sample of censored survival times, the presence of an immune proportion of individuals who are not subject to death, failure or relapse, may be indicated by a relatively high number of individuals with large censored survival times. In this paper the generalized log-gamma model is modified for the possibility that long-term survivors may be present in the data. The model attempts to separately estimate the effects of covariates on the surviving fraction, that is, the proportion of the population for which the event never occurs. The logistic function is used for the regression model of the surviving fraction. Inference for the model parameters is considered via maximum likelihood. Some influence methods, such as the local influence and total local influence of an individual are derived, analyzed and discussed. Finally, a data set from the medical area is analyzed under the log-gamma generalized mixture model. A residual analysis is performed in order to select an appropriate model.
Resumo:
Gene clustering is a useful exploratory technique to group together genes with similar expression levels under distinct cell cycle phases or distinct conditions. It helps the biologist to identify potentially meaningful relationships between genes. In this study, we propose a clustering method based on multivariate normal mixture models, where the number of clusters is predicted via sequential hypothesis tests: at each step, the method considers a mixture model of m components (m = 2 in the first step) and tests if in fact it should be m - 1. If the hypothesis is rejected, m is increased and a new test is carried out. The method continues (increasing m) until the hypothesis is accepted. The theoretical core of the method is the full Bayesian significance test, an intuitive Bayesian approach, which needs no model complexity penalization nor positive probabilities for sharp hypotheses. Numerical experiments were based on a cDNA microarray dataset consisting of expression levels of 205 genes belonging to four functional categories, for 10 distinct strains of Saccharomyces cerevisiae. To analyze the method's sensitivity to data dimension, we performed principal components analysis on the original dataset and predicted the number of classes using 2 to 10 principal components. Compared to Mclust (model-based clustering), our method shows more consistent results.
Resumo:
Today several different unsupervised classification algorithms are commonly used to cluster similar patterns in a data set based only on its statistical properties. Specially in image data applications, self-organizing methods for unsupervised classification have been successfully applied for clustering pixels or group of pixels in order to perform segmentation tasks. The first important contribution of this paper refers to the development of a self-organizing method for data classification, named Enhanced Independent Component Analysis Mixture Model (EICAMM), which was built by proposing some modifications in the Independent Component Analysis Mixture Model (ICAMM). Such improvements were proposed by considering some of the model limitations as well as by analyzing how it should be improved in order to become more efficient. Moreover, a pre-processing methodology was also proposed, which is based on combining the Sparse Code Shrinkage (SCS) for image denoising and the Sobel edge detector. In the experiments of this work, the EICAMM and other self-organizing models were applied for segmenting images in their original and pre-processed versions. A comparative analysis showed satisfactory and competitive image segmentation results obtained by the proposals presented herein. (C) 2008 Published by Elsevier B.V.
Resumo:
Historically, the cure rate model has been used for modeling time-to-event data within which a significant proportion of patients are assumed to be cured of illnesses, including breast cancer, non-Hodgkin lymphoma, leukemia, prostate cancer, melanoma, and head and neck cancer. Perhaps the most popular type of cure rate model is the mixture model introduced by Berkson and Gage [1]. In this model, it is assumed that a certain proportion of the patients are cured, in the sense that they do not present the event of interest during a long period of time and can found to be immune to the cause of failure under study. In this paper, we propose a general hazard model which accommodates comprehensive families of cure rate models as particular cases, including the model proposed by Berkson and Gage. The maximum-likelihood-estimation procedure is discussed. A simulation study analyzes the coverage probabilities of the asymptotic confidence intervals for the parameters. A real data set on children exposed to HIV by vertical transmission illustrates the methodology.
Resumo:
This paper proposes a novel computer vision approach that processes video sequences of people walking and then recognises those people by their gait. Human motion carries different information that can be analysed in various ways. The skeleton carries motion information about human joints, and the silhouette carries information about boundary motion of the human body. Moreover, binary and gray-level images contain different information about human movements. This work proposes to recover these different kinds of information to interpret the global motion of the human body based on four different segmented image models, using a fusion model to improve classification. Our proposed method considers the set of the segmented frames of each individual as a distinct class and each frame as an object of this class. The methodology applies background extraction using the Gaussian Mixture Model (GMM), a scale reduction based on the Wavelet Transform (WT) and feature extraction by Principal Component Analysis (PCA). We propose four new schemas for motion information capture: the Silhouette-Gray-Wavelet model (SGW) captures motion based on grey level variations; the Silhouette-Binary-Wavelet model (SBW) captures motion based on binary information; the Silhouette-Edge-Binary model (SEW) captures motion based on edge information and the Silhouette Skeleton Wavelet model (SSW) captures motion based on skeleton movement. The classification rates obtained separately from these four different models are then merged using a new proposed fusion technique. The results suggest excellent performance in terms of recognising people by their gait.
Resumo:
The inverse Weibull distribution has the ability to model failure rates which are quite common in reliability and biological studies. A three-parameter generalized inverse Weibull distribution with decreasing and unimodal failure rate is introduced and studied. We provide a comprehensive treatment of the mathematical properties of the new distribution including expressions for the moment generating function and the rth generalized moment. The mixture model of two generalized inverse Weibull distributions is investigated. The identifiability property of the mixture model is demonstrated. For the first time, we propose a location-scale regression model based on the log-generalized inverse Weibull distribution for modeling lifetime data. In addition, we develop some diagnostic tools for sensitivity analysis. Two applications of real data are given to illustrate the potentiality of the proposed regression model.
Resumo:
P>In the context of either Bayesian or classical sensitivity analyses of over-parametrized models for incomplete categorical data, it is well known that prior-dependence on posterior inferences of nonidentifiable parameters or that too parsimonious over-parametrized models may lead to erroneous conclusions. Nevertheless, some authors either pay no attention to which parameters are nonidentifiable or do not appropriately account for possible prior-dependence. We review the literature on this topic and consider simple examples to emphasize that in both inferential frameworks, the subjective components can influence results in nontrivial ways, irrespectively of the sample size. Specifically, we show that prior distributions commonly regarded as slightly informative or noninformative may actually be too informative for nonidentifiable parameters, and that the choice of over-parametrized models may drastically impact the results, suggesting that a careful examination of their effects should be considered before drawing conclusions.Resume Que ce soit dans un cadre Bayesien ou classique, il est bien connu que la surparametrisation, dans les modeles pour donnees categorielles incompletes, peut conduire a des conclusions erronees. Cependant, certains auteurs persistent a negliger les problemes lies a la presence de parametres non identifies. Nous passons en revue la litterature dans ce domaine, et considerons quelques exemples surparametres simples dans lesquels les elements subjectifs influencent de facon non negligeable les resultats, independamment de la taille des echantillons. Plus precisement, nous montrons comment des a priori consideres comme peu ou non-informatifs peuvent se reveler extremement informatifs en ce qui concerne les parametres non identifies, et que le recours a des modeles surparametres peut avoir sur les conclusions finales un impact considerable. Ceci suggere un examen tres attentif de l`impact potentiel des a priori.
Resumo:
We investigate the phase diagram of a discrete version of the Maier-Saupe model with the inclusion of additional degrees of freedom to mimic a distribution of rodlike and disklike molecules. Solutions of this problem on a Bethe lattice come from the analysis of the fixed points of a set of nonlinear recursion relations. Besides the fixed points associated with isotropic and uniaxial nematic structures, there is also a fixed point associated with a biaxial nematic structure. Due to the existence of large overlaps of the stability regions, we resorted to a scheme to calculate the free energy of these structures deep in the interior of a large Cayley tree. Both thermodynamic and dynamic-stability analyses rule out the presence of a biaxial phase, in qualitative agreement with previous mean-field results.
Resumo:
This study was undertaken to test whether the structural remodelling of pulmonary parenchyma can be sequentially altered in a model and method that demonstrate the progression of the disease and result in remodelling within the lungs that is typical of idiopathic pulmonary fibrosis. Three groups of mice were studied: (i) animals that received 3-5-di-tert-butyl-4-hydroxytoluene (BHT) and were killed after 2 weeks (early BHT = 9); (ii) animals that received BHT and were killed after 4 weeks (late BHT = 11); (iii) animals that received corn oil solution (control = 10). The mice were placed in a ventilated Plexiglas chamber with a mixture of pure humidified oxygen and compressed air. Lung histological sections underwent haematoxylin-eosin, immunohistochemistry (epithelial, endothelial and immune cells) and specific staining (collagen/elastic fibres) methods for morphometric analysis. When compared with the control group, early BHT and late BHT groups showed significant decrease of type II pneumocytes, lower vascular density in both and higher endothelial activity. CD4 was increased in late BHT compared with early and control groups, while CD8, macrophage and neutrophil cells were more prominent only in early BHT. The collagenous fibre density were significantly higher only in late BHT, whereas elastic fibre content in late BHT was lower than that in control group. We conclude that the BHT experimental model is pathologically very similar to human usual interstitial pneumonia. This feature is important in the identification of animal models of idiopathic pulmonary fibrosis that can accurately reflect the pathogenesis and progression of the human disease.
Resumo:
Strategies to promote bone repair have included exposure of cells to growth factor (GF) preparations from blood that generally include proteins as part of a complex mixture. This study aimed to evaluate the effects of such a mixture on different parameters of the development of the osteogenic phenotype in vitro. Osteoblastic cells were obtained by enzymatic digestion of human alveolar bone and cultured under standard osteogenic conditions until subconfluence. They were subcultured on Thermanox coverslips up to 14 days. Treated cultures were exposed during the first 7 days to osteogenic medium supplemented with a GFs + proteins mixture containing the major components found in platelet extracts [plate I et-derived growth factor-BB, transforming growth factor (TGF)-beta 1, TGF-beta 2, albumin, fibronectin, and thrombospondin] and to osteogenic medium alone thereafter. Control cultures were exposed only to the osteogenic medium. Treated cultures exhibited a significantly higher number of adherent cells from day 4 onward and of cycling cells at days 1 and 4, weak alkaline phosphatase (ALP) labeling, and significantly decreased levels of ALP activity and mRNA expression. At day 14, no Alizarin red-stained nodular areas were detected in cultures treated with GFs + proteins. Results were confirmed in the rat calvaria-derived osteogenic cell culture model. The addition of bone morphogenetic protein 7 or growth and differentiation factor 5 to treated cultures upregulated Runx2 and ALP mRNA expression, but surprisingly, ALP activity was not restored. These results showed that a mixture of GFs + proteins affects the development of the osteogenic phenotype both in human and rat cultures, leading to an increase in the number of cells, but expressed a less differentiated state.
Resumo:
The deterpenation of bergamot essential oil can be performed by liquid liquid extraction using hydrous ethanol as the solvent. A ternary mixture composed of 1-methyl-4-prop-1-en-2-yl-cydohexene (limonene), 3,7-dimethylocta-1,6-dien-3-yl-acetate (linalyl acetate), and 3,7-dimethylocta-1,6-dien-3-ol (linalool), three major compounds commonly found in bergamot oil, was used to simulate this essential oil. Liquid liquid equilibrium data were experimentally determined for systems containing essential oil compounds, ethanol, and water at 298.2 K and are reported in this paper. The experimental data were correlated using the NRTL and UNIQUAC models, and the mean deviations between calculated and experimental data were lower than 0.0062 in all systems, indicating the good descriptive quality of the molecular models. To verify the effect of the water mass fraction in the solvent and the linalool mass fraction in the terpene phase on the distribution coefficients of the essential oil compounds, nonlinear regression analyses were performed, obtaining mathematical models with correlation coefficient values higher than 0.99. The results show that as the water content in the solvent phase increased, the kappa value decreased, regardless of the type of compound studied. Conversely, as the linalool content increased, the distribution coefficients of hydrocarbon terpene and ester also increased. However, the linalool distribution coefficient values were negatively affected when the terpene alcohol content increased in the terpene phase.
Resumo:
In this paper we introduce a parametric model for handling lifetime data where an early lifetime can be related to the infant-mortality failure or to the wear processes but we do not know which risk is responsible for the failure. The maximum likelihood approach and the sampling-based approach are used to get the inferences of interest. Some special cases of the proposed model are studied via Monte Carlo methods for size and power of hypothesis tests. To illustrate the proposed methodology, we introduce an example consisting of a real data set.
Resumo:
Lyotropic nematics consisting of surfactant-cosurfactant water solutions may present a biaxial phase or direct U(+) <-> U(-) transitions, in different regions of the temperature-relative concentration phase diagram, for different systems and compositions. We propose that these may be related to changes of uniaxial micellar form, which may occur either smoothly or abruptly. Smooth change of cylinder-like into disc-like shapes requires a distribution of Maier-Saupe interaction constants and we consider two limiting cases for the distribution of forms: a single Gaussian and a double Gaussian. Alternatively, an abrupt change of form is described by a discontinuous distribution of interaction constants. Our results show that the dispersive distributions yield a biaxial phase, while an abrupt change of shape leads to coexistence of uniaxial phases. Fitting the theory to the experiment for the ternary system KL/decanol/D2O leads to transition lines in very good agreement with experimental results. In order to rationalise the results of the comparison, we analyse temperature and concentration form dependence, which connects micellar and experimental macroscopic parameters. Physically consistent variations of micellar asymmetry, amphiphile partitioning and volume are obtained. To the best of the authors` knowledge, this is the first truly statistical microscopic approach that is able to model experimentally observed lyotropic biaxial nematic phases.
A robust Bayesian approach to null intercept measurement error model with application to dental data
Resumo:
Measurement error models often arise in epidemiological and clinical research. Usually, in this set up it is assumed that the latent variable has a normal distribution. However, the normality assumption may not be always correct. Skew-normal/independent distribution is a class of asymmetric thick-tailed distributions which includes the Skew-normal distribution as a special case. In this paper, we explore the use of skew-normal/independent distribution as a robust alternative to null intercept measurement error model under a Bayesian paradigm. We assume that the random errors and the unobserved value of the covariate (latent variable) follows jointly a skew-normal/independent distribution, providing an appealing robust alternative to the routine use of symmetric normal distribution in this type of model. Specific distributions examined include univariate and multivariate versions of the skew-normal distribution, the skew-t distributions, the skew-slash distributions and the skew contaminated normal distributions. The methods developed is illustrated using a real data set from a dental clinical trial. (C) 2008 Elsevier B.V. All rights reserved.
Resumo:
OBJECTIVES: The complexity and heterogeneity of human bone, as well as ethical issues, frequently hinder the development of clinical trials. The purpose of this in vitro study was to determine the modulus of elasticity of a polyurethane isotropic experimental model via tension tests, comparing the results to those reported in the literature for mandibular bone, in order to validate the use of such a model in lieu of mandibular bone in biomechanical studies. MATERIAL AND METHODS: Forty-five polyurethane test specimens were divided into 3 groups of 15 specimens each, according to the ratio (A/B) of polyurethane reagents (PU-1: 1/0.5, PU-2: 1/1, PU-3: 1/1.5). RESULTS: Tension tests were performed in each experimental group and the modulus of elasticity values found were 192.98 MPa (SD=57.20) for PU-1, 347.90 MPa (SD=109.54) for PU-2 and 304.64 MPa (SD=25.48) for PU-3. CONCLUSION: The concentration of choice for building the experimental model was 1/1.