955 resultados para Expectation-conditional Maximization (ecm)


Relevância:

20.00% 20.00%

Publicador:

Resumo:

The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade’s worth of statistical methodology development. The recently developed RNA sequencing (RNA-seq) technology has generated much excitement in part due to claims of reduced variability in comparison to microarrays. However, we show RNA-seq data demonstrates unwanted and obscuring variability similar to what was first observed in microarrays. In particular, we find GC-content has a strong sample specific effect on gene expression measurements that, if left uncorrected, leads to false positives in downstream results. We also report on commonly observed data distortions that demonstrate the need for data normalization. Here we describe statistical methodology that improves precision by 42% without loss of accuracy. Our resulting conditional quantile normalization (CQN) algorithm combines robust generalized regression to remove systematic bias introduced by deterministic features such as GC-content, and quantile normalization to correct for global distortions.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Latent class analysis (LCA) and latent class regression (LCR) are widely used for modeling multivariate categorical outcomes in social sciences and biomedical studies. Standard analyses assume data of different respondents to be mutually independent, excluding application of the methods to familial and other designs in which participants are clustered. In this paper, we develop multilevel latent class model, in which subpopulation mixing probabilities are treated as random effects that vary among clusters according to a common Dirichlet distribution. We apply the Expectation-Maximization (EM) algorithm for model fitting by maximum likelihood (ML). This approach works well, but is computationally intensive when either the number of classes or the cluster size is large. We propose a maximum pairwise likelihood (MPL) approach via a modified EM algorithm for this case. We also show that a simple latent class analysis, combined with robust standard errors, provides another consistent, robust, but less efficient inferential procedure. Simulation studies suggest that the three methods work well in finite samples, and that the MPL estimates often enjoy comparable precision as the ML estimates. We apply our methods to the analysis of comorbid symptoms in the Obsessive Compulsive Disorder study. Our models' random effects structure has more straightforward interpretation than those of competing methods, thus should usefully augment tools available for latent class analysis of multilevel data.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Mouse cell lines were immortalized by introduction of specific immortalizing genes. Embryonic and adult animals and an embryonal stem cell line were used as a source of primary cells. The immortalizing genes were either introduced by DNA transfection or by ecotropic retrovirus transduction. Fibroblasts were obtained by expression of SV40 virus large T antigen (TAg). The properties of the resulting fibroblast cell lines were reproducible, independent of the donor mouse strains employed and the cells showed no transformed properties in vitro and did not form tumors in vivo. Endothelial cell lines were generated by Polyoma virus middle T antigen expression in primary embryonal cells. These cell lines consistently expressed relevant endothelial cell surface markers. Since the expression of the immortalizing genes was expected to strongly influence the cellular characteristics fibroblastoid cells were reversibly immortalized by using a vector that allows conditional expression of the TAg. Under inducing conditions, these cells exhibited properties that were highly similar to the properties of constitutively immortalized cells. In the absence of TAg expression, cell proliferation stops. Cell growth is resumed when TAg expression is restored. Gene expression profiling indicates that TAg influences the expression levels of more than 1000 genes that are involved in diverse cellular processes. The data show that conditionally immortalized cell lines have several advantageous properties over constitutively immortalized cells.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

A key energy-saving adaptation to chronic hypoxia that enables cardiomyocytes to withstand severe ischemic insults is hibernation, i.e., a reversible arrest of contractile function. Whereas hibernating cardiomyocytes represent the critical reserve of dysfunctional cells that can be potentially rescued, a lack of a suitable animal model has hampered insights on this medically important condition. We developed a transgenic mouse system for conditional induction of long-term hibernation and a system to rescue hibernating cardiomyocytes at will. Via myocardium-specific induction (and, in turn, deinduction) of a VEGF-sequestering soluble receptor, we show that VEGF is indispensable for adjusting the coronary vasculature to match increased oxygen consumption and exploit this finding to generate a hypoperfused heart. Importantly, ensuing ischemia is tunable to a level at which large cohorts of cardiomyocytes are driven to enter a hibernation mode, without cardiac cell death. Relieving the VEGF blockade even months later resulted in rapid revascularization and full recovery of contractile function. Furthermore, we show that left ventricular remodeling associated with hibernation is also fully reversible. The unique opportunity to uncouple hibernation from other ischemic heart phenotypes (e.g., infarction) was used to determine the genetic program of hibernation; uncovering hypoxia-inducible factor target genes associated with metabolic adjustments and induced expression of several cardioprotective genes. Autophagy, specifically self-digestion of mitochondria, was identified as a key prosurvival mechanism in hibernating cardiomyocytes. This system may lend itself for examining the potential utility of treatments to rescue dysfunctional cardiomyocytes and reverse maladaptive remodeling.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Similarity measure is one of the main factors that affect the accuracy of intensity-based 2D/3D registration of X-ray fluoroscopy to CT images. Information theory has been used to derive similarity measure for image registration leading to the introduction of mutual information, an accurate similarity measure for multi-modal and mono-modal image registration tasks. However, it is known that the standard mutual information measure only takes intensity values into account without considering spatial information and its robustness is questionable. Previous attempt to incorporate spatial information into mutual information either requires computing the entropy of higher dimensional probability distributions, or is not robust to outliers. In this paper, we show how to incorporate spatial information into mutual information without suffering from these problems. Using a variational approximation derived from the Kullback-Leibler bound, spatial information can be effectively incorporated into mutual information via energy minimization. The resulting similarity measure has a least-squares form and can be effectively minimized by a multi-resolution Levenberg-Marquardt optimizer. Experimental results are presented on datasets of two applications: (a) intra-operative patient pose estimation from a few (e.g. 2) calibrated fluoroscopic images, and (b) post-operative cup alignment estimation from single X-ray radiograph with gonadal shielding.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The degree of polarization of a refected field from active laser illumination can be used for object identifcation and classifcation. The goal of this study is to investigate methods for estimating the degree of polarization for refected fields with active laser illumination, which involves the measurement and processing of two orthogonal field components (complex amplitudes), two orthogonal intensity components, and the total field intensity. We propose to replace interferometric optical apparatuses with a computational approach for estimating the degree of polarization from two orthogonal intensity data and total intensity data. Cramer-Rao bounds for each of the three sensing modalities with various noise models are computed. Algebraic estimators and maximum-likelihood (ML) estimators are proposed. Active-set algorithm and expectation-maximization (EM) algorithm are used to compute ML estimates. The performances of the estimators are compared with each other and with their corresponding Cramer-Rao bounds. Estimators for four-channel polarimeter (intensity interferometer) sensing have a better performance than orthogonal intensities estimators and total intensity estimators. Processing the four intensities data from polarimeter, however, requires complicated optical devices, alignment, and four CCD detectors. It only requires one or two detectors and a computer to process orthogonal intensities data and total intensity data, and the bounds and estimator performances demonstrate that reasonable estimates may still be obtained from orthogonal intensities or total intensity data. Computational sensing is a promising way to estimate the degree of polarization.

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Questionnaire data may contain missing values because certain questions do not apply to all respondents. For instance, questions addressing particular attributes of a symptom, such as frequency, triggers or seasonality, are only applicable to those who have experienced the symptom, while for those who have not, responses to these items will be missing. This missing information does not fall into the category 'missing by design', rather the features of interest do not exist and cannot be measured regardless of survey design. Analysis of responses to such conditional items is therefore typically restricted to the subpopulation in which they apply. This article is concerned with joint multivariate modelling of responses to both unconditional and conditional items without restricting the analysis to this subpopulation. Such an approach is of interest when the distributions of both types of responses are thought to be determined by common parameters affecting the whole population. By integrating the conditional item structure into the model, inference can be based both on unconditional data from the entire population and on conditional data from subjects for whom they exist. This approach opens new possibilities for multivariate analysis of such data. We apply this approach to latent class modelling and provide an example using data on respiratory symptoms (wheeze and cough) in children. Conditional data structures such as that considered here are common in medical research settings and, although our focus is on latent class models, the approach can be applied to other multivariate models.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper presents a comparison of principal component (PC) regression and regularized expectation maximization (RegEM) to reconstruct European summer and winter surface air temperature over the past millennium. Reconstruction is performed within a surrogate climate using the National Center for Atmospheric Research (NCAR) Climate System Model (CSM) 1.4 and the climate model ECHO-G 4, assuming different white and red noise scenarios to define the distortion of pseudoproxy series. We show how sensitivity tests lead to valuable “a priori” information that provides a basis for improving real world proxy reconstructions. Our results emphasize the need to carefully test and evaluate reconstruction techniques with respect to the temporal resolution and the spatial scale they are applied to. Furthermore, we demonstrate that uncertainties inherent to the predictand and predictor data have to be more rigorously taken into account. The comparison of the two statistical techniques, in the specific experimental setting presented here, indicates that more skilful results are achieved with RegEM as low frequency variability is better preserved. We further detect seasonal differences in reconstruction skill for the continental scale, as e.g. the target temperature average is more adequately reconstructed for summer than for winter. For the specific predictor network given in this paper, both techniques underestimate the target temperature variations to an increasing extent as more noise is added to the signal, albeit RegEM less than with PC regression. We conclude that climate field reconstruction techniques can be improved and need to be further optimized in future applications.