973 resultados para Monte Carlo à chaîne de Markov


Relevância:

100.00% 100.00%

Publicador:

Resumo:

We develop a model for stochastic processes with random marginal distributions. Our model relies on a stick-breaking construction for the marginal distribution of the process, and introduces dependence across locations by using a latent Gaussian copula model as the mechanism for selecting the atoms. The resulting latent stick-breaking process (LaSBP) induces a random partition of the index space, with points closer in space having a higher probability of being in the same cluster. We develop an efficient and straightforward Markov chain Monte Carlo (MCMC) algorithm for computation and discuss applications in financial econometrics and ecology. This article has supplementary material online.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We describe a strategy for Markov chain Monte Carlo analysis of non-linear, non-Gaussian state-space models involving batch analysis for inference on dynamic, latent state variables and fixed model parameters. The key innovation is a Metropolis-Hastings method for the time series of state variables based on sequential approximation of filtering and smoothing densities using normal mixtures. These mixtures are propagated through the non-linearities using an accurate, local mixture approximation method, and we use a regenerating procedure to deal with potential degeneracy of mixture components. This provides accurate, direct approximations to sequential filtering and retrospective smoothing distributions, and hence a useful construction of global Metropolis proposal distributions for simulation of posteriors for the set of states. This analysis is embedded within a Gibbs sampler to include uncertain fixed parameters. We give an example motivated by an application in systems biology. Supplemental materials provide an example based on a stochastic volatility model as well as MATLAB code.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Transcriptional regulation has been studied intensively in recent decades. One important aspect of this regulation is the interaction between regulatory proteins, such as transcription factors (TF) and nucleosomes, and the genome. Different high-throughput techniques have been invented to map these interactions genome-wide, including ChIP-based methods (ChIP-chip, ChIP-seq, etc.), nuclease digestion methods (DNase-seq, MNase-seq, etc.), and others. However, a single experimental technique often only provides partial and noisy information about the whole picture of protein-DNA interactions. Therefore, the overarching goal of this dissertation is to provide computational developments for jointly modeling different experimental datasets to achieve a holistic inference on the protein-DNA interaction landscape.

We first present a computational framework that can incorporate the protein binding information in MNase-seq data into a thermodynamic model of protein-DNA interaction. We use a correlation-based objective function to model the MNase-seq data and a Markov chain Monte Carlo method to maximize the function. Our results show that the inferred protein-DNA interaction landscape is concordant with the MNase-seq data and provides a mechanistic explanation for the experimentally collected MNase-seq fragments. Our framework is flexible and can easily incorporate other data sources. To demonstrate this flexibility, we use prior distributions to integrate experimentally measured protein concentrations.

We also study the ability of DNase-seq data to position nucleosomes. Traditionally, DNase-seq has only been widely used to identify DNase hypersensitive sites, which tend to be open chromatin regulatory regions devoid of nucleosomes. We reveal for the first time that DNase-seq datasets also contain substantial information about nucleosome translational positioning, and that existing DNase-seq data can be used to infer nucleosome positions with high accuracy. We develop a Bayes-factor-based nucleosome scoring method to position nucleosomes using DNase-seq data. Our approach utilizes several effective strategies to extract nucleosome positioning signals from the noisy DNase-seq data, including jointly modeling data points across the nucleosome body and explicitly modeling the quadratic and oscillatory DNase I digestion pattern on nucleosomes. We show that our DNase-seq-based nucleosome map is highly consistent with previous high-resolution maps. We also show that the oscillatory DNase I digestion pattern is useful in revealing the nucleosome rotational context around TF binding sites.

Finally, we present a state-space model (SSM) for jointly modeling different kinds of genomic data to provide an accurate view of the protein-DNA interaction landscape. We also provide an efficient expectation-maximization algorithm to learn model parameters from data. We first show in simulation studies that the SSM can effectively recover underlying true protein binding configurations. We then apply the SSM to model real genomic data (both DNase-seq and MNase-seq data). Through incrementally increasing the types of genomic data in the SSM, we show that different data types can contribute complementary information for the inference of protein binding landscape and that the most accurate inference comes from modeling all available datasets.

This dissertation provides a foundation for future research by taking a step toward the genome-wide inference of protein-DNA interaction landscape through data integration.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We propose a novel unsupervised approach for linking records across arbitrarily many files, while simultaneously detecting duplicate records within files. Our key innovation is to represent the pattern of links between records as a {\em bipartite} graph, in which records are directly linked to latent true individuals, and only indirectly linked to other records. This flexible new representation of the linkage structure naturally allows us to estimate the attributes of the unique observable people in the population, calculate $k$-way posterior probabilities of matches across records, and propagate the uncertainty of record linkage into later analyses. Our linkage structure lends itself to an efficient, linear-time, hybrid Markov chain Monte Carlo algorithm, which overcomes many obstacles encountered by previously proposed methods of record linkage, despite the high dimensional parameter space. We assess our results on real and simulated data.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Históricamente, los modelos de no-ejercicio para predecir el consumo máximo de oxígeno (VO2max) han sido construidos mediante regresión lineal frecuentista, usando técnicas estándar de selección de modelos. Sin embargo, existe incertidumbre acerca de la estructura estadística en el proceso de selección del modelo. En este estudio se propuso construir un modelo de no-ejercicio para predecir el VO2max en deportistas orientados al rendimiento, considerando la incertidumbre de modelo a través del Promedio Bayesiano de Modelos (BMA). Un objetivo adicional fue comparar la performance predictiva del BMA con las de los modelos derivados de varias técnicas frecuentistas usuales de selección de variables. Con tal fin, se implementó un submuestreo aleatorio estratificado repetido. Los datos incluyeron observaciones de la variable respuesta (en L·min-1), así como registros de Género, Deporte, Edad, Peso, Talla e Índice de masa corporal (BMI) (Edad = 22.1 ± 4.9 años, media ± SD; n = 272). Se propuso una clasificación de deportes con el objetivo de incluirla dentro del proceso de construcción del modelo: Combate, Juego, Resistencia 1 y Resistencia 2. El enfoque BMA se implementó en base a dos métodos: Occam's window y Composición de Modelo mediante el método de Monte Carlo con Cadenas de Markov (MC²). Se observaron discrepancias en la selección de variables entre los procedimientos frecuentistas. Ambos métodos de BMA produjeron resultados muy similares. Los modelos que incluyeron Género y las variables dummies para Resistencia 1 y Resistencia 2 acumularon virtualmente toda la probabilidad de modelo a posteriori. El Peso fue el predictor continuo con la más alta probabilidad de inclusión a posteriori (menor a 0.8). Las combinaciones de variables que involucraron predictores con un alto nivel de multicolinealidad fueron desacreditadas. Los modelos con sustancial contribución para el BMA presentaron un ajuste apreciable (R² ajustado menor a 0.8). Entre los modelos seleccionados por estrategias frecuentistas, el obtenido mediante el método de regresión por pasos (Stepwise regression method) con alfa igual a 0.05 fue el más respaldado por los datos, en términos de probabilidad de modelo a posteriori. En concordancia con la literatura, el BMA tuvo mejor performance predictiva de los datos fuera de la muestra que los modelos seleccionados por técnicas frecuentistas, medida por la cobertura del intervalo de predicción de 90 por ciento. La clasificación de deportes reveló resultados consistentes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The lesser sandeel Ammodytes marinus is a key species in the North Sea ecosystem, transferring energy from planktonic producers to top predators. Previous studies have shown a long-term decline in the size of 0-group sandeels in the western North Sea, but they were unable to pinpoint the mechanism (later hatching, slower growth or changes in size-dependent mortality) or cause. To investigate the first 2 possibilities we combined 2 independent time series of sandeel size, namely data from chick-feeding Atlantic puffins Fratercula arctica and from the Continuous Plankton Recorder (CPR), in a novel statistical model implemented using Markov Chain Monte Carlo (MCMC). The model estimated annual mean length on 1 July, as well as hatching date and growth rate for sandeels from 1973 to 2006. Mean length-at-date declined by 22% over this period, corresponding to a 60% decrease in energy content, with a sharper decline since 2002. Up to the mid-1990s, the decline was associated with a trend towards later hatching. Subsequently, hatching became earlier again, and the continued trend towards smaller size appears to have been driven by lower growth rates, particularly in the most recent years, although we could not rule out changes in size-dependent mortality. Our findings point to major changes in key aspects of sandeel life history, which we consider are most likely due to direct and indirect temperature-related changes over a range of biotic factors, including the seasonal distribution of copepods and intra- and inter-specific competition with planktivorous fish. The results have implications both for the many predators of sandeels and for age and size of maturation in this aggregation of North Sea sandeels.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Raised bog peat deposits form important archives for reconstructing past changes in climate. Precise and reliable age models are of vital importance for interpreting such archives. We propose enhanced, Markov chain Monte Carlo based methods for obtaining age models from radiocarbon-dated peat cores, based on the assumption of piecewise linear accumulation. Included are automatic choice of sections, a measure of the goodness of fit and outlier downweighting. The approach is illustrated by using a peat core from the Netherlands.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Some of the first results are reported from RISE - a new fast camera mounted on the Liverpool Telescope primarily designed to obtain high time-resolution light curves of transiting extrasolar planets for the purpose of transit timing. A full and partial transit of WASP-3 are presented, and a Markov-Chain Monte Carlo analysis is used to update the parameters from the discovery paper. This results in a planetary radius of 1.29(-0.12)(+0.05) R-J and therefore a density of 0.82(-0.09)(+0.14) rho(J), consistent with previous results. The inclination is 85.06(-0.15)(+0.16) deg, in agreement (but with a significant improvement in the precision) with the previously determined value. Central transit times are found to be consistent with the ephemeris given in the discovery paper; however, a new ephemeris calculated using the longer baseline results in T-c(0) = 2 454 605.55915 +/- 0.00023 HJD and P = 1.846835 +/- 0.000002 days.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present nine newly observed transits of TrES-3, taken as part of a transit timing program using the RISE instrument on the Liverpool Telescope. A Markov-Chain Monte Carlo analysis was used to determine the planet star radius ratio and inclination of the system, which were found to be R-p/R-star = 0.1664(-0.0018)(+0.0011) and i = 81.73(-0.04)(+0.13), respectively, consistent with previous results. The central transit times and uncertainties were also calculated, using a residual-permutation algorithm as an independent check on the errors. A re-analysis of eight previously published TrES-3 light curves was conducted to determine the transit times and uncertainties using consistent techniques. Whilst the transit times were not found to be in agreement with a linear ephemeris, giving chi(2) = 35.07 for 15 degrees of freedom, we interpret this to be the result of systematics in the light curves rather than a real transit timing variation. This is because the light curves that show the largest deviation from a constant period either have relatively little out-of-transit coverage or have clear systematics. A new ephemeris was calculated using the transit times and was found to be T-c(0) = 2454632.62610 +/- 0.00006 HJD and P = 1.3061864 +/- 0.0000005 days. The transit times were then used to place upper mass limits as a function of the period ratio of a potential perturbing planet, showing that our data are sufficiently sensitive to have probed sub-Earth mass planets in both interior and exterior 2:1 resonances, assuming that the additional planet is in an initially circular orbit.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We report the discovery of WASP-10b, a new transiting extrasolar planet (ESP) discovered by the Wide Angle Search for Planets ( WASP) Consortium and confirmed using Nordic Optical Telescope FIbre-fed Echelle Spectrograph and SOPHIE radial velocity data. A 3.09-d period, 29 mmag transit depth and 2.36 h duration are derived for WASP-10b using WASP and high-precision photometric observations. Simultaneous fitting to the photometric and radial velocity data using a Markov Chain Monte Carlo procedure leads to a planet radius of 1.28R(J), a mass of 2.96M(J) and eccentricity of approximate to 0.06. WASP-10b is one of the more massive transiting ESPs, and we compare its characteristics to the current sample of transiting ESP, where there is currently little information for masses greater than approximate to 2M(J) and non-zero eccentricities. WASP-10's host star, GSC 2752-00114 (USNO-B1.0 1214-0586164) is among the fainter stars in the WASP sample, with V = 12.7 and a spectral type of K5. This result shows promise for future late-type dwarf star surveys.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The IntCal04 and Marine04 radiocarbon calibration curves have been updated from 12 cal kBP (cal kBP is here defined as thousands of calibrated years before AD 1950), and extended to 50 cal kBP, utilizing newly available data sets that meet the IntCal Working Group criteria for pristine corals and other carbonates and for quantification of uncertainty in both the 14C and calendar timescales as established in 2002. No change was made to the curves from 0-12 cal kBP. The curves were constructed using a Markov chain Monte Carlo (MCMC) implementation of the random walk model used for IntCal04 and Marine04. The new curves were ratified at the 20th International Radiocarbon Conference in June 2009 and are available in the Supplemental Material at www.radiocarbon.org.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present seven light curves of the exoplanet system HAT-P-3, taken as part of a transit timing programme using the rapid imager to search for exoplanets instrument on the Liverpool Telescope. The light curves are analysed using a Markov chain Monte Carlo algorithm to update the parameters of the system. The inclination is found to be i = 86.75+0.22-0.21°, the planet-star radius ratio to be Rp/R* = 0.1098+0.0010-0.0012 and the stellar radius to be R* = 0.834+0.018-0.026Rsolar, consistent with previous results but with a significant improvement in the precision. Central transit times and uncertainties for each light curve are also determined, and a residual permutation algorithm is used as an independent check on the errors. The transit times are found to be consistent with a linear ephemeris, and a new ephemeris is calculated as Tc(0) = 2454856.70118 +/- 0.00018 HJD and P = 2.899738 +/- 0.000007 d. Model timing residuals are fitted to the measured timing residuals to place upper mass limits for a hypothetical perturbing planet as a function of the period ratio. These show that we have probed for planets with masses as low as 0.33 and 1.81 M? in the interior and exterior 2:1 resonances, respectively, assuming the planets are initially in circular orbits.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We report a transit timing study of the transiting exoplanetary system HD 189733. In total, we observed 10 transits in 2006 and 2008 with the 2.6-m Nordic Optical Telescope, and two transits in 2007 with the 4.2-m William Herschel Telescope. We used Markov Chain Monte Carlo simulations to derive the system parameters and their uncertainties, and our results are in a good agreement with previously published values. We performed two independent analyses of transit timing residuals to place upper mass limits on putative perturbing planets. The results show no evidence for the presence of planets down to 1 Earth mass near the 1:2 and 2:1 resonance orbits, and planets down to 2.2 Earth masses near the 3:5 and 5:3 resonance orbits with HD 189733b. These are the strongest limits to date on the presence of other planets in this system. Based on observations made with the Nordic Optical Telescope, operated on the island of La Palma jointly by Denmark, Finland, Iceland, Norway and Sweden, in the Spanish Observatorio del Roque de los Muchachos of the Instituto de Astrofísica de Canarias. Based on observations made with the William Herschel Telescope operated on the island of La Palma by the Isaac Newton Group in the Spanish Observatorio del Roque de los Muchachos of the Instituto de Astrofísica de Canarias. ‡

Relevância:

100.00% 100.00%

Publicador:

Resumo:

We present an occultation of the newly discovered hot Jupiter system WASP-19, observed with the High Acuity Wide-field K-band Imager instrument on the VLT, in order to measure thermal emission from the planet's dayside at ~2µm. The light curve was analysed using a Markov Chain Monte Carlo method to find the eclipse depth and the central transit time. The transit depth was found to be 0.366 +/- 0.072 per cent, corresponding to a brightness temperature of 2540 +/- 180 K. This is significantly higher than the calculated (zero-albedo) equilibrium temperature and indicates that the planet shows poor redistribution of heat to the night side, consistent with models of highly irradiated planets. Further observations are needed to confirm the existence of a temperature inversion and possibly molecular emission lines. The central eclipse time was found to be consistent with a circular orbit.