Biblioteca Digital

852 resultados para Gibbs sampler

Sampling phylogenetic tree space with the generalized Gibbs sampler

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The generalized Gibbs sampler (GGS) is a recently developed Markov chain Monte Carlo (MCMC) technique that enables Gibbs-like sampling of state spaces that lack a convenient representation in terms of a fixed coordinate system. This paper describes a new sampler, called the tree sampler, which uses the GGS to sample from a state space consisting of phylogenetic trees. The tree sampler is useful for a wide range of phylogenetic applications, including Bayesian, maximum likelihood, and maximum parsimony methods. A fast new algorithm to search for a maximum parsimony phylogeny is presented, using the tree sampler in the context of simulated annealing. The mathematics underlying the algorithm is explained and its time complexity is analyzed. The method is tested on two large data sets consisting of 123 sequences and 500 sequences, respectively. The new algorithm is shown to compare very favorably in terms of speed and accuracy to the program DNAPARS from the PHYLIP package.

Segmenting eukaryotic genomes with the generalized Gibbs sampler

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Eukaryotic genomes display segmental patterns of variation in various properties, including GC content and degree of evolutionary conservation. DNA segmentation algorithms are aimed at identifying statistically significant boundaries between such segments. Such algorithms may provide a means of discovering new classes of functional elements in eukaryotic genomes. This paper presents a model and an algorithm for Bayesian DNA segmentation and considers the feasibility of using it to segment whole eukaryotic genomes. The algorithm is tested on a range of simulated and real DNA sequences, and the following conclusions are drawn. Firstly, the algorithm correctly identifies non-segmented sequence, and can thus be used to reject the null hypothesis of uniformity in the property of interest. Secondly, estimates of the number and locations of change-points produced by the algorithm are robust to variations in algorithm parameters and initial starting conditions and correspond to real features in the data. Thirdly, the algorithm is successfully used to segment human chromosome 1 according to GC content, thus demonstrating the feasibility of Bayesian segmentation of eukaryotic genomes. The software described in this paper is available from the author's website (www.uq.edu.au/similar to uqjkeith/) or upon request to the author.

Interação genótipo x ambiente para a produção de leite na espécie bubalina utilizando inferência Bayesiana por meio de Amostradores de Gibbs

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Com o objetivo de verificar a existência da interação genótipo x ambiente, sob a forma de heterogeneidade de variâncias para a produção de leite na espécie bubalina e o seu impacto na avaliação genética dos animais, utilizando a inferência Bayesiana por meio de Amostrador de Gibbs, foram utilizados 5.484 registros de produção de leite referentes à produções de 2.994 búfalas predominantemente Murrah, filhas de 150 reprodutores, acasalados com 1130 matrizes, cujos partos ocorreram entre os anos de 1974 e 2004. Os registros foram provenientes do Programa de Melhoramento Genético dos Bubalinos (PROMEBUL) com a adição de registros provenientes do rebanho da EMBRAPA Amazônia Oriental -EAO, localizada em Belém, Pará. Foram estabelecidas classes de rebanho-ano de parto e de acordo com o desvio padrão de cada classe, os registros de produção de leite foram classificados em classes de alto e baixo desvio-padrão fenotípico. Posteriormente, os dados foram analisados desconsiderando e considerando as classes de desvio-padrão. O modelo utilizado empregou os efeitos fixos referentes às classes de rebanho-ano, mês de parto e covariáveis idade da fêmea ao parto e duração da lactação, além do efeito aleatório de animal, ambiente permanente e ambiente temporário. Para os efeitos fixos, foi assumido distribuição à priori uniforme e para os componentes de (co)variâncias foram assumidas distribuições priori qui-quadrado inversa e Wishart invertida. As médias observadas e desvio-padrão para produção de leite nas classes de alto e baixo desvio-padrão e em análise geral, foram iguais a 1870,21±758,78, 1900,50±587,76 e 1885,48±677,98, respectivamente. As médias posteriores para os componentes de variâncias foram maiores na classe de alto desvio-padrão. A herdabilidade obtida na classe de alto desvio-padrão foi próxima do valor observado na análise geral e inferior ao valor encontrado na classe de baixo desvio-padrão fenotípico. A correlação genética para produção de leite entre as classes de desvio-padrão foi igual a 0,58. As correlações de Spearman entre os valores genéticos para a produção de leite obtidos em análise geral com os valores obtidos nas classes de alto e baixo desvio padrão foram iguais a 0,94 e 0,93, respectivamente, para todos os reprodutores. Para uma amostra dos 10 melhores reprodutores, as mesmas correlações foram iguais a 0,94 e 0,47, respectivamente. Tais resultados revelam presença de heterogeneidade de variâncias entre rebanhos e esta heterogeneidade de variâncias é resultante de fatores ambientais, que podem levar a uma classificação errônea dos melhores reprodutores geneticamente para a produção leite.

A generalized Markov sampler

Relevância:

70.00% 70.00%

Publicador:

Resumo:

A recent development of the Markov chain Monte Carlo (MCMC) technique is the emergence of MCMC samplers that allow transitions between different models. Such samplers make possible a range of computational tasks involving models, including model selection, model evaluation, model averaging and hypothesis testing. An example of this type of sampler is the reversible jump MCMC sampler, which is a generalization of the Metropolis-Hastings algorithm. Here, we present a new MCMC sampler of this type. The new sampler is a generalization of the Gibbs sampler, but somewhat surprisingly, it also turns out to encompass as particular cases all of the well-known MCMC samplers, including those of Metropolis, Barker, and Hastings. Moreover, the new sampler generalizes the reversible jump MCMC. It therefore appears to be a very general framework for MCMC sampling. This paper describes the new sampler and illustrates its use in three applications in Computational Biology, specifically determination of consensus sequences, phylogenetic inference and delineation of isochores via multiple change-point analysis.

Análise bayesiana do modelo de herança monogênica no melhoramento vegetal: um exemplo com abobrinha

Relevância:

60.00% 60.00%

Publicador:

Resumo:

A common breeding strategy is to carry out basic studies to investigate the hypothesis of a single gene controlling the trait (major gene) with or without polygenes of minor effect. In this study we used Bayesian inference to fit genetic additive-dominance models of inheritance to plant breeding experiments with multiple generations. Normal densities with different means, according to the major gene genotype, were considered in a linear model in which the design matrix of the genetic effects had unknown coefficients (which were estimated in individual basis). An actual data set from an inheritance study of partenocarpy in zucchini (Cucurbita pepo L.) was used for illustration. Model fitting included posterior probabilities for all individual genotypes. Analysis agrees with results in the literature but this approach was far more efficient than previous alternatives assuming that design matrix was known for the generations. Partenocarpy in zucchini is controlled by a major gene with important additive effect and partial dominance.

Abordagem Bayesiana da curva de lactação de cabras Saanen de primeira e segunda ordem de parto

Relevância:

60.00% 60.00%

Publicador:

Resumo:

O objetivo deste trabalho foi utilizar o método Bayesiano no ajuste do modelo de Wood a dados de produção de leite de cabras da raça Saanen. Dois grupos de animais da primeira e segunda lactação foram considerados. Amostras das distribuições marginais a posteriori dos parâmetros do modelo de Wood e das funções de produção derivadas desses parâmetros - pico de produção, tempo do pico de produção, persistência e produção total de leite - foram obtidas pelo algoritmo Gibbs Sampler. As inferências foram feitas em cada população e os resultados mostraram diferenças na taxa de decréscimo da produção após o pico e na persistência, indicando maior produção nos animais de segunda lactação. Realizou-se um estudo de simulação de dados para avaliar o método Bayesiano sob diferentes estruturas de matrizes de covariâncias dos parâmetros. Os resultados desse estudo indicam que o método é eficiente no estudo das curvas de lactação quando a matriz de covariância apresenta alta correlação dos parâmetros.

Recyclage des candidats dans l'algorithme Metropolis à essais multiples

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Les méthodes de Monte Carlo par chaînes de Markov (MCCM) sont des méthodes servant à échantillonner à partir de distributions de probabilité. Ces techniques se basent sur le parcours de chaînes de Markov ayant pour lois stationnaires les distributions à échantillonner. Étant donné leur facilité d’application, elles constituent une des approches les plus utilisées dans la communauté statistique, et tout particulièrement en analyse bayésienne. Ce sont des outils très populaires pour l’échantillonnage de lois de probabilité complexes et/ou en grandes dimensions. Depuis l’apparition de la première méthode MCCM en 1953 (la méthode de Metropolis, voir [10]), l’intérêt pour ces méthodes, ainsi que l’éventail d’algorithmes disponibles ne cessent de s’accroître d’une année à l’autre. Bien que l’algorithme Metropolis-Hastings (voir [8]) puisse être considéré comme l’un des algorithmes de Monte Carlo par chaînes de Markov les plus généraux, il est aussi l’un des plus simples à comprendre et à expliquer, ce qui en fait un algorithme idéal pour débuter. Il a été sujet de développement par plusieurs chercheurs. L’algorithme Metropolis à essais multiples (MTM), introduit dans la littérature statistique par [9], est considéré comme un développement intéressant dans ce domaine, mais malheureusement son implémentation est très coûteuse (en termes de temps). Récemment, un nouvel algorithme a été développé par [1]. Il s’agit de l’algorithme Metropolis à essais multiples revisité (MTM revisité), qui définit la méthode MTM standard mentionnée précédemment dans le cadre de l’algorithme Metropolis-Hastings sur un espace étendu. L’objectif de ce travail est, en premier lieu, de présenter les méthodes MCCM, et par la suite d’étudier et d’analyser les algorithmes Metropolis-Hastings ainsi que le MTM standard afin de permettre aux lecteurs une meilleure compréhension de l’implémentation de ces méthodes. Un deuxième objectif est d’étudier les perspectives ainsi que les inconvénients de l’algorithme MTM revisité afin de voir s’il répond aux attentes de la communauté statistique. Enfin, nous tentons de combattre le problème de sédentarité de l’algorithme MTM revisité, ce qui donne lieu à un tout nouvel algorithme. Ce nouvel algorithme performe bien lorsque le nombre de candidats générés à chaque itérations est petit, mais sa performance se dégrade à mesure que ce nombre de candidats croît.

Bayesian estimation of the double hurdle model in the presence of fixed costs

Relevância:

60.00% 60.00%

Publicador:

Resumo:

We present a model of market participation in which the presence of non-negligible fixed costs leads to random censoring of the traditional double-hurdle model. Fixed costs arise when household resources must be devoted a priori to the decision to participate in the market. These costs, usually of time, are manifested in non-negligible minimum-efficient supplies and supply correspondence that requires modification of the traditional Tobit regression. The costs also complicate econometric estimation of household behavior. These complications are overcome by application of the Gibbs sampler. The algorithm thus derived provides robust estimates of the fixed-costs, double-hurdle model. The model and procedures are demonstrated in an application to milk market participation in the Ethiopian highlands.

Additive genetic relationships between scrotal circumference, heifer pregnancy, and stayability in Nellore cattle

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Scrotal circumference data from 47,605 Nellore young bulls, measured at around 18 mo of age (SC18), were analyzed simultaneously with 27,924 heifer pregnancy (HP) and 80,831 stayability (STAY) records to estimate their additive genetic relationships. Additionally, the possibility that economically relevant traits measured directly in females could replace SC18 as a selection criterion was verified. Heifer pregnancy was defined as the observation that a heifer conceived and remained pregnant, which was assessed by rectal palpation at 60 d. Females were exposed to sires for the first time at about 14 mo of age (between 11 and 16 mo). Stayability was defined as whether or not a cow calved every year up to 5 yr of age, when the opportunity to breed was provided. A Bayesian linear-threshold-threshold analysis via Gibbs sampler was used to estimate the variance and covariance components of the multitrait model. Heritability estimates were 0.42 +/- 0.01, 0.53 +/- 0.03, and 0.10 +/- 0.01, for SC18, HP, and STAY, respectively. The genetic correlation estimates were 0.29 +/- 0.05, 0.19 +/- 0.05, and 0.64 +/- 0.07 between SC18 and HP, SC18 and STAY, and HP and STAY, respectively. The residual correlation estimate between HP and STAY was -0.08 +/- 0.03. The heritability values indicate the existence of considerable genetic variance for SC18 and HP traits. However, genetic correlations between SC18 and the female reproductive traits analyzed in the present study can only be considered moderate. The small residual correlation between HP and STAY suggests that environmental effects common to both traits are not major. The large heritability estimate for HP and the high genetic correlation between HP and STAY obtained in the present study confirm that EPD for HP can be used to select bulls for the production of precocious, fertile, and long-lived daughters. Moreover, SC18 could be incorporated in multitrait analysis to improve the prediction accuracy for HP genetic merit of young bulls.

A Bayesian model for estimating the malaria transition probabilities considering individuals lost to follow-up

Relevância:

60.00% 60.00%

Publicador:

Resumo:

It is known that patients may cease participating in a longitudinal study and become lost to follow-up. The objective of this article is to present a Bayesian model to estimate the malaria transition probabilities considering individuals lost to follow-up. We consider a homogeneous population, and it is assumed that the considered period of time is small enough to avoid two or more transitions from one state of health to another. The proposed model is based on a Gibbs sampling algorithm that uses information of lost to follow-up at the end of the longitudinal study. To simulate the unknown number of individuals with positive and negative states of malaria at the end of the study and lost to follow-up, two latent variables were introduced in the model. We used a real data set and a simulated data to illustrate the application of the methodology. The proposed model showed a good fit to these data sets, and the algorithm did not show problems of convergence or lack of identifiability. We conclude that the proposed model is a good alternative to estimate probabilities of transitions from one state of health to the other in studies with low adherence to follow-up.

Skew-normal linear calibration: a Bayesian perspective

Relevância:

60.00% 60.00%

Publicador:

Resumo:

In this paper, we present a Bayesian approach for estimation in the skew-normal calibration model, as well as the conditional posterior distributions which are useful for implementing the Gibbs sampler. Data transformation is thus avoided by using the methodology proposed. Model fitting is implemented by proposing the asymmetric deviance information criterion, ADIC, a modification of the ordinary DIC. We also report an application of the model studied by using a real data set, related to the relationship between the resistance and the elasticity of a sample of concrete beams. Copyright (C) 2008 John Wiley & Sons, Ltd.

Bayesian Analysis for the Generalized Lognormal Distribution Applied to Failure Time Analysis

Relevância:

60.00% 60.00%

Publicador:

Resumo:

There are several versions of the lognormal distribution in the statistical literature, one is based in the exponential transformation of generalized normal distribution (GN). This paper presents the Bayesian analysis for the generalized lognormal distribution (logGN) considering independent non-informative Jeffreys distributions for the parameters as well as the procedure for implementing the Gibbs sampler to obtain the posterior distributions of parameters. The results are used to analyze failure time models with right-censored and uncensored data. The proposed method is illustrated using actual failure time data of computers.

Modelos dinâmicos e simulação estocástica

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This paper presents new methodology for making Bayesian inference about dy~ o!s for exponential famiIy observations. The approach is simulation-based _~t> use of ~vlarkov chain Monte Carlo techniques. A yletropolis-Hastings i:U~UnLlllll 1::; combined with the Gibbs sampler in repeated use of an adjusted version of normal dynamic linear models. Different alternative schemes are derived and compared. The approach is fully Bayesian in obtaining posterior samples for state parameters and unknown hyperparameters. Illustrations to real data sets with sparse counts and missing values are presented. Extensions to accommodate for general distributions for observations and disturbances. intervention. non-linear models and rnultivariate time series are outlined.

Bayesian inference in genetic parameter estimation of visual scores in Nellore beef-cattle

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The aim of this study was to estimate the components of variance and genetic parameters for the visual scores which constitute the Morphological Evaluation System (MES), such as body structure (S), precocity (P) and musculature (M) in Nellore beef-cattle at the weaning and yearling stages, by using threshold Bayesian models. The information used for this was gleaned from visual scores of 5,407 animals evaluated at the weaning and 2,649 at the yearling stages. The genetic parameters for visual score traits were estimated through two-trait analysis, using the threshold animal model, with Bayesian statistics methodology and MTGSAM (Multiple Trait Gibbs Sampler for Animal Models) threshold software. Heritability estimates for S, P and M were 0.68, 0.65 and 0.62 (at weaning) and 0.44, 0.38 and 0.32 (at the yearling stage), respectively. Heritability estimates for S, P and M were found to be high, and so it is expected that these traits should respond favorably to direct selection. The visual scores evaluated at the weaning and yearling stages might be used in the composition of new selection indexes, as they presented sufficient genetic variability to promote genetic progress in such morphological traits.

Genetic parameters of total milk yield and factors describing the shape of lactation curve in dairy buffaloes

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The objective of this study was to apply factor analysis to describe lactation curves in dairy buffaloes in order to estimate the phenotypic and genetic association between common latent factors and cumulative milk yield. A total of 31 257 monthly test-day milk yield records from buffaloes belonging to herds located in the state of São Paulo were used to estimate two common latent factors, which were then analysed in a multi-trait animal model for estimating genetic parameters. Estimates of (co)variance components for the two common latent factors and cumulated 270-d milk yield were obtained by Bayesian inference using a multiple trait animal model. Contemporary group, number of milkings per day (two levels) and age of buffalo cow at calving (linear and quadratic) as covariate were included in the model as fixed effects. The additive genetic, permanent environmental and residual effects were included as random effects. The first common latent factor (F1) was associated with persistency of lactation and the second common latent factor (F2) with the level of production in early lactation. Heritability estimates for Fl and F2 were 0.12 and 0.07, respectively. Genetic correlation estimates between El and F2 with cumulative milk yield were positive and moderate (0.63 and 0.52). Multivariate statistics employing factor analysis allowed the extraction of two variables (latent factors) that described the shape of the lactation curve. It is expected that the response to selection to increase lactation persistency is higher than the response obtained from selecting animals to increase lactation peak. Selection for higher total milk yield would result in a favourable correlated response to increase the level of production in early lactation and the lactation persistency.

«
1
2
3
4
5
6
7
8
...
56
57
»