965 resultados para Bayesian Analysis
Resumo:
We sequenced part of the 16S rRNA mitochondrial gene in 17 extant taxa of Pilosa (sloths and anteaters) and used these sequences along with GenBank sequences of both extant and extinct sloths to perform phylogenetic analysis based on parsimony, maximum-likelihood and Bayesian methods. By increasing the taxa density for anteaters and sloths we were able to clarify some points of the Pilosa phylogenetic tree. Our mitochondrial 16S results show Bradypodidae as a monophyletic and robustly supported clade in all the analysis. However, the Pleistocene fossil Mylodon darwinii does not group significantly to either Bradypodidae or Megalonychidae which indicates that trichotomy best represents the relationship between the families Mylodontidae, Bradypodidae and Megalonychidae. Divergence times also allowed us to discuss the taxonomic status of Cyclopes and the three species of three-toed sloths, Bradypus tridactylus, Bradypus variegatus and Bradypus torquatus. In the Bradypodidae the split between Bradypus torquatus and the proto-Bradypus tridactylus / B. variegatus was estimated as about 7.7 million years ago (MYA), while in the Myrmecophagidae the first offshoot was Cyclopes at about 31.8 MYA followed by the split between Myrmecophaga and Tamandua at 12.9 MYA. We estimate the split between sloths and anteaters to have occurred at about 37 MYA.
Resumo:
The objective of the study was to estimate heritability for calving interval (CI) and age at first calving (AFC) and also calculate repeatability for CI in buffaloes using Bayesian inference. The Brazilian Buffaloes Genetic Improvement Program provided the database. Data consists on information from 628 females and four different herds, born between 1980 and 2003. In order to estimate the variance, univariate analyses were performed employing Gibbs sampler procedure included in the MTGSAM software. The model for CI included the random effects direct additive and permanent environment factors, and the fixed effects of contemporary groups and calving orders. The model for AFC included the direct additive random effect and contemporary groups as a fixed effect. The convergence diagnosis was obtained using Geweke that was implemented through the Bayesian Output Analysis package in R software. The estimated averages were 433.2 days and 36.7months for CI and AFC, respectively. The means, medians and modes for the calculated heritability coefficients were similar. The heritability coefficients were 0.10 and 0.42 for CI and AFC respectively, with a posteriori marginal density that follows a normal distribution for both traits. The repeatability for CI was 0.13. The low heritability estimated for CI indicates that the variation in this trait is, to a large extent, influenced by environmental factors such as herd management policies. The age at first calving has clear potential for yield improvement through direct selection in these animals.
Resumo:
The aim of this study was to estimate genetic, environmental and phenotypic correlation between birth weight (BW) and weight at 205 days age (W205), BW and weight at 365 days age (W365) and W205-W365, using Bayesian inference. The Brazilian Program for Genetic Improvement of Buffaloes provided the data that included 3,883 observations from Mediterranean breed buffaloes. With the purpose to estimate variance and covariance, bivariate analyses were performed using Gibbs sampler that is included in the MTGSAM software. The model for BW, W205 and W365 included additive direct and maternal genetic random effects, maternal environmental random effect and contemporary group as fixed effect. The convergence diagnosis was achieved using Geweke, a method that uses an algorithm implemented in R software through the package Bayesian Output Analysis. The calculated direct genetic correlations were 0.34 (BW-W205), 0.25 (BW-W365) and 0.74 (W205-W365). The environmental correlations were 0.12, 0.11 and 0.72 between BW-W205, BW-W365 and W205-W365, respectively. The phenotypic correlations were low for BW-W205 (0.01) and BW-W365 (0.04), differently than the obtained for W205-W365 with a value of 0.67. The results indicate that BW trait have low genetic, environmental and phenotypic association with the two others traits. The genetic correlation between W205 and W365 was high and suggests that the selection for weight at around 205 days could be beneficial to accelerate the genetic gain.
Resumo:
Quantitative analysis of growth genetic parameters is not available for many breeds of buffaloes making selection and breeding decisions an empirical process that lacks robustness. The objective of this study was to estimate heritability for birth weight (BW), weight at 205 days (W205) and 365 days (W365) of age using Bayesian inference. The Brazilian Program for Genetic Improvement of Buffaloes provided the data. For the traits BW, W205 and W365 of Brazilian Mediterranean buffaloes 5169, 3792 and 3883 observations have been employed for the analysis, respectively. In order to obtain the estimates of variance, univariate analyses were conducted using the Gibbs sampler included in the MTGSAM software. The model for BW, W205 and W365 included additive direct and maternal genetic random effects, random maternal permanent environmental effect and contemporary group that was treated as a fixed effect. The convergence diagnosis was performed employing Geweke, a method that uses an algorithm from the Bayesian Output Analysis package that was implemented using R software environment. The average values for weight traits were 37.6 +/- 4.7 kg for BW, 192.7 +/- 40.3 kg for W205 and 298.6 +/- 67.4 kg for W365. The heritability posterior distributions for direct and maternal effects were symmetric and close to those expected in a normal distribution. Direct heritability estimates obtained using the modes were 0.30 (BW), 0.52 (W205) and 0.54 (W365). The maternal heritability coefficient estimates were 0.31, 0.19 and 0.21 for BW, W205 and W365, respectively. Our data suggests that all growth traits and mainly W205 and W365, have clear potential for yield improvement through direct genetic selection.
Resumo:
The objective of the study was to estimate heritability and repeatability for milk yield (MY) and lactation length (LL) in buffaloes using Bayesian inference. The Brazilian genetic improvement program of buffalo provided the data that included 628 females, from four herds, born between 1980 and 2003. In order to obtain the estimates of variance, univariate analyses were performed with the Gibbs sampler, using the MTGSAM software. The model for MY and LL included direct genetic additive and permanent environment as random effects, and contemporary groups, milking frequency and calving number as fixed effects. The convergence diagnosis was performed with the Geweke method using an algorithm implemented in R software through the package Bayesian Output Analysis. Average for milk yield and lactation length was 1,546.1 +/- 483.8 kg and 252.3 +/- 42.5 days, respectively. The heritability coefficients were 0.31 (mode), 0.35 (mean) and 0.34 (median) for MY and 0.11 (mode), 0.10 (mean) and 0.10 (median) for LL. The repeatability coefficient (mode) were 0.50 and 0.15 for MY and LL, respectively. Milk yield is the only trait with clear potential for genetic improvement by direct genetic selection. The repeatability for MY indicates that selection based on the first lactation could contribute for an improvement in this trait.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Pós-graduação em Genética e Melhoramento Animal - FCAV
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
Analyses of ecological data should account for the uncertainty in the process(es) that generated the data. However, accounting for these uncertainties is a difficult task, since ecology is known for its complexity. Measurement and/or process errors are often the only sources of uncertainty modeled when addressing complex ecological problems, yet analyses should also account for uncertainty in sampling design, in model specification, in parameters governing the specified model, and in initial and boundary conditions. Only then can we be confident in the scientific inferences and forecasts made from an analysis. Probability and statistics provide a framework that accounts for multiple sources of uncertainty. Given the complexities of ecological studies, the hierarchical statistical model is an invaluable tool. This approach is not new in ecology, and there are many examples (both Bayesian and non-Bayesian) in the literature illustrating the benefits of this approach. In this article, we provide a baseline for concepts, notation, and methods, from which discussion on hierarchical statistical modeling in ecology can proceed. We have also planted some seeds for discussion and tried to show where the practical difficulties lie. Our thesis is that hierarchical statistical modeling is a powerful way of approaching ecological analysis in the presence of inevitable but quantifiable uncertainties, even if practical issues sometimes require pragmatic compromises.
Resumo:
The HIV-1 subtype C has spread efficiently in the southern states of Brazil (Rio Grande do Sul, Santa Catarina and Parana). Phylogeographic studies indicate that the subtype C epidemic in southern Brazil was initiated by the introduction of a single founder virus population at some time point between 1960 and 1980, but little is known about the spatial dynamics of viral spread. A total of 135 Brazilian HIV-1 subtype C pol sequences collected from 1992 to 2009 at the three southern state capitals (Porto Alegre, Florianopolis and Curitiba) were analyzed. Maximum-likelihood and Bayesian methods were used to explore the degree of phylogenetic mixing of subtype C sequences from different cities and to reconstruct the geographical pattern of viral spread in this country region. Phylogeographic analyses supported the monophyletic origin of the HIV-1 subtype C clade circulating in southern Brazil and placed the root of that clade in Curitiba (Parana state). This analysis further suggested that Florianopolis (Santa Catarina state) is an important staging post in the subtype C dissemination displaying high viral migration rates from and to the other cities, while viral flux between Curitiba and Porto Alegre (Rio Grande do Sul state) is very low. We found a positive correlation (r(2) = 0.64) between routine travel and viral migration rates among localities. Despite the intense viral movement, phylogenetic intermixing of subtype C sequences from different Brazilian cities is lower than expected by chance. Notably, a high proportion (67%) of subtype C sequences from Porto Alegre branched within a single local monophyletic sub-cluster. These results suggest that the HIV-1 subtype C epidemic in southern Brazil has been shaped by both frequent viral migration among states and in situ dissemination of local clades.
Resumo:
Item response theory (IRT) comprises a set of statistical models which are useful in many fields, especially when there is an interest in studying latent variables (or latent traits). Usually such latent traits are assumed to be random variables and a convenient distribution is assigned to them. A very common choice for such a distribution has been the standard normal. Recently, Azevedo et al. [Bayesian inference for a skew-normal IRT model under the centred parameterization, Comput. Stat. Data Anal. 55 (2011), pp. 353-365] proposed a skew-normal distribution under the centred parameterization (SNCP) as had been studied in [R. B. Arellano-Valle and A. Azzalini, The centred parametrization for the multivariate skew-normal distribution, J. Multivariate Anal. 99(7) (2008), pp. 1362-1382], to model the latent trait distribution. This approach allows one to represent any asymmetric behaviour concerning the latent trait distribution. Also, they developed a Metropolis-Hastings within the Gibbs sampling (MHWGS) algorithm based on the density of the SNCP. They showed that the algorithm recovers all parameters properly. Their results indicated that, in the presence of asymmetry, the proposed model and the estimation algorithm perform better than the usual model and estimation methods. Our main goal in this paper is to propose another type of MHWGS algorithm based on a stochastic representation (hierarchical structure) of the SNCP studied in [N. Henze, A probabilistic representation of the skew-normal distribution, Scand. J. Statist. 13 (1986), pp. 271-275]. Our algorithm has only one Metropolis-Hastings step, in opposition to the algorithm developed by Azevedo et al., which has two such steps. This not only makes the implementation easier but also reduces the number of proposal densities to be used, which can be a problem in the implementation of MHWGS algorithms, as can be seen in [R.J. Patz and B.W. Junker, A straightforward approach to Markov Chain Monte Carlo methods for item response models, J. Educ. Behav. Stat. 24(2) (1999), pp. 146-178; R. J. Patz and B. W. Junker, The applications and extensions of MCMC in IRT: Multiple item types, missing data, and rated responses, J. Educ. Behav. Stat. 24(4) (1999), pp. 342-366; A. Gelman, G.O. Roberts, and W.R. Gilks, Efficient Metropolis jumping rules, Bayesian Stat. 5 (1996), pp. 599-607]. Moreover, we consider a modified beta prior (which generalizes the one considered in [3]) and a Jeffreys prior for the asymmetry parameter. Furthermore, we study the sensitivity of such priors as well as the use of different kernel densities for this parameter. Finally, we assess the impact of the number of examinees, number of items and the asymmetry level on the parameter recovery. Results of the simulation study indicated that our approach performed equally as well as that in [3], in terms of parameter recovery, mainly using the Jeffreys prior. Also, they indicated that the asymmetry level has the highest impact on parameter recovery, even though it is relatively small. A real data analysis is considered jointly with the development of model fitting assessment tools. The results are compared with the ones obtained by Azevedo et al. The results indicate that using the hierarchical approach allows us to implement MCMC algorithms more easily, it facilitates diagnosis of the convergence and also it can be very useful to fit more complex skew IRT models.
Resumo:
We explore the meaning of information about quantities of interest. Our approach is divided in two scenarios: the analysis of observations and the planning of an experiment. First, we review the Sufficiency, Conditionality and Likelihood principles and how they relate to trivial experiments. Next, we review Blackwell Sufficiency and show that sampling without replacement is Blackwell Sufficient for sampling with replacement. Finally, we unify the two scenarios presenting an extension of the relationship between Blackwell Equivalence and the Likelihood Principle.
Resumo:
A set of predictor variables is said to be intrinsically multivariate predictive (IMP) for a target variable if all properly contained subsets of the predictor set are poor predictors of the. target but the full set predicts the target with great accuracy. In a previous article, the main properties of IMP Boolean variables have been analytically described, including the introduction of the IMP score, a metric based on the coefficient of determination (CoD) as a measure of predictiveness with respect to the target variable. It was shown that the IMP score depends on four main properties: logic of connection, predictive power, covariance between predictors and marginal predictor probabilities (biases). This paper extends that work to a broader context, in an attempt to characterize properties of discrete Bayesian networks that contribute to the presence of variables (network nodes) with high IMP scores. We have found that there is a relationship between the IMP score of a node and its territory size, i.e., its position along a pathway with one source: nodes far from the source display larger IMP scores than those closer to the source, and longer pathways display larger maximum IMP scores. This appears to be a consequence of the fact that nodes with small territory have larger probability of having highly covariate predictors, which leads to smaller IMP scores. In addition, a larger number of XOR and NXOR predictive logic relationships has positive influence over the maximum IMP score found in the pathway. This work presents analytical results based on a simple structure network and an analysis involving random networks constructed by computational simulations. Finally, results from a real Bayesian network application are provided. (C) 2012 Elsevier Inc. All rights reserved.