954 resultados para bayesian bottleneck
Resumo:
Um modelo bayesiano de regressão binária é desenvolvido para predizer óbito hospitalar em pacientes acometidos por infarto agudo do miocárdio. Métodos de Monte Carlo via Cadeias de Markov (MCMC) são usados para fazer inferência e validação. Uma estratégia para construção de modelos, baseada no uso do fator de Bayes, é proposta e aspectos de validação são extensivamente discutidos neste artigo, incluindo a distribuição a posteriori para o índice de concordância e análise de resíduos. A determinação de fatores de risco, baseados em variáveis disponíveis na chegada do paciente ao hospital, é muito importante para a tomada de decisão sobre o curso do tratamento. O modelo identificado se revela fortemente confiável e acurado, com uma taxa de classificação correta de 88% e um índice de concordância de 83%.
Resumo:
In this work we compared the estimates of the parameters of ARCH models using a complete Bayesian method and an empirical Bayesian method in which we adopted a non-informative prior distribution and informative prior distribution, respectively. We also considered a reparameterization of those models in order to map the space of the parameters into real space. This procedure permits choosing prior normal distributions for the transformed parameters. The posterior summaries were obtained using Monte Carlo Markov chain methods (MCMC). The methodology was evaluated by considering the Telebras series from the Brazilian financial market. The results show that the two methods are able to adjust ARCH models with different numbers of parameters. The empirical Bayesian method provided a more parsimonious model to the data and better adjustment than the complete Bayesian method.
Resumo:
Several statistical models can be used for assessing genotype X environment interaction (GEI) and studying genotypic stability. The objectives of this research were to show how (i) to use Bayesian methodology for computing Shukla's phenotypic stability variance and (ii) to incorporate prior information on the parameters for better estimation. Potato [Solanum tuberosum subsp. andigenum (Juz. & Bukasov) Hawkes], wheat (Triticum aestivum L.), and maize (Zea mays L.) multi environment trials (MET) were used for illustrating the application of the Bayes paradigm. The potato trial included 15 genotypes, but prior information for just three genotypes was used. The wheat trial used prior information on all 10 genotypes included in the trial, whereas for the maize trial, noninformative priors for the nine genotypes was used. Concerning the posterior distribution of the genotypic means, the maize MET with 20 sites gave less disperse posterior distributions of the genotypic means than did the posterior distribution of the genotypic means of the other METs, which included fewer environments. The Bayesian approach allows use of other statistical strategies such as the normal truncated distribution (used in this study). When analyzing grain yield, a lower bound of zero and an upper bound set by the researcher's experience can be used. The Bayesian paradigm offers plant breeders the possibility of computing the probability of a genotype being the best performer. The results of this study show that although some genotypes may have a very low probability of being the best in all sites, they have a relatively good chance of being among the five highest yielding genotypes.
Resumo:
Linear mixed effects models are frequently used to analyse longitudinal data, due to their flexibility in modelling the covariance structure between and within observations. Further, it is easy to deal with unbalanced data, either with respect to the number of observations per subject or per time period, and with varying time intervals between observations. In most applications of mixed models to biological sciences, a normal distribution is assumed both for the random effects and for the residuals. This, however, makes inferences vulnerable to the presence of outliers. Here, linear mixed models employing thick-tailed distributions for robust inferences in longitudinal data analysis are described. Specific distributions discussed include the Student-t, the slash and the contaminated normal. A Bayesian framework is adopted, and the Gibbs sampler and the Metropolis-Hastings algorithms are used to carry out the posterior analyses. An example with data on orthodontic distance growth in children is discussed to illustrate the methodology. Analyses based on either the Student-t distribution or on the usual Gaussian assumption are contrasted. The thick-tailed distributions provide an appealing robust alternative to the Gaussian process for modelling distributions of the random effects and of residuals in linear mixed models, and the MCMC implementation allows the computations to be performed in a flexible manner.
Resumo:
Practical Bayesian inference depends upon detailed examination of posterior distribution. When the prior and likelihood are conjugate, this is easily carried out; however, in general, one must resort to numerical approximation. In this paper, our aim is to solve, using MAPLE, the Bayesian paradigm, for a very special data collecting procedure, known as the randomized-response technique. This allows researchers to obtain sensitive information while guaranteeing privacy to respondents. This approach intends to reduce false responses on sensitive questions. Exact methods and approximations will be compared from the accuracy point of view as well as for the computational effort.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Knowledge of genetic parameters is essential for improved reproductive management and increased yield. Quantitative analysis of genetic parameters is lacking for many breeds of buffaloes. This article provides the first estimate of genetic parameters for dual purpose (meat and milk) Brazilian Jaffarabadi buffaloes, using Bayesian inference. Data on milk yield (MY), lactation length (LL), weight at 205 days (W205) and 365 (W365) days of age, and average daily gain (ADG) from 205 to 365 days of age were collected in two herds. Bivariate analyses (using the program MTGSAM) were performed with the Gibbs sampler to obtain estimates of variance and covariance. Average lactation milk yield and lactation length were 1 620.2 +/- 450.9 kg and 257.6 +/- 46.8 days, respectively, and the mean values for weight traits (kg) were 181.6 +/- 63.3 (W205), 298.04 +/- 116.1 (W365), and 0.73 +/- 0.35 (ADG). Heritability estimates (modes) were 0.16 for MY, 0.10 for LL, 0.43 for W205, 0.48 for W365 and 0.32 for ADG. There was a high genetic correlation (0.96) between milk yield and lactation length and very high genetic correlations (0.99) between the three growth traits. Our data suggest that both milk production and growth traits have clear potential for yield improvement through direct selection in this dual purpose breed. The selection for weight at an early age would be successful and selection for MY can be performed in the first lactation.
Resumo:
We propose alternative approaches to analyze residuals in binary regression models based on random effect components. Our preferred model does not depend upon any tuning parameter, being completely automatic. Although the focus is mainly on accommodation of outliers, the proposed methodology is also able to detect them. Our approach consists of evaluating the posterior distribution of random effects included in the linear predictor. The evaluation of the posterior distributions of interest involves cumbersome integration, which is easily dealt with through stochastic simulation methods. We also discuss different specifications of prior distributions for the random effects. The potential of these strategies is compared in a real data set. The main finding is that the inclusion of extra variability accommodates the outliers, improving the adjustment of the model substantially, besides correctly indicating the possible outliers.
Resumo:
The generalized exponential distribution, proposed by Gupta and Kundu (1999), is a good alternative to standard lifetime distributions as exponential, Weibull or gamma. Several authors have considered the problem of Bayesian estimation of the parameters of generalized exponential distribution, assuming independent gamma priors and other informative priors. In this paper, we consider a Bayesian analysis of the generalized exponential distribution by assuming the conventional non-informative prior distributions, as Jeffreys and reference prior, to estimate the parameters. These priors are compared with independent gamma priors for both parameters. The comparison is carried out by examining the frequentist coverage probabilities of Bayesian credible intervals. We shown that maximal data information prior implies in an improper posterior distribution for the parameters of a generalized exponential distribution. It is also shown that the choice of a parameter of interest is very important for the reference prior. The different choices lead to different reference priors in this case. Numerical inference is illustrated for the parameters by considering data set of different sizes and using MCMC (Markov Chain Monte Carlo) methods.
Resumo:
P>In this study, Bayesian analysis under a threshold animal model was used to estimate genetic correlations between morphological traits (body structure, finishing precocity and muscling) in Nelore cattle evaluated at weaning and yearling. Visual scores obtained from 7651 Nelore cattle at weaning and from 4155 animals at yearling, belonging to the Brazilian Nelore Program, were used. Genetic parameters for the morphological traits were estimated by two-trait Bayesian analysis under a threshold animal model. The genetic correlations between the morphological traits evaluated at two ages of the animal (weaning and yearling) were positive and high for body structure (0.91), finishing precocity (0.96) and muscling (0.94). These results indicate that the traits are mainly determined by the same set of genes of additive action and that direct selection at weaning will also result in genetic progress for the same traits at yearling. Thus, selection of the best genotypes during only one phase of life of the animal is suggested. However, genetic differences between morphological traits were better detected during the growth phase to yearling. Direct selection for body structure, finishing precocity and muscling at only one age, preferentially at yearling, is recommended as genetic differences between traits can be detected at this age.
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
This study proposes to ascertain the importance of each alimentary category in the Tetrapturus albidus diet composition, as well as to propose the use of the Bayesian approach for analysis of these data. The stomachs were collected during fishing cruises carried out by the Santos-SP longliner from July 2007 to June 2008. For Bayesian model formulation, each alimentary item was clustered in four food categories as: teleost, cephalopod, crustaceans, and others. To estimate the proportion of each food category, the multinomial model with Dirichlet conjugate prior distribution was used. After the stomach contents analysis, 133 food items were identified, which belonged to 9 taxa. The most important food category is constituted by cephalopod molluscs, followed by teleost fishes. The food category comprised of crustaceans presents a low contribution and in this case it could be considered to be an accidental food item. The Bayesian approach means a distinct view in relation to traditional methods, as it permits one to incorporate information obtained from the literature. It should be useful to analyse great top predators, which are usually caught in small numbers.
Resumo:
In the context of Bayesian statistical analysis, elicitation is the process of formulating a prior density f(.) about one or more uncertain quantities to represent a person's knowledge and beliefs. Several different methods of eliciting prior distributions for one unknown parameter have been proposed. However, there are relatively few methods for specifying a multivariate prior distribution and most are just applicable to specific classes of problems and/or based on restrictive conditions, such as independence of variables. Besides, many of these procedures require the elicitation of variances and correlations, and sometimes elicitation of hyperparameters which are difficult for experts to specify in practice. Garthwaite et al. (2005) discuss the different methods proposed in the literature and the difficulties of eliciting multivariate prior distributions. We describe a flexible method of eliciting multivariate prior distributions applicable to a wide class of practical problems. Our approach does not assume a parametric form for the unknown prior density f(.), instead we use nonparametric Bayesian inference, modelling f(.) by a Gaussian process prior distribution. The expert is then asked to specify certain summaries of his/her distribution, such as the mean, mode, marginal quantiles and a small number of joint probabilities. The analyst receives that information, treating it as a data set D with which to update his/her prior beliefs to obtain the posterior distribution for f(.). Theoretical properties of joint and marginal priors are derived and numerical illustrations to demonstrate our approach are given. (C) 2010 Elsevier B.V. All rights reserved.