882 resultados para Bayesian model selection


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Annual counts of migrating raptors at fixed observation points are a widespread practice, and changes in numbers counted over time, adjusted for survey effort, are commonly used as indices of trends in population size. Unmodeled year-to-year variation in detectability may introduce bias, reduce precision of trend estimates, and reduce power to detect trends. We conducted dependent double-observer surveys at the annual fall raptor migration count at Lucky Peak, Idaho, in 2009 and 2010 and applied Huggins closed-capture removal models and information-theoretic model selection to determine the relative importance of factors affecting detectability. The most parsimonious model included effects of observer team identity, distance, species, and day of the season. We then simulated 30 years of counts with heterogeneous individual detectability, a population decline (λ = 0.964), and unexplained random variation in the number of available birds. Imperfect detectability did not bias trend estimation, and increased the time required to achieve 80% power by less than 11%. Results suggested that availability is a greater source of variance in annual counts than detectability; thus, efforts to account for availability would improve the monitoring value of migration counts. According to our models, long-term trends in observer efficiency or migratory flight distance may introduce substantial bias to trend estimates. Estimating detectability with a novel count protocol like our double-observer method is just one potential means of controlling such effects. The traditional approach of modeling the effects of covariates and adjusting the index may also be effective if ancillary data is collected consistently.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Foot-and-mouth disease (FMD), a disease of cloven hooved animals caused by FMD virus (FMDV), is one of the most economically devastating diseases of livestock worldwide. The global burden of disease is borne largely by livestock-keepers in areas of Africa and Asia where the disease is endemic and where many people rely on livestock for their livelihoods and food-security. Yet, there are many gaps in our knowledge of the drivers of FMDV circulation in these settings. In East Africa, FMD epidemiology is complicated by the circulation of multiple FMDV serotypes (distinct antigenic variants) and by the presence of large populations of susceptible wildlife and domestic livestock. The African buffalo (Syncerus caffer) is the only wildlife species with consistent evidence of high levels of FMDV infection, and East Africa contains the largest population of this species globally. To inform FMD control in this region, key questions relate to heterogeneities in FMD prevalence and impacts in different livestock management systems and to the role of wildlife as a potential source of FMDV for livestock. To develop FMD control strategies and make best use of vaccine control options, serotype-specific patterns of circulation need to be characterised. In this study, the impacts and epidemiology of FMD were investigated across a range of traditional livestock-keeping systems in northern Tanzania, including pastoralist, agro-pastoralist and rural smallholder systems. Data were generated through field studies and laboratory analyses between 2010 and 2015. The study involved analysis of existing household survey data and generated serological data from cross-sectional livestock and buffalo samples and longitudinal cattle samples. Serological analyses included non-structural protein ELISAs, serotype-specific solid-phase competitive ELISAs, with optimisation to detect East African FMDV variants, and virus neutralisation testing. Risk factors for FMDV infection and outbreaks were investigated through analysis of cross-sectional serological data in conjunction with a case-control outbreak analysis. A novel Bayesian modeling approach was developed to infer serotype-specific infection history from serological data, and combined with virus isolation data from FMD outbreaks to characterise temporal and spatial patterns of serotype-specific infection. A high seroprevalence of FMD was detected in both northern Tanzanian livestock (69%, [66.5 - 71.4%] in cattle and 48.5%, [45.7-51.3%] in small ruminants) and in buffalo (80.9%, [74.7-86.1%]). Four different serotypes of FMDV (A, O, SAT1 and SAT2) were isolated from livestock. Up to three outbreaks per year were reported by households and active surveillance highlighted up to four serial outbreaks in the same herds within three years. Agro-pastoral and pastoral livestock keepers reported more frequent FMD outbreaks compared to smallholders. Households in all three management systems reported that FMD outbreaks caused significant impacts on milk production and sales, and on animals’ draught power, hence on crop production, with implications for food security and livelihoods. Risk factor analyses showed that older livestock were more likely to be seropositive for FMD (Odds Ratio [OR] 1.4 [1.4-1.5] per extra year) and that cattle (OR 3.3 [2.7-4.0]) were more likely than sheep and goats to be seropositive. Livestock managed by agro-pastoralists (OR 8.1 [2.8-23.6]) or pastoralists (OR 7.1 [2.9-17.6]) were more likely to be seropositive compared to those managed by smallholders. Larger herds (OR: 1.02 [1.01-1.03] per extra bovine) and those that recently acquired new livestock (OR: 5.57 [1.01 – 30.91]) had increased odds of suffering an FMD outbreak. Measures of potential contact with buffalo or with other FMD susceptible wildlife did not increase the likelihood of FMD in livestock in either the cross-sectional serological analysis or case-control outbreak analysis. The Bayesian model was validated to correctly infer from ELISA data the most recent serotype to infect cattle. Consistent with the lack of risk factors related to wildlife contact, temporal and spatial patterns of exposure to specific FMDV serotypes were not tightly linked in cattle and buffalo. In cattle, four serial waves of different FMDV serotypes that swept through southern Kenyan and northern Tanzanian livestock populations over a four-year period dominated infection patterns. In contrast, only two serotypes (SAT1 and SAT2) dominated in buffalo populations. Key conclusions are that FMD has a substantial impact in traditional livestock systems in East Africa. Wildlife does not currently appear to act as an important source of FMDV for East African livestock, and control efforts in the region should initially focus on livestock management and vaccination strategies. A novel modeling approach greatly facilitated the interpretation of serological data and may be a potent epidemiological tool in the African setting. There was a clear temporal pattern of FMDV antigenic dominance across northern Tanzania and southern Kenya. Longer-term research to investigate whether serotype-specific FMDV sweeps are truly predictable, and to shed light on FMD post-infection immunity in animals exposed to serial FMD infections is warranted.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The study of random probability measures is a lively research topic that has attracted interest from different fields in recent years. In this thesis, we consider random probability measures in the context of Bayesian nonparametrics, where the law of a random probability measure is used as prior distribution, and in the context of distributional data analysis, where the goal is to perform inference given avsample from the law of a random probability measure. The contributions contained in this thesis can be subdivided according to three different topics: (i) the use of almost surely discrete repulsive random measures (i.e., whose support points are well separated) for Bayesian model-based clustering, (ii) the proposal of new laws for collections of random probability measures for Bayesian density estimation of partially exchangeable data subdivided into different groups, and (iii) the study of principal component analysis and regression models for probability distributions seen as elements of the 2-Wasserstein space. Specifically, for point (i) above we propose an efficient Markov chain Monte Carlo algorithm for posterior inference, which sidesteps the need of split-merge reversible jump moves typically associated with poor performance, we propose a model for clustering high-dimensional data by introducing a novel class of anisotropic determinantal point processes, and study the distributional properties of the repulsive measures, shedding light on important theoretical results which enable more principled prior elicitation and more efficient posterior simulation algorithms. For point (ii) above, we consider several models suitable for clustering homogeneous populations, inducing spatial dependence across groups of data, extracting the characteristic traits common to all the data-groups, and propose a novel vector autoregressive model to study of growth curves of Singaporean kids. Finally, for point (iii), we propose a novel class of projected statistical methods for distributional data analysis for measures on the real line and on the unit-circle.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

Relevância:

50.00% 50.00%

Publicador:

Resumo:

P>In this study, Bayesian analysis under a threshold animal model was used to estimate genetic correlations between morphological traits (body structure, finishing precocity and muscling) in Nelore cattle evaluated at weaning and yearling. Visual scores obtained from 7651 Nelore cattle at weaning and from 4155 animals at yearling, belonging to the Brazilian Nelore Program, were used. Genetic parameters for the morphological traits were estimated by two-trait Bayesian analysis under a threshold animal model. The genetic correlations between the morphological traits evaluated at two ages of the animal (weaning and yearling) were positive and high for body structure (0.91), finishing precocity (0.96) and muscling (0.94). These results indicate that the traits are mainly determined by the same set of genes of additive action and that direct selection at weaning will also result in genetic progress for the same traits at yearling. Thus, selection of the best genotypes during only one phase of life of the animal is suggested. However, genetic differences between morphological traits were better detected during the growth phase to yearling. Direct selection for body structure, finishing precocity and muscling at only one age, preferentially at yearling, is recommended as genetic differences between traits can be detected at this age.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

A data set of a commercial Nellore beef cattle selection program was used to compare breeding models that assumed or not markers effects to estimate the breeding values, when a reduced number of animals have phenotypic, genotypic and pedigree information available. This herd complete data set was composed of 83,404 animals measured for weaning weight (WW), post-weaning gain (PWG), scrotal circumference (SC) and muscle score (MS), corresponding to 116,652 animals in the relationship matrix. Single trait analyses were performed by MTDFREML software to estimate fixed and random effects solutions using this complete data. The additive effects estimated were assumed as the reference breeding values for those animals. The individual observed phenotype of each trait was adjusted for fixed and random effects solutions, except for direct additive effects. The adjusted phenotype composed of the additive and residual parts of observed phenotype was used as dependent variable for models' comparison. Among all measured animals of this herd, only 3160 animals were genotyped for 106 SNP markers. Three models were compared in terms of changes on animals' rank, global fit and predictive ability. Model 1 included only polygenic effects, model 2 included only markers effects and model 3 included both polygenic and markers effects. Bayesian inference via Markov chain Monte Carlo methods performed by TM software was used to analyze the data for model comparison. Two different priors were adopted for markers effects in models 2 and 3, the first prior assumed was a uniform distribution (U) and, as a second prior, was assumed that markers effects were distributed as normal (N). Higher rank correlation coefficients were observed for models 3_U and 3_N, indicating a greater similarity of these models animals' rank and the rank based on the reference breeding values. Model 3_N presented a better global fit, as demonstrated by its low DIC. The best models in terms of predictive ability were models 1 and 3_N. Differences due prior assumed to markers effects in models 2 and 3 could be attributed to the better ability of normal prior in handle with collinear effects. The models 2_U and 2_N presented the worst performance, indicating that this small set of markers should not be used to genetically evaluate animals with no data, since its predictive ability is restricted. In conclusion, model 3_N presented a slight superiority when a reduce number of animals have phenotypic, genotypic and pedigree information. It could be attributed to the variation retained by markers and polygenic effects assumed together and the normal prior assumed to markers effects, that deals better with the collinearity between markers. (C) 2012 Elsevier B.V. All rights reserved.

Relevância:

50.00% 50.00%

Publicador:

Resumo:

In 2011, there will be an estimated 1,596,670 new cancer cases and 571,950 cancer-related deaths in the US. With the ever-increasing applications of cancer genetics in epidemiology, there is great potential to identify genetic risk factors that would help identify individuals with increased genetic susceptibility to cancer, which could be used to develop interventions or targeted therapies that could hopefully reduce cancer risk and mortality. In this dissertation, I propose to develop a new statistical method to evaluate the role of haplotypes in cancer susceptibility and development. This model will be flexible enough to handle not only haplotypes of any size, but also a variety of covariates. I will then apply this method to three cancer-related data sets (Hodgkin Disease, Glioma, and Lung Cancer). I hypothesize that there is substantial improvement in the estimation of association between haplotypes and disease, with the use of a Bayesian mathematical method to infer haplotypes that uses prior information from known genetics sources. Analysis based on haplotypes using information from publically available genetic sources generally show increased odds ratios and smaller p-values in both the Hodgkin, Glioma, and Lung data sets. For instance, the Bayesian Joint Logistic Model (BJLM) inferred haplotype TC had a substantially higher estimated effect size (OR=12.16, 95% CI = 2.47-90.1 vs. 9.24, 95% CI = 1.81-47.2) and more significant p-value (0.00044 vs. 0.008) for Hodgkin Disease compared to a traditional logistic regression approach. Also, the effect sizes of haplotypes modeled with recessive genetic effects were higher (and had more significant p-values) when analyzed with the BJLM. Full genetic models with haplotype information developed with the BJLM resulted in significantly higher discriminatory power and a significantly higher Net Reclassification Index compared to those developed with haplo.stats for lung cancer. Future analysis for this work could be to incorporate the 1000 Genomes project, which offers a larger selection of SNPs can be incorporated into the information from known genetic sources as well. Other future analysis include testing non-binary outcomes, like the levels of biomarkers that are present in lung cancer (NNK), and extending this analysis to full GWAS studies.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Background: Plasmodium vivax malaria is a major public health challenge in Latin America, Asia and Oceania, with 130-435 million clinical cases per year worldwide. Invasion of host blood cells by P. vivax mainly depends on a type I membrane protein called Duffy binding protein (PvDBP). The erythrocyte-binding motif of PvDBP is a 170 amino-acid stretch located in its cysteine-rich region II (PvDBP(II)), which is the most variable segment of the protein. Methods: To test whether diversifying natural selection has shaped the nucleotide diversity of PvDBP(II) in Brazilian populations, this region was sequenced in 122 isolates from six different geographic areas. A Bayesian method was applied to test for the action of natural selection under a population genetic model that incorporates recombination. The analysis was integrated with a structural model of PvDBP(II), and T-and B-cell epitopes were localized on the 3-D structure. Results: The results suggest that: (i) recombination plays an important role in determining the haplotype structure of PvDBP(II), and (ii) PvDBP(II) appears to contain neutrally evolving codons as well as codons evolving under natural selection. Diversifying selection preferentially acts on sites identified as epitopes, particularly on amino acid residues 417, 419, and 424, which show strong linkage disequilibrium. Conclusions: This study shows that some polymorphisms of PvDBP(II) are present near the erythrocyte-binding domain and might serve to elude antibodies that inhibit cell invasion. Therefore, these polymorphisms should be taken into account when designing vaccines aimed at eliciting antibodies to inhibit erythrocyte invasion.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Motivation: Understanding the patterns of association between polymorphisms at different loci in a population ( linkage disequilibrium, LD) is of fundamental importance in various genetic studies. Many coefficients were proposed for measuring the degree of LD, but they provide only a static view of the current LD structure. Generative models (GMs) were proposed to go beyond these measures, giving not only a description of the actual LD structure but also a tool to help understanding the process that generated such structure. GMs based in coalescent theory have been the most appealing because they link LD to evolutionary factors. Nevertheless, the inference and parameter estimation of such models is still computationally challenging. Results: We present a more practical method to build GM that describe LD. The method is based on learning weighted Bayesian network structures from haplotype data, extracting equivalence structure classes and using them to model LD. The results obtained in public data from the HapMap database showed that the method is a promising tool for modeling LD. The associations represented by the learned models are correlated with the traditional measure of LD D`. The method was able to represent LD blocks found by standard tools. The granularity of the association blocks and the readability of the models can be controlled in the method. The results suggest that the causality information gained by our method can be useful to tell about the conservability of the genetic markers and to guide the selection of subset of representative markers.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Joint generalized linear models and double generalized linear models (DGLMs) were designed to model outcomes for which the variability can be explained using factors and/or covariates. When such factors operate, the usual normal regression models, which inherently exhibit constant variance, will under-represent variation in the data and hence may lead to erroneous inferences. For count and proportion data, such noise factors can generate a so-called overdispersion effect, and the use of binomial and Poisson models underestimates the variability and, consequently, incorrectly indicate significant effects. In this manuscript, we propose a DGLM from a Bayesian perspective, focusing on the case of proportion data, where the overdispersion can be modeled using a random effect that depends on some noise factors. The posterior joint density function was sampled using Monte Carlo Markov Chain algorithms, allowing inferences over the model parameters. An application to a data set on apple tissue culture is presented, for which it is shown that the Bayesian approach is quite feasible, even when limited prior information is available, thereby generating valuable insight for the researcher about its experimental results.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper presents a method for estimating the posterior probability density of the cointegrating rank of a multivariate error correction model. A second contribution is the careful elicitation of the prior for the cointegrating vectors derived from a prior on the cointegrating space. This prior obtains naturally from treating the cointegrating space as the parameter of interest in inference and overcomes problems previously encountered in Bayesian cointegration analysis. Using this new prior and Laplace approximation, an estimator for the posterior probability of the rank is given. The approach performs well compared with information criteria in Monte Carlo experiments. (C) 2003 Elsevier B.V. All rights reserved.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

HE PROBIT MODEL IS A POPULAR DEVICE for explaining binary choice decisions in econometrics. It has been used to describe choices such as labor force participation, travel mode, home ownership, and type of education. These and many more examples can be found in papers by Amemiya (1981) and Maddala (1983). Given the contribution of economics towards explaining such choices, and given the nature of data that are collected, prior information on the relationship between a choice probability and several explanatory variables frequently exists. Bayesian inference is a convenient vehicle for including such prior information. Given the increasing popularity of Bayesian inference it is useful to ask whether inferences from a probit model are sensitive to a choice between Bayesian and sampling theory techniques. Of interest is the sensitivity of inference on coefficients, probabilities, and elasticities. We consider these issues in a model designed to explain choice between fixed and variable interest rate mortgages. Two Bayesian priors are employed: a uniform prior on the coefficients, designed to be noninformative for the coefficients, and an inequality restricted prior on the signs of the coefficients. We often know, a priori, whether increasing the value of a particular explanatory variable will have a positive or negative effect on a choice probability. This knowledge can be captured by using a prior probability density function (pdf) that is truncated to be positive or negative. Thus, three sets of results are compared:those from maximum likelihood (ML) estimation, those from Bayesian estimation with an unrestricted uniform prior on the coefficients, and those from Bayesian estimation with a uniform prior truncated to accommodate inequality restrictions on the coefficients.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

This paper develops a theory that firms seek out new country markets on the basis of expected commercial returns. These expectations depend on judgements about the attractiveness of the market and the firm's competitive position in it, which in turn are influenced by informants. It is the number and strengths of these informants that will underlie the probability of a country being identified and assessed as a new market by any firm.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

We develop a general theoretical framework for exploring the host plant selection behaviour of herbivorous insects. This model can be used to address a number of questions, including the evolution of specialists, generalists, preference hierarchies, and learning. We use our model to: (i) demonstrate the consequences of the extent to which the reproductive success of a foraging female is limited by the rate at which they find host plants (host limitation) or the number of eggs they carry (egg limitation); (ii) emphasize the different consequences of variation in behaviour before and after landing on (locating) a host (termed pre- and post-alighting, respectively); (iii) show that, in contrast to previous predictions, learning can be favoured in post-alighting behaviour-in particular, individuals can be selected to concentrate oviposition on an abundant low-quality host, whilst ignoring a rare higher-quality host; (iv) emphasize the importance of interactions between mechanisms in favouring specialization or learning. (C) 2002 Elsevier Science Ltd.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

The problem of selecting suppliers/partners is a crucial and important part in the process of decision making for companies that intend to perform competitively in their area of activity. The selection of supplier/partner is a time and resource-consuming task that involves data collection and a careful analysis of the factors that can positively or negatively influence the choice. Nevertheless it is a critical process that affects significantly the operational performance of each company. In this work, there were identified five broad selection criteria: Quality, Financial, Synergies, Cost, and Production System. Within these criteria, it was also included five sub-criteria. After the identification criteria, a survey was elaborated and companies were contacted in order to understand which factors have more weight in their decisions to choose the partners. Interpreted the results and processed the data, it was adopted a model of linear weighting to reflect the importance of each factor. The model has a hierarchical structure and can be applied with the Analytic Hierarchy Process (AHP) method or Value Analysis. The goal of the paper it's to supply a selection reference model that can represent an orientation/pattern for a decision making on the suppliers/partners selection process