17 resultados para Bayesian free-knot regression splines
em CentAUR: Central Archive University of Reading - UK
A hierarchical Bayesian model for predicting the functional consequences of amino-acid polymorphisms
Resumo:
Genetic polymorphisms in deoxyribonucleic acid coding regions may have a phenotypic effect on the carrier, e.g. by influencing susceptibility to disease. Detection of deleterious mutations via association studies is hampered by the large number of candidate sites; therefore methods are needed to narrow down the search to the most promising sites. For this, a possible approach is to use structural and sequence-based information of the encoded protein to predict whether a mutation at a particular site is likely to disrupt the functionality of the protein itself. We propose a hierarchical Bayesian multivariate adaptive regression spline (BMARS) model for supervised learning in this context and assess its predictive performance by using data from mutagenesis experiments on lac repressor and lysozyme proteins. In these experiments, about 12 amino-acid substitutions were performed at each native amino-acid position and the effect on protein functionality was assessed. The training data thus consist of repeated observations at each position, which the hierarchical framework is needed to account for. The model is trained on the lac repressor data and tested on the lysozyme mutations and vice versa. In particular, we show that the hierarchical BMARS model, by allowing for the clustered nature of the data, yields lower out-of-sample misclassification rates compared with both a BMARS and a frequen-tist MARS model, a support vector machine classifier and an optimally pruned classification tree.
Resumo:
This study compares the infant mortality profiles of 128 infants from two urban and two rural cemetery sites in medieval England. The aim of this paper is to assess the impact of urbanization and industrialization in terms of endogenous or exogenous causes of death. In order to undertake this analysis, two different methods of estimating gestational age from long bone lengths were used: a traditional regression method and a Bayesian method. The regression method tended to produce more marked peaks at 38 weeks, while the Bayesian method produced a broader range of ages and were more comparable with the expected "natural" mortality profiles. At all the sites, neonatal mortality (28-40 weeks) outweighed post-neonatal mortality (41-48 weeks) with rural Raunds Furnells in Northamptonshire, showing the highest number of neonatal deaths and post-medieval Spitalfields, London, showing a greater proportion of deaths due to exogenous or environmental factors. Of the four sites under study, Wharram Percy in Yorkshire showed the most convincing "natural" infant mortality profile, suggesting the inclusion of all births (i.e., stillbirths and unbaptised infants).
Resumo:
Air-dried and 3 mm pore size sieved soil was amended with neem crude formulations (leaves and cake) @ 3% w/w and a refined product, aza @ 0.05 and 0.1 w/w. Three days after treatment, 500 eggs of M. javanica held in 2 ml water were added in each dish. In another experiment, soil was amended with neem crude formulations @ 10. 5, 2.5 and 1% w/w and refined formulation aza @ 0.025, 0.05, 0.1 and 0.5% w/w. Three days after amendment 1000 plus minus 21 freshly hatched J2 held in 3 ml water were added to the amended soil. Untreated soil was kept as control. Comparison of treatments means showed that all the neem formulations caused significant reduction of hatching. Neem crude formulations were more effective in reducing hatching as compared to commercial product aza. Among the crude formulations, neem leaves were most effective in reducing hatching. In other experiment all the doses of neem crude and refined formulations differed significantly with control in reducing the mobility of juveniles. It was observed that by increasing the dose of the formulations the mobility was reduced accordingly.
Resumo:
Root-knot nematode [RKN] (Meloidogyne incognita) can increase the severity of Verticillium (V dahliae) and Fusarium (F oxysporum f.sp. vasinfectum) wilt diseases in cotton (Gossypium hirsutum). This study was conducted to determine some of the physiological responses caused by nematode invasion that might decrease resistance to vascular wilt diseases. The effect of RKN was investigated on spore germination and protein, carbohydrate and peroxidase content in the xylem fluids extracted from nematode-infected plants. Two cotton cultivars were used with different levels of resistance to both of the wilt pathogens. Spore germination was greater in the xylem fluids from nematode-infected plants than from nematode-free plants. The effect on spore germination was greater in the Fusarium-resistant cultivar (51%). Analysis of these fluids showed a decrease in total protein and carbohydrate levels for both wilt-resistant cultivars, and an increase in peroxidase concentration. Fluids from nematode-free plants of the Verticillium-resistant cultivar contained 46% more peroxidase than the Fusarium-resistant cultivar. The results provide further evidence that the effect of RKN on vascular wilt resistance is systemic and not only local. Changes in metabolites in the xylem pass from the root to the stem, accelerating disease development.
Resumo:
In this paper, Bayesian decision procedures are developed for dose-escalation studies based on bivariate observations of undesirable events and signs of therapeutic benefit. The methods generalize earlier approaches taking into account only the undesirable outcomes. Logistic regression models are used to model the two responses, which are both assumed to take a binary form. A prior distribution for the unknown model parameters is suggested and an optional safety constraint can be included. Gain functions to be maximized are formulated in terms of accurate estimation of the limits of a therapeutic window or optimal treatment of the next cohort of subjects, although the approach could be applied to achieve any of a wide variety of objectives. The designs introduced are illustrated through simulation and retrospective implementation to a completed dose-escalation study. Copyright © 2006 John Wiley & Sons, Ltd.
Resumo:
Recently, various approaches have been suggested for dose escalation studies based on observations of both undesirable events and evidence of therapeutic benefit. This article concerns a Bayesian approach to dose escalation that requires the user to make numerous design decisions relating to the number of doses to make available, the choice of the prior distribution, the imposition of safety constraints and stopping rules, and the criteria by which the design is to be optimized. Results are presented of a substantial simulation study conducted to investigate the influence of some of these factors on the safety and the accuracy of the procedure with a view toward providing general guidance for investigators conducting such studies. The Bayesian procedures evaluated use logistic regression to model the two responses, which are both assumed to be binary. The simulation study is based on features of a recently completed study of a compound with potential benefit to patients suffering from inflammatory diseases of the lung.
Resumo:
This study presents a new simple approach for combining empirical with raw (i.e., not bias corrected) coupled model ensemble forecasts in order to make more skillful interval forecasts of ENSO. A Bayesian normal model has been used to combine empirical and raw coupled model December SST Niño-3.4 index forecasts started at the end of the preceding July (5-month lead time). The empirical forecasts were obtained by linear regression between December and the preceding July Niño-3.4 index values over the period 1950–2001. Coupled model ensemble forecasts for the period 1987–99 were provided by ECMWF, as part of the Development of a European Multimodel Ensemble System for Seasonal to Interannual Prediction (DEMETER) project. Empirical and raw coupled model ensemble forecasts alone have similar mean absolute error forecast skill score, compared to climatological forecasts, of around 50% over the period 1987–99. The combined forecast gives an increased skill score of 74% and provides a well-calibrated and reliable estimate of forecast uncertainty.
Resumo:
In this paper, Bayesian decision procedures are developed for dose-escalation studies based on binary measures of undesirable events and continuous measures of therapeutic benefit. The methods generalize earlier approaches where undesirable events and therapeutic benefit are both binary. A logistic regression model is used to model the binary responses, while a linear regression model is used to model the continuous responses. Prior distributions for the unknown model parameters are suggested. A gain function is discussed and an optional safety constraint is included. Copyright (C) 2006 John Wiley & Sons, Ltd.
Resumo:
Root-knot nematodes (Meloidogyne spp.) are the most significant plant-parasitic nematodes that damage many crops all over the world. The free-living second stage juvenile (J2) is the infective stage that enters plants. The J2s move in the soil water films to reach the root zone. The bacterium Pasteuria penetrans is an obligate parasite of root-knot nematodes, is cosmopolitan, frequently encountered in many climates and environmental conditions and is considered promising for the control of Meloidogyne spp. The infection potential of P. penetrans to nematodes is well studied but not the attachment effects on the movement of root-knot nematode juveniles, image analysis techniques were used to characterize movement of individual juveniles with or without P. penetrans spores attached to their cuticles. Methods include the study of nematode locomotion based on (a) the centroid body point, (b) shape analysis and (c) image stack analysis. All methods proved that individual J2s without P. penetrans spores attached have a sinusoidal forward movement compared with those encumbered with spores. From these separate analytical studies of encumbered and unencumbered nematodes, it was possible to demonstrate how the presence of P. penetrans spores on a nematode body disrupted the normal movement of the nematode.
Resumo:
We have developed a new Bayesian approach to retrieve oceanic rain rate from the Tropical Rainfall Measuring Mission (TRMM) Microwave Imager (TMI), with an emphasis on typhoon cases in the West Pacific. Retrieved rain rates are validated with measurements of rain gauges located on Japanese islands. To demonstrate improvement, retrievals are also compared with those from the TRMM/Precipitation Radar (PR), the Goddard Profiling Algorithm (GPROF), and a multi-channel linear regression statistical method (MLRS). We have found that qualitatively, all methods retrieved similar horizontal distributions in terms of locations of eyes and rain bands of typhoons. Quantitatively, our new Bayesian retrievals have the best linearity and the smallest root mean square (RMS) error against rain gauge data for 16 typhoon overpasses in 2004. The correlation coefficient and RMS of our retrievals are 0.95 and ~2 mm hr-1, respectively. In particular, at heavy rain rates, our Bayesian retrievals outperform those retrieved from GPROF and MLRS. Overall, the new Bayesian approach accurately retrieves surface rain rate for typhoon cases. Accurate rain rate estimates from this method can be assimilated in models to improve forecast and prevent potential damages in Taiwan during typhoon seasons.
Resumo:
Statistical methods of inference typically require the likelihood function to be computable in a reasonable amount of time. The class of “likelihood-free” methods termed Approximate Bayesian Computation (ABC) is able to eliminate this requirement, replacing the evaluation of the likelihood with simulation from it. Likelihood-free methods have gained in efficiency and popularity in the past few years, following their integration with Markov Chain Monte Carlo (MCMC) and Sequential Monte Carlo (SMC) in order to better explore the parameter space. They have been applied primarily to estimating the parameters of a given model, but can also be used to compare models. Here we present novel likelihood-free approaches to model comparison, based upon the independent estimation of the evidence of each model under study. Key advantages of these approaches over previous techniques are that they allow the exploitation of MCMC or SMC algorithms for exploring the parameter space, and that they do not require a sampler able to mix between models. We validate the proposed methods using a simple exponential family problem before providing a realistic problem from human population genetics: the comparison of different demographic models based upon genetic data from the Y chromosome.
Resumo:
We present a model of market participation in which the presence of non-negligible fixed costs leads to random censoring of the traditional double-hurdle model. Fixed costs arise when household resources must be devoted a priori to the decision to participate in the market. These costs, usually of time, are manifested in non-negligible minimum-efficient supplies and supply correspondence that requires modification of the traditional Tobit regression. The costs also complicate econometric estimation of household behavior. These complications are overcome by application of the Gibbs sampler. The algorithm thus derived provides robust estimates of the fixed-costs, double-hurdle model. The model and procedures are demonstrated in an application to milk market participation in the Ethiopian highlands.
Resumo:
Major Depressive Disorder (MDD) has been associated with biased processing and abnormal regulation of negative and positive information, which may result from compromised coordinated activity of prefrontal and subcortical brain regions involved in evaluating emotional information. We tested whether patients with MDD show distributed changes in functional connectivity with a set of independently derived brain networks that have shown high correspondence with different task demands, including stimulus salience and emotional processing. We further explored if connectivity during emotional word processing related to the tendency to engage in positive or negative emotional states. In this study, 25 medication-free MDD patients without current or past comorbidity and matched controls (n=25) performed an emotional word-evaluation task during functional MRI. Using a dual regression approach, individual spatial connectivity maps representing each subject’s connectivity with each standard network were used to evaluate between-group differences and effects of positive and negative emotionality (extraversion and neuroticism, respectively, as measured with the NEO-FFI). Results showed decreased functional connectivity of the medial prefrontal cortex, ventrolateral prefrontal cortex, and ventral striatum with the fronto-opercular salience network in MDD patients compared to controls. In patients, abnormal connectivity was related to extraversion, but not neuroticism. These results confirm the hypothesis of a relative (para)limbic-cortical decoupling that may explain dysregulated affect in MDD. As connectivity of these regions with the salience network was related to extraversion, but not to general depression severity or negative emotionality, dysfunction of this network may be responsible for the failure to sustain engagement in rewarding behavior.
Resumo:
Approximate Bayesian computation (ABC) methods make use of comparisons between simulated and observed summary statistics to overcome the problem of computationally intractable likelihood functions. As the practical implementation of ABC requires computations based on vectors of summary statistics, rather than full data sets, a central question is how to derive low-dimensional summary statistics from the observed data with minimal loss of information. In this article we provide a comprehensive review and comparison of the performance of the principal methods of dimension reduction proposed in the ABC literature. The methods are split into three nonmutually exclusive classes consisting of best subset selection methods, projection techniques and regularization. In addition, we introduce two new methods of dimension reduction. The first is a best subset selection method based on Akaike and Bayesian information criteria, and the second uses ridge regression as a regularization procedure. We illustrate the performance of these dimension reduction techniques through the analysis of three challenging models and data sets.