2 resultados para Spike rush
em Glasgow Theses Service
Resumo:
Understanding how virus strains offer protection against closely related emerging strains is vital for creating effective vaccines. For many viruses, including Foot-and-Mouth Disease Virus (FMDV) and the Influenza virus where multiple serotypes often co-circulate, in vitro testing of large numbers of vaccines can be infeasible. Therefore the development of an in silico predictor of cross-protection between strains is important to help optimise vaccine choice. Vaccines will offer cross-protection against closely related strains, but not against those that are antigenically distinct. To be able to predict cross-protection we must understand the antigenic variability within a virus serotype, distinct lineages of a virus, and identify the antigenic residues and evolutionary changes that cause the variability. In this thesis we present a family of sparse hierarchical Bayesian models for detecting relevant antigenic sites in virus evolution (SABRE), as well as an extended version of the method, the extended SABRE (eSABRE) method, which better takes into account the data collection process. The SABRE methods are a family of sparse Bayesian hierarchical models that use spike and slab priors to identify sites in the viral protein which are important for the neutralisation of the virus. In this thesis we demonstrate how the SABRE methods can be used to identify antigenic residues within different serotypes and show how the SABRE method outperforms established methods, mixed-effects models based on forward variable selection or l1 regularisation, on both synthetic and viral datasets. In addition we also test a number of different versions of the SABRE method, compare conjugate and semi-conjugate prior specifications and an alternative to the spike and slab prior; the binary mask model. We also propose novel proposal mechanisms for the Markov chain Monte Carlo (MCMC) simulations, which improve mixing and convergence over that of the established component-wise Gibbs sampler. The SABRE method is then applied to datasets from FMDV and the Influenza virus in order to identify a number of known antigenic residue and to provide hypotheses of other potentially antigenic residues. We also demonstrate how the SABRE methods can be used to create accurate predictions of the important evolutionary changes of the FMDV serotypes. In this thesis we provide an extended version of the SABRE method, the eSABRE method, based on a latent variable model. The eSABRE method takes further into account the structure of the datasets for FMDV and the Influenza virus through the latent variable model and gives an improvement in the modelling of the error. We show how the eSABRE method outperforms the SABRE methods in simulation studies and propose a new information criterion for selecting the random effects factors that should be included in the eSABRE method; block integrated Widely Applicable Information Criterion (biWAIC). We demonstrate how biWAIC performs equally to two other methods for selecting the random effects factors and combine it with the eSABRE method to apply it to two large Influenza datasets. Inference in these large datasets is computationally infeasible with the SABRE methods, but as a result of the improved structure of the likelihood, we are able to show how the eSABRE method offers a computational improvement, leading it to be used on these datasets. The results of the eSABRE method show that we can use the method in a fully automatic manner to identify a large number of antigenic residues on a variety of the antigenic sites of two Influenza serotypes, as well as making predictions of a number of nearby sites that may also be antigenic and are worthy of further experiment investigation.
Resumo:
The gammacoronavirus, Infectious Bronchitis Virus (IBV), is a respiratory pathogen of chickens. IBV is a constant threat to poultry production as established vaccines are often ineffective against emerging strains. This requires constant and rapid vaccine production by a process of viral attenuation by egg passage, but the essential forces leading to attenuation in the virus have not yet been characterised. Knowledge of these factors will lead to the development of more effective, rationally attenuated, live vaccines and reduction of the mortality and morbidity caused by this pathogen. M41 CK strain was egg passaged four times many years ago at Houghton Poultry Research Station and stored as M41-CK EP4 (stock virus at The Pirbright Institute since 1992). It was the first egg passage to have its genome pyrosequenced and was therefore used as the baseline reference. The overall aim of this project was to analyse deep sequence data obtained from four IBV isolates (called A, A1, C and D) each originating from the common M41-CK EP4 (ep4) and independently passaged multiple times in embryonated chicken eggs (figure 1.1). Highly polymorphic encoding regions of the IBV genome were then identified which are likely involved in the attenuation process through the formation of independent SNPs and/or SNP clusters. This was then used to direct targeted investigation of SNPs during the attenuation process of the four IBV passages. A previously generated deep sequence dataset was used as a preliminary map of attenuation for one virulent strain of IBV. This investigation showed the nucleocapsid and spike as two highly polymorphic encoding regions within the IBV genome with the highest proportion of SNPs compared to encoding region size. This analysis then led to more focussed studies of the nucleocapsid and spike encoding region with the ultimate aim of mapping key attenuating regions and nucleotide positions. The 454 pyrosequencing data and further investigation of nucleocapsid and spike encoding regions have identified the SNPs present at the same nucleotide positions within analysed A, A1, C and D isolates. These SNPs probably play a crucial role in viral attenuation and universal vaccine production but it is not clear if independent SNPs are also involved in loss of virulence. The majority of SNPs accumulated at different nucleotide positions without further continuation in Sanger sequenced egg passages presenting S2 subunit (spike) and nucleocapsid as polymorphic encoding regions which in nature remain highly conserved.