970 resultados para Bacterial artificial chromosome sequencing
Resumo:
Next-generation sequencing (NGS) is a valuable tool for the detection and quantification of HIV-1 variants in vivo. However, these technologies require detailed characterization and control of artificially induced errors to be applicable for accurate haplotype reconstruction. To investigate the occurrence of substitutions, insertions, and deletions at the individual steps of RT-PCR and NGS, 454 pyrosequencing was performed on amplified and non-amplified HIV-1 genomes. Artificial recombination was explored by mixing five different HIV-1 clonal strains (5-virus-mix) and applying different RT-PCR conditions followed by 454 pyrosequencing. Error rates ranged from 0.04-0.66% and were similar in amplified and non-amplified samples. Discrepancies were observed between forward and reverse reads, indicating that most errors were introduced during the pyrosequencing step. Using the 5-virus-mix, non-optimized, standard RT-PCR conditions introduced artificial recombinants in a fraction of at least 30% of the reads that subsequently led to an underestimation of true haplotype frequencies. We minimized the fraction of recombinants down to 0.9-2.6% by optimized, artifact-reducing RT-PCR conditions. This approach enabled correct haplotype reconstruction and frequency estimations consistent with reference data obtained by single genome amplification. RT-PCR conditions are crucial for correct frequency estimation and analysis of haplotypes in heterogeneous virus populations. We developed an RT-PCR procedure to generate NGS data useful for reliable haplotype reconstruction and quantification.
Resumo:
Bacterial biofilms provide cues for the settlement of marine invertebrates such as coral larvae, and are therefore important for the resilience and recovery of coral reefs. This study aimed to better understand how ocean acidification may affect the community composition and diversity of bacterial biofilms on surfaces under naturally reduced pH conditions. Settlement tiles were deployed at coral reefs in Papua New Guinea along pH gradients created by two CO2 seeps, and upper and lower tiles surfaces were sampled 5 and 13 months after deployment. Automated Ribosomal Intergenic Spacer Analysis were used to characterize more than 200 separate bacterial communities, complemented by amplicon sequencing of the bacterial 16S rRNA gene of 16 samples. The bacterial biofilm consisted predominantly of Alpha-, Gamma- and Deltaproteobacteria, as well as Cyanobacteria, Flavobacteriia and Cytophaga, whereas putative settlement-inducing taxa only accounted for a small fraction of the community. Bacterial biofilm composition was heterogeneous with approximately 25% shared operational taxonomic units between samples. Among the observed environmental parameters, pH only had a weak effect on community composition (R² ~ 1%) and did not affect community richness and evenness. In contrast, there were strong differences between upper and lower surfaces (contrasting in light exposure and grazing intensity). There also appeared to be a strong interaction between bacterial biofilm composition and the macroscopic components of the tile community. Our results suggest that on mature settlement surfaces in situ, pH does not have a strong impact on the composition of bacterial biofilms. Other abiotic and biotic factors such as light exposure and interactions with other organisms may be more important in shaping bacterial biofilms than changes in seawater pH.
Resumo:
The pufferfish Fugu rubripes has a genome ≈7.5 times smaller than that of mammals but with a similar number of genes. Although conserved synteny has been demonstrated between pufferfish and mammals across some regions of the genome, there is some controversy as to what extent Fugu will be a useful model for the human genome, e.g., [Gilley, J., Armes, N. & Fried, M. (1997) Nature (London) 385, 305–306]. We report extensive conservation of synteny between a 1.5-Mb region of human chromosome 11 and <100 kb of the Fugu genome in three overlapping cosmids. Our findings support the idea that the majority of DNA in the region of human chromosome 11p13 is intergenic. Comparative analysis of three unrelated genes with quite different roles, WT1, RCN1, and PAX6, has revealed differences in their structural evolution. Whereas the human WT1 gene can generate 16 protein isoforms via a combination of alternative splicing, RNA editing, and alternative start site usage, our data predict that Fugu WT1 is capable of generating only two isoforms. This raises the question of the extent to which the evolution of WT1 isoforms is related to the evolution of the mammalian genitourinary system. In addition, this region of the Fugu genome shows a much greater overall compaction than usual but with significant noncoding homology observed at the PAX6 locus, implying that comparative genomics has identified regulatory elements associated with this gene.
Resumo:
Several microbial systems have been shown to yield advantageous mutations in slowly growing or nongrowing cultures. In one assay system, the stationary-phase mutation mechanism differs from growth-dependent mutation, demonstrating that the two are different processes. This system assays reversion of a lac frameshift allele on an F′ plasmid in Escherichia coli. The stationary-phase mutation mechanism at lac requires recombination proteins of the RecBCD double-strand-break repair system and the inducible error-prone DNA polymerase IV, and the mutations are mostly −1 deletions in small mononucleotide repeats. This mutation mechanism is proposed to occur by DNA polymerase errors made during replication primed by recombinational double-strand-break repair. It has been suggested that this mechanism is confined to the F plasmid. However, the cells that acquire the adaptive mutations show hypermutation of unrelated chromosomal genes, suggesting that chromosomal sites also might experience recombination protein-dependent stationary-phase mutation. Here we test directly whether the stationary-phase mutations in the bacterial chromosome also occur via a recombination protein- and pol IV-dependent mechanism. We describe an assay for chromosomal mutation in cells carrying the F′ lac. We show that the chromosomal mutation is recombination protein- and pol IV-dependent and also is associated with general hypermutation. The data indicate that, at least in these male cells, recombination protein-dependent stationary-phase mutation is a mechanism of general inducible genetic change capable of affecting genes in the bacterial chromosome.
Resumo:
Regulation of chromosome inheritance is essential to ensure proper transmission of genetic information. To accomplish accurate genome segregation, cells organize their chromosomes and actively separate them prior to cytokinesis. In Bacillus subtilis the Spo0J protein is required for accurate chromosome segregation and it regulates the developmental switch from vegetative growth to sporulation. Spo0J is a DNA-binding protein that recognizes at least eight identified parS sites located near the origin of replication. As judged by fluorescence microscopy, Spo0J forms discrete foci associated with the oriC region of the chromosome throughout the cell cycle. In an attempt to determine the mechanisms utilized by Spo0J to facilitate productive chromosome segregation, we have investigated the DNA binding activity of Spo0J. In vivo we find Spo0J associates with several kilobases of DNA flanking its specific binding sites (parS) through a parS-dependent nucleation event that promotes lateral spreading of Spo0J along the chromosome. Using purified components we find that Spo0J has the ability to coat non-specific DNA substrates. These 'Spo0J domains' provide large structures near oriC that could potentially demark, organize or localize the origin region of the chromosome.
Resumo:
Background Phylogeographic reconstruction of some bacterial populations is hindered by low diversity coupled with high levels of lateral gene transfer. A comparison of recombination levels and diversity at seven housekeeping genes for eleven bacterial species, most of which are commonly cited as having high levels of lateral gene transfer shows that the relative contributions of homologous recombination versus mutation for Burkholderia pseudomallei is over two times higher than for Streptococcus pneumoniae and is thus the highest value yet reported in bacteria. Despite the potential for homologous recombination to increase diversity, B. pseudomallei exhibits a relative lack of diversity at these loci. In these situations, whole genome genotyping of orthologous shared single nucleotide polymorphism loci, discovered using next generation sequencing technologies, can provide very large data sets capable of estimating core phylogenetic relationships. We compared and searched 43 whole genome sequences of B. pseudomallei and its closest relatives for single nucleotide polymorphisms in orthologous shared regions to use in phylogenetic reconstruction. Results Bayesian phylogenetic analyses of >14,000 single nucleotide polymorphisms yielded completely resolved trees for these 43 strains with high levels of statistical support. These results enable a better understanding of a separate analysis of population differentiation among >1,700 B. pseudomallei isolates as defined by sequence data from seven housekeeping genes. We analyzed this larger data set for population structure and allele sharing that can be attributed to lateral gene transfer. Our results suggest that despite an almost panmictic population, we can detect two distinct populations of B. pseudomallei that conform to biogeographic patterns found in many plant and animal species. That is, separation along Wallace's Line, a biogeographic boundary between Southeast Asia and Australia. Conclusion We describe an Australian origin for B. pseudomallei, characterized by a single introduction event into Southeast Asia during a recent glacial period, and variable levels of lateral gene transfer within populations. These patterns provide insights into mechanisms of genetic diversification in B. pseudomallei and its closest relatives, and provide a framework for integrating the traditionally separate fields of population genetics and phylogenetics for other bacterial species with high levels of lateral gene transfer.
Massively parallel sequencing and analysis of expressed sequence tags in a successful invasive plant
Resumo:
Background Invasive species pose a significant threat to global economies, agriculture and biodiversity. Despite progress towards understanding the ecological factors associated with plant invasions, limited genomic resources have made it difficult to elucidate the evolutionary and genetic factors responsible for invasiveness. This study presents the first expressed sequence tag (EST) collection for Senecio madagascariensis, a globally invasive plant species. Methods We used pyrosequencing of one normalized and two subtractive libraries, derived from one native and one invasive population, to generate an EST collection. ESTs were assembled into contigs, annotated by BLAST comparison with the NCBI non-redundant protein database and assigned gene ontology (GO) terms from the Plant GO Slim ontologies. Key Results Assembly of the 221 746 sequence reads resulted in 12 442 contigs. Over 50 % (6183) of 12 442 contigs showed significant homology to proteins in the NCBI database, representing approx. 4800 independent transcripts. The molecular transducer GO term was significantly over-represented in the native (South African) subtractive library compared with the invasive (Australian) library. Based on NCBI BLAST hits and literature searches, 40 % of the molecular transducer genes identified in the South African subtractive library are likely to be involved in response to biotic stimuli, such as fungal, bacterial and viral pathogens. Conclusions This EST collection is the first representation of the S. madagascariensis transcriptome and provides an important resource for the discovery of candidate genes associated with plant invasiveness. The over-representation of molecular transducer genes associated with defence responses in the native subtractive library provides preliminary support for aspects of the enemy release and evolution of increased competitive ability hypotheses in this successful invasive. This study highlights the contribution of next-generation sequencing to better understanding the molecular mechanisms underlying ecological hypotheses that are important in successful plant invasions.
Resumo:
Cytogenetic and loss of heterozygosity (LOH) studies have long indicated the presence of a tumor suppressor gene (TSG) on 9p involved in the development of melanoma. Although LOH at 9p has been reported in approximately 60% of melanoma tumors, only 5-10% of these tumors have been shown to carry CDKN2A mutations, raising the possibility that another TSG involved in melanoma maps to chromosome 9p. To investigate this possibility, a panel of 37 melanomas derived from 35 individuals was analyzed for CDKN2A mutations by single-strand conformation polymorphism analysis and sequencing. The melanoma samples were then typed for 15 markers that map to 9p13-24 to investigate LOH trends in this region. In those tumors demonstrating retention of heterozygosity at markers flanking CDKN2A and LOH on one or both sides of the gene, multiplex microsatellite PCR was performed to rule out homozygous deletion of the region encompassing CDKN2A. CDKN2A mutations were found in tumors from 5 patients [5 (14%) of 35], 4 of which demonstrated LOH across the entire region examined. The remaining tumor with no observed LOH carried two point mutations, one on each allele. Although LOH was identified at one or more markers in 22 (59%) of 37 melanoma tumors corresponding to 20 (57%) of 35 individuals, only 11 tumors from 9 individuals [9 (26%) of 35] demonstrated LOH at D9S942 and D9S1748 the markers closest to CDKN2A. Of the remaining 11 tumors with LOH 9 demonstrated LOH at two or more contiguous markers either centromeric and/or telomeric to CDKN2A while retaining heterozygosity at several markers adjacent to CDKN2A. Multiplex PCR revealed one tumor carried a homozygous deletion extending from D9S1748 to the IFN-alpha locus. In the remaining eight tumors, multiplex PCR demonstrated that the observed heterozygosity was not attributable to homozygous deletion and stromal contamination at D9S1748, D9S942, or D9S974, as measured by comparative amplification strengths, which indicates that retention of heterozygosity with flanking LOH does not always indicate a homozygous deletion. This report supports the conclusions of previous studies that a least two TSGs involved in melanoma development in addition to CDKN2A may reside on chromosome 9p.
Resumo:
This item provides supplementary materials for the paper mentioned in the title, specifically a range of organisms used in the study. The full abstract for the main paper is as follows: Next Generation Sequencing (NGS) technologies have revolutionised molecular biology, allowing clinical sequencing to become a matter of routine. NGS data sets consist of short sequence reads obtained from the machine, given context and meaning through downstream assembly and annotation. For these techniques to operate successfully, the collected reads must be consistent with the assumed species or species group, and not corrupted in some way. The common bacterium Staphylococcus aureus may cause severe and life-threatening infections in humans,with some strains exhibiting antibiotic resistance. In this paper, we apply an SVM classifier to the important problem of distinguishing S. aureus sequencing projects from alternative pathogens, including closely related Staphylococci. Using a sequence k-mer representation, we achieve precision and recall above 95%, implicating features with important functional associations.
Resumo:
Next Generation Sequencing (NGS) has revolutionised molec- ular biology, allowing routine clinical sequencing. NGS data consists of short sequence reads, given context through downstream assembly and annotation, a process requiring reads consistent with the assumed species or species group. The common bacterium Staphylococcus aureus may cause severe and life-threatening infections in humans, with some strains exhibiting antibiotic resistance. Here we apply an SVM classifier to the important problem of distinguishing S. aureus sequencing projects from other pathogens, including closely related Staphylococci. Using a sequence k-mer representation, we achieve precision and recall above 95%, implicating features with important functional associations.
Resumo:
Multiple sclerosis (MS) is a common chronic inflammatory disease of the central nervous system. Susceptibility to the disease is affected by both environmental and genetic factors. Genetic factors include haplotypes in the histocompatibility complex (MHC) and over 50 non-MHC loci reported by genome-wide association studies. Amongst these, we previously reported polymorphisms in chromosome 12q13-14 with a protective effect in individuals of European descent. This locus spans 288 kb and contains 17 genes, including several candidate genes which have potentially significant pathogenic and therapeutic implications. In this study, we aimed to fine-map this locus. We have implemented a two-phase study: a variant discovery phase where we have used next-generation sequencing and two target-enrichment strategies [long-range polymerase chain reaction (PCR) and Nimblegen's solution phase hybridization capture] in pools of 25 samples; and a genotyping phase where we genotyped 712 variants in 3577 healthy controls and 3269 MS patients. This study confirmed the association (rs2069502, P = 9.9 × 10−11, OR = 0.787) and narrowed down the locus of association to an 86.5 kb region. Although the study was unable to pinpoint the key-associated variant, we have identified a 42 (genotyped and imputed) single-nucleotide polymorphism haplotype block likely to harbour the causal variant. No evidence of association at previously reported low-frequency variants in CYP27B1 was observed. As part of the study we compared variant discovery performance using two target-enrichment strategies. We concluded that our pools enriched with Nimblegen's solution phase hybridization capture had better sensitivity to detect true variants than the pools enriched with long-range PCR, whilst specificity was better in the long-range PCR-enriched pools compared with solution phase hybridization capture enriched pools; this result has important implications for the design of future fine-mapping studies.
Resumo:
The aim of this study was to investigate through direct sequencing the insulin receptor (INSR) gene in DNA samples from a migraine affected family previously showing linkage to chromosome 19p13 in an attempt to detect disease associated mutations. Migraine is a common debilitating disorder with a significant genetic component. At present, the number and type of genes involved in the common forms of migraine are not clear. The INSR gene on chromosome 19p13.3-13.2 is a gene of interest since a number of single nucleotide polymorphisms (SNPs) located within the gene have been implicated in migraine with (MA) and without aura (MO). Six DNA samples obtained from non-founding migraine affected members of migraine family 1 (MF1) were used in this study. Genomic DNA was sequenced for the INSR gene in exons 1-22 and the promoter region. In the six migraine family member samples, previously reported SNPs were detected within two exonic DNA coding regions of the INSR gene. These SNPs, in exons 13 and 17, do not alter the normal INSR polypeptide sequence. In addition, intron 7 also revealed a DNA base sequence variation. For the 5' untranslated promoter region of the gene, no mutations or polymorphisms were detected. In conclusion, this study detected no INSR mutations in affected members of a chromosome 19 linked migraine pedigree. Hence, migraine linkage to this chromosomal region may involve other candidate genes.
Resumo:
OBJECTIVES: The aims of the study were: (i) to extend our linkage analysis of chromosome 1q microsatellite markers in predominantly migraine with aura pedigrees and (ii) to test the novel FHM-2 ATP1A2 gene for involvement in these migraine affected pedigrees and a previous pedigree (MF14) showing evidence of linkage of markers to C1q31. METHODS: A chromosome 1 scan (31 markers) was performed in 21 multiplex pedigrees affected predominantly with migraine with aura (MA). The known FHM-2 ATP1A2 gene mutations were tested, by sequencing, for the involvement in MA and migraine without aura (MO) in these pedigrees. Sequencing was performed in the coding areas of the ATP1A2 gene through three MA individuals from MF14. RESULTS: Evidence for linkage was obtained at C1q23 to markers spanning the ATP1A2 gene. However, testing of the known ATP1A2 gene mutations (for FHM) in common migraine probands of pedigrees showing excess allele sharing was negative. Sequencing of the entire coding areas of the gene through all the three MA affected from MF14 was also negative for mutations. DISCUSSION: Microsatellite markers on chromosome 1q23 show evidence of excess allele sharing in MA and some MO pedigrees, suggesting linkage to the common forms of migraine and the presence of a susceptibility gene in this region. The FHM-2 (ATP1A2 gene) does not seem to be involved in the common types of migraine. Despite certain clinical characteristics, the genetic correlation between FHM and familial typical migraine remains unclear. Several candidate genes lie within the C1q23 and C1q31 cytogenetic regions; therefore, further studies are needed.