4 resultados para Genome sequence analysis
em Brock University, Canada
Resumo:
The complete genome of an Erwinia amylovora bacteriophage, vB_EamM_Ea35-70 (Ea35-70), is 271,084 bp, encodes 318 putative proteins, and contains one tRNA. Comparative analysis with other Myoviridae genomes suggests that Ea35-70 is related to the Phikzlikevirus genus within the family Myoviridae, since 26% of Ea35-70 proteins share homology to proteins in Pseudomonas phage φKZ.
Resumo:
Genome sequence varies in numerous ways among individuals although the gross architecture is fixed for all humans. Retrotransposons create one of the most abundant structural variants in the human genome and are divided in many families, with certain members in some families, e.g., L1, Alu, SVA, and HERV-K, remaining active for transposition. Along with other types of genomic variants, retrotransponson-derived variants contribute to the whole spectrum of genome variants in humans. With the advancement of sequencing techniques, many human genomes are being sequenced at the individual level, fueling the comparative research on these variants among individuals. In this thesis, the evolution and functional impact of structural variations is examined primarily focusing on retrotransposons in the context of human evolution. The thesis comprises of three different studies on the topics that are presented in three data chapters. First, the recent evolution of all human specific AluYb members, representing the second most active subfamily of Alus, was tracked to identify their source/master copy using a novel approach. All human-specific AluYb elements from the reference genome were extracted, aligned with one another to construct clusters of similar copies and each cluster was analyzed to generate the evolutionary relationship between the members of the cluster. The approach resulted in identification of one major driver copy of all human specific Yb8 and the source copy of the Yb9 lineage. Three new subfamilies within the AluYb family – Yb8a1, Yb10 and Yb11 were also identified, with Yb11 being the youngest and most polymorphic. Second, an attempt to construct a relation between transposable elements (TEs) and tandem repeats (TRs) was made at a genome-wide scale for the first time. Upon sequence comparison, positional cross-checking and other relevant analyses, it was observed that over 20% of all TRs are derived from TEs. This result established the first connection between these two types of repetitive elements, and extends our appreciation for the impact of TEs on genomes. Furthermore, only 6% of these TE-derived TRs follow the already postulated initiation and expansion mechanisms, suggesting that the others are likely to follow a yet-unidentified mechanism. Third, by taking a combination of multiple computational approaches involving all types of genetic variations published so far including transposable elements, the first whole genome sequence of the most recent common ancestor of all modern human populations that diverged into different populations around 125,000-100,000 years ago was constructed. The study shows that the current reference genome sequence is 8.89 million base pairs larger than our common ancestor’s genome, contributed by a whole spectrum of genetic mechanisms. The use of this ancestral reference genome to facilitate the analysis of personal genomes was demonstrated using an example genome and more insightful recent evolutionary analyses involving the Neanderthal genome. The three data chapters presented in this thesis conclude that the tandem repeats and transposable elements are not two entirely distinctly isolated elements as over 20% TRs are actually derived from TEs. Certain subfamilies of TEs themselves are still evolving with the generation of newer subfamilies. The evolutionary analyses of all TEs along with other genomic variants helped to construct the genome sequence of the most recent common ancestor to all modern human populations which provides a better alternative to human reference genome and can be a useful resource for the study of personal genomics, population genetics, human and primate evolution.
Resumo:
The cloned dihydrofolate reductase gene of Saccharomyces cerevisiae (DFR 1) is expressed in Escherichia coli. Bacterial strain JF1754 transformed with plasmids containing DFR 1 is at least 5X more resistant to inhibition by the folate antagonist trimethoprim. Expression of yeast DFR 1 in E. coli suggests it is likely that the gene lacks intervening sequences. The 1.8 kbp DNA fragment encoding yeast dhfr activity probably has its own promotor, as the gene is expressed in both orientations in E. coli. Expression of the yeast dhfr gene cloned into M13 viral vectors allowed positive selection of DFR 1 - M13 bacterial transfectants in medium supplemented with trimethoprim. A series of nested deletions generated by nuclease Bal 31 digestion and by restriction endonuclease cleavage of plasmids containing DFR 1 physically mapped the gene to a 930 bp region between the Pst 1 and Sal 1 cut sites. This is consistent with the 21,000 molecular weight attributed to yeast dhfr in previous reports. From preliminary DNA sequence analysis of the dhfr DNA fragment the 3' terminus of DFR 1 was assigned to a position 27 nucleotides from the Eco Rl cut site on the Bam Hi - Eco Rl DNA segment. Several putative yeast transcription termination consensus sequences were identified 3' to the opal stop codon. DFR 1 is expressed in yeast and it confers resistance to the antifolate methotrexate when the gene is present in 2 - 10 copies per cell. Plasmid-dependent resistance to methotrexate is also observed in a rad 6 background although the effect is somewhat less than that conferred to wild-type or rad 18 cells. Integration of DFR 1 into the yeast genome showed an intermediate sensitivity to folate antagonists. This may suggest a gene dosage effect. No change in petite induction in these yeast strains was observed in transformed cells containing yeast dhfr plasmids. The sensitivity of rad 6 , rad 18 and wild-type cell populations to trimethoprim were unaffected by the presence of DFR 1 in transformants. Moreover, trimethoprim did not induce petites in any strain tested, which normally results if dhfr is inhibited by other antifolates such as methotrexate. This may suggest that the dhfr enzyme is not the only possible target of trimethoprim in yeast. rad 6 mutants showed a very low level of spontaneous petite formation. Methotrexate failed to induce respiratory deficient mutants in this strain which suggested that rad 6 might be an obligate grande. However, ethidium bromide induced petites to a level approximately 50% of that exhibited by wild-type and rad 18 strains.
Resumo:
Surface proteinaceous fibrils, termed fimbriae, were first identified on gram negative bacteria in the 1940s. Fungal fimbriae, discovered some 25 years later, are found on members of all fungal classes. In the present study, polyclonal antiserum raised against the fimbrial proteins of U. vio/acea were used in order to identify antigenically related proteins from Coprinus cinereus and Schizophy//um commune. Two polypeptides with molecular masses of 37 and 39 kDa from C. cinereus were observed and confirm earlier results. A single previously unidentified 50 kDa polypeptide in S. commune crossreacted with the antiserum. The 50 kDa protein was found to consist of 3 isoforms with isoelectric points ranging from 5.6 to 5.8. A fimbrial cDNA derived from U. vio/acea was used to identify DNA restriction fragments from C. cinereus and S. commune showing homology to the fimbrial transcript of U. vio/acea. Heterologous hybridization with this cDNA was used in order to screen a C. cinereus genomic DNA library. A single clone, A2-3A, with a 14 kbp insert showed strong homology to the pfim3-1 cDNA. The region of homology, a 700 bp Xba I fragment, was subcloned into pUG19. This plasmid was refered to as pXX8. DNA sequence determinations of pXX8 and adjacent fragments from A2-3A suggested that the cloned DNA was a portion of the rONA repeat encoding the small subunit rRNA. DNA sequence analysis of pfim3-1 yielded an incomplete open reading frame. The predicted amino acid sequence codes for a 206 amino acid, 22 kDa polypeptide which contains a domain similar to a transmembrane domain from rat leukocyte antigen, GDS3. As well, an untranslated 576 nucleotide domain showed 81 % homology to pXX8 and 830/0 homology to the 188 rRNA sequence of Ustilago maydis. This sequence was found adjacent to a region of adenine-thymine base pairs presumed to represent the polyadenylation sequence of the fimbrial transcript. The size and extent of homology is sufficient to account for the hybridization of pfim3-1 to rDNA. It is suggested that this domain represents a completely novel regulatory domain within eukaryotes that may enable the observed rapid regeneration of fimbriae in U. violacea.