7 resultados para Complete genome sequencing

em Brock University, Canada


Relevância:

100.00% 100.00%

Publicador:

Resumo:

The complete genome of an Erwinia amylovora bacteriophage, vB_EamM_Ea35-70 (Ea35-70), is 271,084 bp, encodes 318 putative proteins, and contains one tRNA. Comparative analysis with other Myoviridae genomes suggests that Ea35-70 is related to the Phikzlikevirus genus within the family Myoviridae, since 26% of Ea35-70 proteins share homology to proteins in Pseudomonas phage φKZ.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The neuropeptide Th1RFamide with the sequence Phe-Met-Arg-Phe-amide was originally isolated in the clam Macrocallista nimbosa (price and Greenberg, 1977). Since its discovery, a large family ofFl\1RFamide-related peptides termed FaRPs have been found to be present in all major animal phyla with functions ranging from modulation of neuronal activity to alteration of muscular contractions. However, little is known about the genetics encoding these peptides, especially in invertebrates. As FaRP-encoding genes have yet to be investigated in the invertebrate Malacostracean subphylum, the isolation and characterization ofFaRP-encoding DNA and mRNA was pursued in this project. The immediate aims of this thesis were: (1) to amplify mRNA sequences of Procambarus clarkii using a degenerate oligonucleotide primer deduced from the common amino acid sequence ofisolated Procambarus FaRPS, (2) to determine if these amplification products encode FaRP gene sequences, and (3) to create a selective cDNA library of sequences recognized by the degenerate oligonucleotide primer. The polymerase chain reaction - rapid amplification of cDNA ends (PCR-RACE) is a procedure in which a single gene-specific primer is used in conjunction with a generalized 3' or 5' primer to amplify copies ofthe region between a single point in the transcript and the 3' or 5' end of cDNA of interest (Frohman et aI., 1988). PCRRACE reactions were optimized with respect to primers used, buffer composition, cycle number, nature ofgenetic substrate to be amplified, annealing, extension and denaturation temperatures and times, and use of reamplification procedures. Amplification products were cloned into plasmid vectors and recombinant products were isolated, as were the recombinant plaques formed in the selective cDNA library. Labeled amplification products were hybridized to recombinant bacteriophage to determine ligated amplification product presence. When sequenced, the five isolated PCR-RACE amplification products were determined not to possess FaRP-encoding sequences. The 200bp, 450bp, and 1500bp sequences showed homology to the Caenorhabditis elegans cosmid K09A11, which encodes for cytochrome P450; transfer-RNA; transposase; and tRNA-Tyr, while the 500bp and 750bp sequences showed homology with the complete genome of the Vaccinia virus. Under the employed amplification conditions the degenerate oligonucleotide primer was observed to bind to and to amplify sequences with either 9 or 10bp of 17bp identity. The selective cDNA library was obselVed to be of extremely low titre. When library titre was increased, white. plaques were isolated. Amplification analysis of eight isolated Agt11 sequences from these plaques indicated an absence of an insertion sequence. The degenerate 17 base oligonucleotide primer synthesized from the common amino acid sequence ofisolated Procambarus FaRPs was thus determined to be non-specific in its binding under the conditions required for its use, and to be insufficient for the isolation and identification ofFaRP-encoding sequences. A more specific primer oflonger sequence, lower degeneracy, and higher melting temperature (TJ is recommended for further investigation into the FaRP-encoding genes of Procambarlls clarkii.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Variations in different types of genomes have been found to be responsible for a large degree of physical diversity such as appearance and susceptibility to disease. Identification of genomic variations is difficult and can be facilitated through computational analysis of DNA sequences. Newly available technologies are able to sequence billions of DNA base pairs relatively quickly. These sequences can be used to identify variations within their specific genome but must be mapped to a reference sequence first. In order to align these sequences to a reference sequence, we require mapping algorithms that make use of approximate string matching and string indexing methods. To date, few mapping algorithms have been tailored to handle the massive amounts of output generated by newly available sequencing technologies. In otrder to handle this large amount of data, we modified the popular mapping software BWA to run in parallel using OpenMPI. Parallel BWA matches the efficiency of multithreaded BWA functions while providing efficient parallelism for BWA functions that do not currently support multithreading. Parallel BWA shows significant wall time speedup in comparison to multithreaded BWA on high-performance computing clusters, and will thus facilitate the analysis of genome sequencing data.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The relative ease to concentrate and purify adenoviruses, their well characterized mid-sized genome, and the ability to delete non-essential regions from their genome to accommodate foreign gene, made adenoviruses a suitable candidate for the construction of vectors. The use of adenoviral vectors in gene therapy, vaccination, and as a general vector system for expressing foreign genes have been documented for some time. In this study, the objective was to rescue a BAV3 E1 or E3 recombinant vector carrying the kanamycin resistant gene, a dominant selectable marker with useful applications in studying vectored gene expression in mammalian cells. To accomplish the objective of this study, more information about BAV3 DNA sequences was required in order to make the manipulation of the virus genome accessible. Therefore, sequencing of the BAV3 genome from 1 1 .7% to 30.8% was carried out. Analysis of the determined sequences revealed the primary structure of important viral gene products coded by E2 including BAV3 DNA pol and precursor to terminal protein. Comparative analysis of these proteins with their counterparts from human and non human adenoviruses revealed important insights as to the evolutionary lineage of BAV3. In order to insert the kanamycin resistance gene in either E1 or E3, it was necessary to delete BAV3 sequences to accommodate the foreign gene so as not to exceed the limit of the packaging capacity of the virus. To construct a recombinant BAV3 in which a foreign gene was inserted in the deleted E1 region, an E1 shuttle vector was constructed. This involved the deletion from the viral sequences a region between 1.3% to 9% and inserting the kanamycin resistance gene to replace the deletion. The E1 shuttle vector contained the left (0%- 53.9%) segment of the genome and was expected to generate BAV3 recombinants that can be grown and propagated in cells that can complement the missing E1 functions. To construct a similar shuttle vector for E3 deletion, DNA sequences extending from 78.9% to 82.5% (1281 bp) were deleted from within the E3 region that had been cloned into a plasmid vector. The deleted region corresponds to those that have been shown to be non-essential for viral replication in cell culture. The resulting plasmid was used to construct another recombinant plasmid with BAV3 DNA sequences extending from 37.1% to 100% and with a deletion of E3 sequences that were replaced by kanamycin resistance gene. This shuttle plasmid was used in cotransfections with digested viral DNA in an attempt to rescue a recombinant BAV3 carrying the kanamycin resistance gene to replace the deleted E3. In spite of repeated attempts of transfection, El or E3 recombinant BAV3 were not isolated. It seems that other approaches should be applied to make a final conclusion on BAV3 infectivity.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Adenoviruses are nonenveloped icosahedral shaped particles. The double stranded DNA viral genome is divided into 5 major early transcription units, designated E1 A, E1 B, and E2 to E4, which are expressed in a regulated manner soon after infection. The gene products of the early region 3 (E3), shown to be nonessential for viral replication in vitro, are believed to be involved in counteracting host immunosurveillance. In order to sequence the E3 region of Bovine adenovirus type 2 (BAV2) it was necessary to determine the restriction map for the plasmid pEA48. A physical restriction endonuclease map for BamHl, Clal, Eco RI, Hindlll, Kpnl, Pstt, Sail, and Xbal was constructed. The DNA insert in pEA48 was determined to be viral in origin using Southern hybridization. A human adenovirus type 5 recombinant plasmid, containing partial DNA fragments of the two transcription units L4 and L5 that lie just outside the E3, was used to localize this region. The recombinant plasmid pEA was subcloned to facilitate sequencing. The DNA sequences between 74.8 and 90.5 map units containing the E3, the hexon associated protein (pVIII), and the fibre gene were determined. Homology comparison revealed that the genes for the hexon associated pV11I and the fibre protein are conserved. The last 70 amino acids of the BAV2 pV11I were the most conserved, showing a similarity of 87 percent with Ad2 pV1I1. A comparison between the predicted amino acid sequences of BAV2 and Ad40, Ad41 , Ad2 and AdS, revealed that they have an identical secondary structure consisting of a tail, a shaft and a knob. The shaft is composed of 22, 15 amino acid motifs, with periodic glycines and hydrophobic residues. The E3 region was found to consist of about 2.3 Kbp and to encode four proteins that were greater than 60 amino acids. However, these four open reading frames did not show significant homology to any other known adenovirus DNA or protein sequence.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Recombinant Adenoviruses (Ads) have been shown to have potential applications in three areas: gene therapy, high level protein expression and recombinant vaccines.' At least three different locations within the Ad genome can be deleted and subsequently used for the insertion of foreign sequences. These include the Early 3 (E3), Early 1 (E1) and Early 4 (E4) regions. Viral vectors of this type have been well studied in Human Ads 2 and 5, however one has not yet been constructed for Bovine Adenovirus Type 2 (BAV2). The E3 region is located between 76.6 and 86 m.u. on the r-strand and is transcribed in a rightward direction. The gene products of the Early 3 region (E3) have been shown to be non-essential for viral replication, in vitro, but are required for host immunosurveillance. This study represents the cloning and reconstitution of a BAV2 E3 deletion mutant. A deletion of 1800bp was made within the E3 region of BAV2 and the thymidine kinase gene was subsequently inserted in the deleted area . . The plasmid pdlE3-4tk1 (23.4Kbp) was constructed and used to to facilitate homologous recombination with the wild type BAV2 to produce a mutant. Southern Blotting and Hybridization results suggest the presence of a BAV2 E3 deletion mutant with thymidine kinase sequences present. The E4 region of Human Adenovirus types 2 and 5 is located at the extreme right end of the genome (91.3 map units - 99.1 map units) and is transcribed in a leftward direction giving rise to a complicated set of differentially spliced mRNAs. Essentially there are 7 open reading frames (ORFs) encoding for at least 7 polypeptides. The gene products encoded by the E4 region have been shown to be essential for the expression of late viral genes, host cell shutoff and normal viral growth. We have cloned and sequenced the right end segment between 90.5 map units and 100 map units of the BAV2 genome. The results show several open reading frames which encode polypeptides exhibiting homology to three polypeptides encoded by the E4 region of human adenovirus type 2. These include the 14kDa protein encoded by ORF1, the 34kDa protein encoded by ORF6 and the 13kDa protein encoded by ORF3. The nucleotide sequence, restriction enzyme map, and ORF map of the E4 region could be very useful in future molecular manipulation of this region and could possibly explain the slow growth rate of BAV2 in MDBK cells.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Genome sequence varies in numerous ways among individuals although the gross architecture is fixed for all humans. Retrotransposons create one of the most abundant structural variants in the human genome and are divided in many families, with certain members in some families, e.g., L1, Alu, SVA, and HERV-K, remaining active for transposition. Along with other types of genomic variants, retrotransponson-derived variants contribute to the whole spectrum of genome variants in humans. With the advancement of sequencing techniques, many human genomes are being sequenced at the individual level, fueling the comparative research on these variants among individuals. In this thesis, the evolution and functional impact of structural variations is examined primarily focusing on retrotransposons in the context of human evolution. The thesis comprises of three different studies on the topics that are presented in three data chapters. First, the recent evolution of all human specific AluYb members, representing the second most active subfamily of Alus, was tracked to identify their source/master copy using a novel approach. All human-specific AluYb elements from the reference genome were extracted, aligned with one another to construct clusters of similar copies and each cluster was analyzed to generate the evolutionary relationship between the members of the cluster. The approach resulted in identification of one major driver copy of all human specific Yb8 and the source copy of the Yb9 lineage. Three new subfamilies within the AluYb family – Yb8a1, Yb10 and Yb11 were also identified, with Yb11 being the youngest and most polymorphic. Second, an attempt to construct a relation between transposable elements (TEs) and tandem repeats (TRs) was made at a genome-wide scale for the first time. Upon sequence comparison, positional cross-checking and other relevant analyses, it was observed that over 20% of all TRs are derived from TEs. This result established the first connection between these two types of repetitive elements, and extends our appreciation for the impact of TEs on genomes. Furthermore, only 6% of these TE-derived TRs follow the already postulated initiation and expansion mechanisms, suggesting that the others are likely to follow a yet-unidentified mechanism. Third, by taking a combination of multiple computational approaches involving all types of genetic variations published so far including transposable elements, the first whole genome sequence of the most recent common ancestor of all modern human populations that diverged into different populations around 125,000-100,000 years ago was constructed. The study shows that the current reference genome sequence is 8.89 million base pairs larger than our common ancestor’s genome, contributed by a whole spectrum of genetic mechanisms. The use of this ancestral reference genome to facilitate the analysis of personal genomes was demonstrated using an example genome and more insightful recent evolutionary analyses involving the Neanderthal genome. The three data chapters presented in this thesis conclude that the tandem repeats and transposable elements are not two entirely distinctly isolated elements as over 20% TRs are actually derived from TEs. Certain subfamilies of TEs themselves are still evolving with the generation of newer subfamilies. The evolutionary analyses of all TEs along with other genomic variants helped to construct the genome sequence of the most recent common ancestor to all modern human populations which provides a better alternative to human reference genome and can be a useful resource for the study of personal genomics, population genetics, human and primate evolution.