888 resultados para Genome Sequence


Relevância:

30.00% 30.00%

Publicador:

Resumo:

Multiple-complete-digest mapping is a DNA mapping technique based on complete-restriction-digest fingerprints of a set of clones that provides highly redundant coverage of the mapping target. The maps assembled from these fingerprints order both the clones and the restriction fragments. Maps are coordinated across three enzymes in the examples presented. Starting with yeast artificial chromosome contigs from the 7q31.3 and 7p14 regions of the human genome, we have produced cosmid-based maps spanning more than one million base pairs. Each yeast artificial chromosome is first subcloned into cosmids at a redundancy of ×15–30. Complete-digest fragments are electrophoresed on agarose gels, poststained, and imaged on a fluorescent scanner. Aberrant clones that are not representative of the underlying genome are rejected in the map construction process. Almost every restriction fragment is ordered, allowing selection of minimal tiling paths with clone-to-clone overlaps of only a few thousand base pairs. These maps demonstrate the practicality of applying the experimental and software-based steps in multiple-complete-digest mapping to a target of significant size and complexity. We present evidence that the maps are sufficiently accurate to validate both the clones selected for sequencing and the sequence assemblies obtained once these clones have been sequenced by a “shotgun” method.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The construction of cDNA clones encoding large-size RNA molecules of biological interest, like coronavirus genomes, which are among the largest mature RNA molecules known to biology, has been hampered by the instability of those cDNAs in bacteria. Herein, we show that the application of two strategies, cloning of the cDNAs into a bacterial artificial chromosome and nuclear expression of RNAs that are typically produced within the cytoplasm, is useful for the engineering of large RNA molecules. A cDNA encoding an infectious coronavirus RNA genome has been cloned as a bacterial artificial chromosome. The rescued coronavirus conserved all of the genetic markers introduced throughout the sequence and showed a standard mRNA pattern and the antigenic characteristics expected for the synthetic virus. The cDNA was transcribed within the nucleus, and the RNA translocated to the cytoplasm. Interestingly, the recovered virus had essentially the same sequence as the original one, and no splicing was observed. The cDNA was derived from an attenuated isolate that replicates exclusively in the respiratory tract of swine. During the engineering of the infectious cDNA, the spike gene of the virus was replaced by the spike gene of an enteric isolate. The synthetic virus replicated abundantly in the enteric tract and was fully virulent, demonstrating that the tropism and virulence of the recovered coronavirus can be modified. This demonstration opens up the possibility of employing this infectious cDNA as a vector for vaccine development in human, porcine, canine, and feline species susceptible to group 1 coronaviruses.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The function of many of the uncharacterized open reading frames discovered by genomic sequencing can be determined at the level of expressed gene products, the proteome. However, identifying the cognate gene from minute amounts of protein has been one of the major problems in molecular biology. Using yeast as an example, we demonstrate here that mass spectrometric protein identification is a general solution to this problem given a completely sequenced genome. As a first screen, our strategy uses automated laser desorption ionization mass spectrometry of the peptide mixtures produced by in-gel tryptic digestion of a protein. Up to 90% of proteins are identified by searching sequence data bases by lists of peptide masses obtained with high accuracy. The remaining proteins are identified by partially sequencing several peptides of the unseparated mixture by nanoelectrospray tandem mass spectrometry followed by data base searching with multiple peptide sequence tags. In blind trials, the method led to unambiguous identification in all cases. In the largest individual protein identification project to date, a total of 150 gel spots—many of them at subpicomole amounts—were successfully analyzed, greatly enlarging a yeast two-dimensional gel data base. More than 32 proteins were novel and matched to previously uncharacterized open reading frames in the yeast genome. This study establishes that mass spectrometry provides the required throughput, the certainty of identification, and the general applicability to serve as the method of choice to connect genome and proteome.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The genome of the Kaposi sarcoma-associated herpesvirus (KSHV or HHV8) was mapped with cosmid and phage genomic libraries from the BC-1 cell line. Its nucleotide sequence was determined except for a 3-kb region at the right end of the genome that was refractory to cloning. The BC-1 KSHV genome consists of a 140.5-kb-long unique coding region flanked by multiple G+C-rich 801-bp terminal repeat sequences. A genomic duplication that apparently arose in the parental tumor is present in this cell culture-derived strain. At least 81 ORFs, including 66 with homology to herpesvirus saimiri ORFs, and 5 internal repeat regions are present in the long unique region. The virus encodes homologs to complement-binding proteins, three cytokines (two macrophage inflammatory proteins and interleukin 6), dihydrofolate reductase, bcl-2, interferon regulatory factors, interleukin 8 receptor, neural cell adhesion molecule-like adhesin, and a D-type cyclin, as well as viral structural and metabolic proteins. Terminal repeat analysis of virus DNA from a KS lesion suggests a monoclonal expansion of KSHV in the KS tumor.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Cells of several major algal groups are evolutionary chimeras of two radically different eukaryotic cells. Most of these “cells within cells” lost the nucleus of the former algal endosymbiont. But after hundreds of millions of years cryptomonads still retain the nucleus of their former red algal endosymbiont as a tiny relict organelle, the nucleomorph, which has three minute linear chromosomes, but their function and the nature of their ends have been unclear. We report extensive cryptomonad nucleomorph sequences (68.5 kb), from one end of each of the three chromosomes of Guillardia theta. Telomeres of the nucleomorph chromosomes differ dramatically from those of other eukaryotes, being repeats of the 23-mer sequence (AG)7AAG6A, not a typical hexamer (commonly TTAGGG). The subterminal regions comprising the rRNA cistrons and one protein-coding gene are exactly repeated at all three chromosome ends. Gene density (one per 0.8 kb) is the highest for any cellular genome. None of the 38 protein-coding genes has spliceosomal introns, in marked contrast to the chlorarachniophyte nucleomorph. Most identified nucleomorph genes are for gene expression or protein degradation; histone, tubulin, and putatively centrosomal ranbpm genes are probably important for chromosome segregation. No genes for primary or secondary metabolism have been found. Two of the three tRNA genes have introns, one in a hitherto undescribed location. Intergenic regions are exceptionally short; three genes transcribed by two different RNA polymerases overlap their neighbors. The reported sequences encode two essential chloroplast proteins, FtsZ and rubredoxin, thus explaining why cryptomonad nucleomorphs persist.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

CTXφ is a filamentous, temperate bacteriophage whose genome includes ctxAB, the genes that encode cholera toxin. In toxigenic isolates of Vibrio cholerae, tandem arrays of prophage DNA, usually interspersed with the related genetic element RS1, are integrated site-specifically within the chromosome. We have discovered that these arrays routinely yield hybrid virions, composed of DNA from two adjacent prophages or from a prophage and a downstream RS1. Coding sequences are always derived from the 5′ prophage whereas most of an intergenic sequence, intergenic region 1, is always derived from the 3′ element. The presence of tandem elements is required for production of virions: V. cholerae strains that contain a solitary prophage rarely yield CTX virions, and the few virions detected result from imprecise excision of prophage DNA. Thus, generation of the replicative form of CTXφ, pCTX, a step that precedes production of virions, does not depend on reversal of the process for site-specific integration of CTXφ DNA into the V. cholerae chromosome. Production of pCTX also does not depend on RecA-mediated homologous recombination between adjacent prophages. We hypothesize that the CTXφ-specific proteins required for replication of pCTX can also function on a chromosomal substrate, and that, unlike the processes used by other integrating phages, production of pCTX and CTXφ does not require excision of the prophage from the chromosome. Use of this replication strategy maximizes vertical transmission of prophage DNA while still enabling dissemination of CTXφ to new hosts.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

HIV type 1 (HIV-1) specifically uses host cell tRNALys-3 as a primer for reverse transcription. The 3′ 18 nucleotides of this tRNA are complementary to a region on the HIV RNA genome known as the primer binding site (PBS). HIV-1 has a strong preference for maintaining a lysine-specific PBS in vivo, and viral genomes with mutated PBS sequences quickly revert to be complementary to tRNALys-3. To investigate the mechanism for the observed PBS reversion events in vitro, we examined the capability of the nucleocapsid protein (NC) to anneal various tRNA primer sequences onto either complementary or noncomplementary PBSs. We show that NC can anneal different full-length tRNAs onto viral RNA transcripts derived from the HIV-1 MAL or HXB2 isolates, provided that the PBS is complementary to the tRNA used. In contrast, NC promotes specific annealing of only tRNALys-3 onto an RNA template (HXB2) whose PBS sequence has been mutated to be complementary to the 3′ 18 nt of human tRNAPro. Moreover, HIV-1 reverse transcriptase extends this binary complex from the proline-specific PBS. The formation of the noncomplementary binary complex does not occur when a chimeric tRNALys/Pro containing proline-specific D and anticodon domains is used as the primer. Thus, elements outside the acceptor-TΨC domains of tRNALys-3 play an important role in preferential primer use in vitro. Our results support the hypothesis that mutant PBS reversion is a result of tRNALys-3 annealing onto and extension from a PBS that specifies an alternate host cell tRNA.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Here we present the successful application of the microarray technology platform to the analysis of DNA polymorphisms. Using the rice genome as a model, we demonstrate the potential of a high-throughput genome analysis method called Diversity Array Technology, DArT‘. In the format presented here the technology is assaying for the presence (or amount) of a specific DNA fragment in a representation derived from the total genomic DNA of an organism or a population of organisms. Two different approaches are presented: the first involves contrasting two representations on a single array while the second involves contrasting a representation with a reference DNA fragment common to all elements of the array. The Diversity Panels created using this method allow genetic fingerprinting of any organism or group of organisms belonging to the gene pool from which the panel was developed. Diversity Arrays enable rapid and economical application of a highly parallel, solid-state genotyping technology to any genome or complex genomic mixtures.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) is maintained at the European Bioinformatics Institute (EBI) in an international collaboration with the DNA Data Bank of Japan (DDBJ) and GenBank at the NCBI (USA). Data is exchanged amongst the collaborating databases on a daily basis. The major contributors to the EMBL database are individual authors and genome project groups. Webin is the preferred web-based submission system for individual submitters, whilst automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO). Database releases are produced quarterly. Network services allow free access to the most up-to-date data collection via ftp, email and World Wide Web interfaces. EBI’s Sequence Retrieval System (SRS), a network browser for databanks in molecular biology, integrates and links the main nucleotide and protein databases plus many specialized databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, BLAST) are available which allow external users to compare their own sequences against the latest data in the EMBL Nucleotide Sequence Database and SWISS-PROT.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The Mouse Genome Database (MGD) is the community database resource for the laboratory mouse, a key model organism for interpreting the human genome and for understanding human biology and disease (http://www.informatics.jax.org). MGD provides standard nomenclature and consensus map positions for mouse genes and genetic markers; it provides a curated set of mammalian homology records, user-defined chromosomal maps, experimental data sets and the definitive mouse ‘gene to sequence’ reference set for the research community. The integration and standardization of these data sets facilitates the transition between mouse DNA sequence, gene and phenotype annotations. A recent focus on allele and phenotype representations enhances the ability of MGD to organize and present data for exploring the relationship between genotype and phenotype. This link between the genome and the biology of the mouse is especially important as phenotype information grows from large mutagenesis projects and genotype information grows from large-scale sequencing projects.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Upon the completion of the Saccharomyces cerevisiae genomic sequence in 1996 [Goffeau,A. et al. (1997) Nature, 387, 5], several creative and ambitious projects have been initiated to explore the functions of gene products or gene expression on a genome-wide scale. To help researchers take advantage of these projects, the Saccharomyces Genome Database (SGD) has created two new tools, Function Junction and Expression Connection. Together, the tools form a central resource for querying multiple large-scale analysis projects for data about individual genes. Function Junction provides information from diverse projects that shed light on the role a gene product plays in the cell, while Expression Connection delivers information produced by the ever-increasing number of microarray projects. WWW access to SGD is available at genome-www.stanford.edu/Saccharomyces/.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

VIDA is a new virus database that organizes open reading frames (ORFs) from partial and complete genomic sequences from animal viruses. Currently VIDA includes all sequences from GenBank for Herpesviridae, Coronaviridae and Arteriviridae. The ORFs are organized into homologous protein families, which are identified on the basis of sequence similarity relationships. Conserved sequence regions of potential functional importance are identified and can be retrieved as sequence alignments. We use a controlled taxonomical and functional classification for all the proteins and protein families in the database. When available, protein structures that are related to the families have also been included. The database is available for online search and sequence information retrieval at http://www.biochem.ucl.ac.uk/bsm/virus_database/VIDA.html.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Viruses with RNA genomes often capture and redirect host cell components to assist in mechanisms particular to RNA-dependent RNA synthesis. The nidoviruses are an order of positive-stranded RNA viruses, comprising coronaviruses and arteriviruses, that employ a unique strategy of discontinuous transcription, producing a series of subgenomic mRNAs linking a 5′ leader to distal portions of the genome. For the prototype coronavirus mouse hepatitis virus (MHV), heterogeneous nuclear ribonucleoprotein (hnRNP) A1 has been shown to be able to bind in vitro to the negative strand of the intergenic sequence, a cis-acting element found in the leader RNA and preceding each downstream ORF in the genome. hnRNP A1 thus has been proposed as a host factor in MHV transcription. To test this hypothesis genetically, we initially constructed MHV mutants with a very high-affinity hnRNP A1 binding site inserted in place of, or adjacent to, an intergenic sequence in the MHV genome. This inserted hnRNP A1 binding site was not able to functionally replace, or enhance transcription from, the intergenic sequence. This finding led us to test more directly the role of hnRNP A1 by analysis of MHV replication and RNA synthesis in a murine cell line that does not express this protein. The cellular absence of hnRNP A1 had no detectable effect on the production of infectious virus, the synthesis of genomic RNA, or the quantity or quality of subgenomic mRNAs. These results strongly suggest that hnRNP A1 is not a required host factor for MHV discontinuous transcription or genome replication.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Reovirus genome segment S1 encodes protein σ1, which is the receptor binding protein, modulates tissue tropism, and specifies the nature of the antiviral immune response. It makes up less than 2% of reovirus particles and is synthesized in very small amounts in infected cells. Any antiviral strategy aimed at reducing specifically the expression of this genome segment should, in principle, reduce the infectivity of the virus. To test this hypothesis, we have assembled two hammer-head motif-containing ribozymes (Rzs) targeted to cleave at the conserved B and C domains of the reovirus s1 RNA. Protein-independent but Mg2+-dependent sequence-specific cleavage of s1 RNA was achieved by both the Rzs in trans. Cells that transiently express these Rzs, when challenged with reovirus, were protected against the cytopathic effects caused by the virus. This protection correlated with the specific intracellular reduction of s1 transcripts that was due to their cleavage by the Rzs. Rz-treated cells that were challenged with reovirus showed almost complete disappearance of protein σ1 without significantly altering the levels of the other reovirus structural proteins. Thus, Rzs, besides acting as antiviral agents, could be exploited as biological tools to delineate specific functions of target genes.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The release of vast quantities of DNA sequence data by large-scale genome and expressed sequence tag (EST) projects underlines the necessity for the development of efficient and inexpensive ways to link sequence databases with temporal and spatial expression profiles. Here we demonstrate the power of linking cDNA sequence data (including EST sequences) with transcript profiles revealed by cDNA-AFLP, a highly reproducible differential display method based on restriction enzyme digests and selective amplification under high stringency conditions. We have developed a computer program (GenEST) that predicts the sizes of virtual transcript-derived fragments (TDFs) of in silico-digested cDNA sequences retrieved from databases. The vast majority of the resulting virtual TDFs could be traced back among the thousands of TDFs displayed on cDNA-AFLP gels. Sequencing of the corresponding bands excised from cDNA-AFLP gels revealed no inconsistencies. As a consequence, cDNA sequence databases can be screened very efficiently to identify genes with relevant expression profiles. The other way round, it is possible to switch from cDNA-AFLP gels to sequences in the databases. Using the restriction enzyme recognition sites, the primer extensions and the estimated TDF size as identifiers, the DNA sequence(s) corresponding to a TDF with an interesting expression pattern can be identified. In this paper we show examples in both directions by analyzing the plant parasitic nematode Globodera rostochiensis. Various novel pathogenicity factors were identified by combining ESTs from the infective stage juveniles with expression profiles of ∼4000 genes in five developmental stages produced by cDNA-AFLP.