976 resultados para Open Reading Frames (orfs)
Resumo:
A 9.9 kb DNA fragment from the right arm of chromosome VII of Saccharomyces cerevisiae has been sequenced and analysed. The sequence contains four open reading frames (ORFs) longer than 100 amino acids. One gene, PFK1, has already been cloned and sequenced and the other one is the probable yeast gene coding for the beta-subunit of the succinyl-CoA synthetase. The two remaining ORFs share homology with the deduced amino acid sequence (and their physical arrangement is similar to that) of the YHR161c and YHR162w ORFs from chromosome VIII.
Resumo:
If open reading frames (ORFs) have been transmitted primarily by vertical descent, the distributional profile of orthologues of each ORF should be congruent with the organismal tree or a subtree thereof. Distributional patterns not reconciled parsimoniously with tree-like descent and loss are prima facie evidence of lateral gene transfer. Herein, a rigorous criterion for recognizing ORF distributions is described and implemented; it does not require the inference of phylogenetic trees, nor does it assume any specific tree. Because lineage-specific differences in rates of sequence change can also generate unexpected distributional patterns, rate artefacts, were controlled for by requiring pairwise matches between ORFs to exceed a rigorous inclusion threshold, but absence of a match was assessed against a more-permissive exclusion threshold. Applying this dual-threshold criterion to cross-domain and cross-phylum distributional patterns for ORFs in 23 bacterial genomes, a relative abundance of ORFs was observed that find a match in exactly seven other bacterial phyla; 94-99% of these ORFs also find matches among the Archaea and/or Eukarya. In the larger (and some smaller) bacterial genomes, ORFs that find matches in exactly one other bacterial phylum are also relatively abundant, but fewer of these have non-bacterial homologues; most of their matches within the Bacteria are to the Proteobacteria and/or Firmicutes, which cannot be sister lineages to all bacteria. ORFs that are neither distributed universally among the Bacteria, nor necessarily shared with topologically adjacent lineages, are preferentially enriched in large bacterial genomes.
Resumo:
Six open reading frames (ORFs) located on chromosome VII of Saccharomyces cerevisiae (YGR205w, YGR210c, YGR211w, YGR241c, YGR243w and YGR244c) were disrupted in two different genetic backgrounds using short-flanking homology (SFH) gene replacement. Sporulation and tetrad analysis showed that YGR211w, recently identified as the yeast ZPR1 gene, is an essential gene. The other five genes are non-essential, and no phenotypes could be associated to their inactivation. Two of these genes have recently been further characterized: YGR241c (YAP1802) encodes a yeast adaptor protein and YGR244c (LSC2) encodes the b-subunit of the succinyl-CoA ligase. For each ORF, a replacement cassette with long flanking regions homologous to the target locus was cloned in pUG7, and the cognate wild-type gene was cloned in pRS416.
Resumo:
VIDA is a new virus database that organizes open reading frames (ORFs) from partial and complete genomic sequences from animal viruses. Currently VIDA includes all sequences from GenBank for Herpesviridae, Coronaviridae and Arteriviridae. The ORFs are organized into homologous protein families, which are identified on the basis of sequence similarity relationships. Conserved sequence regions of potential functional importance are identified and can be retrieved as sequence alignments. We use a controlled taxonomical and functional classification for all the proteins and protein families in the database. When available, protein structures that are related to the families have also been included. The database is available for online search and sequence information retrieval at http://www.biochem.ucl.ac.uk/bsm/virus_database/VIDA.html.
Resumo:
We report the nucleotide sequence of a 17,893 bp DNA segment from the right arm of Saccharomyces cerevisiae chromosome VII. This fragment begins at 482 kb from the centromere. The sequence includes the BRF1 gene, encoding TFIIIB70, the 5' portion of the GCN5 gene, an open reading frame (ORF) previously identified as ORF MGA1, whose translation product shows similarity to heat-shock transcription factors and five new ORFs. Among these, YGR250 encodes a polypeptide that harbours a domain present in several polyA binding proteins. YGR245 is similar to a putative Schizosaccharomyces pombe gene, YGR248 shows significant similarity with three ORFs of S. cerevisiae situated on different chromosomes, while the remaining two ORFs, YGR247 and YGR251, do not show significant similarity to sequences present in databases.
Resumo:
We report the sequence of a 9000 bp fragment from the right arm of Saccharomyces cerevisiae chromosome VII. Analysis of the sequence revealed four complete previously unknown open reading frames, which were named G7587, G7589, G7591 and G7594 following standard rules for provisional nomenclature. Outstanding features of some of these proteins were the homology of the putative protein coded by G7589 with proteins involved in transcription regulation and the transmembrane domains predicted in the putative protein coded by G7591.
Resumo:
Mammalian gene expression displays widespread circadian oscillations. Rhythmic transcription underlies the core clock mechanism, but it cannot explain numerous observations made at the level of protein rhythmicity. We have used ribosome profiling in mouse liver to measure the translation of mRNAs into protein around the clock and at high temporal and nucleotide resolution. We discovered, transcriptome-wide, extensive rhythms in ribosome occupancy and identified a core set of approximately 150 mRNAs subject to particularly robust daily changes in translation efficiency. Cycling proteins produced from nonoscillating transcripts revealed thus-far-unknown rhythmic regulation associated with specific pathways (notably in iron metabolism, through the rhythmic translation of transcripts containing iron responsive elements), and indicated feedback to the rhythmic transcriptome through novel rhythmic transcription factors. Moreover, estimates of relative levels of core clock protein biosynthesis that we deduced from the data explained known features of the circadian clock better than did mRNA expression alone. Finally, we identified uORF translation as a novel regulatory mechanism within the clock circuitry. Consistent with the occurrence of translated uORFs in several core clock transcripts, loss-of-function of Denr, a known regulator of reinitiation after uORF usage and of ribosome recycling, led to circadian period shortening in cells. In summary, our data offer a framework for understanding the dynamics of translational regulation, circadian gene expression, and metabolic control in a solid mammalian organ.
Resumo:
Background: Approximately 40% of mammalian mRNA sequences contain AUG trinucleotides upstream of the main coding sequence, with a quarter of these AUGs demarcating open reading frames of 20 or more codons. In order to investigate whether these open reading frames may encode functional peptides, we have carried out a comparative genomic analysis of human and mouse mRNA 'untranslated regions' using sequences from the RefSeq mRNA sequence database. Results: We have identified over 200 upstream open reading frames which are strongly conserved between the human and mouse genomes. Consensus sequences associated with efficient initiation of translation are overrepresented at the AUG trinucleotides of these upstream open reading frames, while comparative analysis of their DNA and putative peptide sequences shows evidence of purifying selection. Conclusion: The occurrence of a large number of conserved upstream open reading frames, in association with features consistent with protein translation, strongly suggests evolutionary maintenance of the coding sequence and indicates probable functional expression of the peptides encoded within these upstream open reading frames.
Resumo:
A 17.6 kb DNA fragment from the right arm of chromosome VII of Saccharomyces cerevisiae has been sequenced and analysed. The sequence contains twelve open reading frames (ORFs) longer than 100 amino acids. Three genes had already been cloned and sequenced: CCT, ADE3 and TR-I. Two ORFs are similar to other yeast genes: G7722 with the YAL023 (PMT2) and PMT1 genes, encoding two integral membrane proteins, and G7727 with the first half of the genes encoding elongation factors 1gamma, TEF3 and TEF4. Two other ORFs, G7742 and G7744, are most probably yeast orthologues of the human and Paracoccus denitrificans electron-transferring flavoproteins (beta chain) and of the Escherichia coli phosphoserine phosphohydrolase. The five remaining identified ORFs do not show detectable homology with other protein sequences deposited in data banks. The sequence has been deposited in the EMBL data library under Accession Number Z49133.
Resumo:
Minor lymphocyte stimulating (Mls) antigens specifically stimulate T cell responses that are restricted to particular T cell receptor (TCR) beta chain variable domains. The Mls phenotype is genetically controlled by an open reading frame (orf) located in the 3' long terminal repeat of mouse mammary tumor virus (MMTV); however, the mechanism of action of the orf gene product is unknown. Whereas predicted orf amino acid sequences show strong overall homology, the 20-30 COOH-terminal residues are strikingly polymorphic. This polymorphic region correlates with TCR V beta specificity. We have generated monoclonal antibodies to a synthetic peptide encompassing the 19 COOH-terminal amino acid residues of Mtv-7 orf, which encodes the Mls-1a determinant. We show here that these antibodies block Mls responses in vitro and can interfere specifically with thymic clonal deletion of Mls-1a reactive V beta 6+ T cells in neonatal mice. Furthermore, the antibodies can inhibit V beta 6+ T cell responses in vivo to an infectious MMTV that shares orf sequence homology and TCR specificity with Mtv-7. These results confirm the predicted extracellular localization of the orf COOH terminus and imply that the orf proteins of both endogenous and exogenous MMTV interact directly with TCR V beta.
Resumo:
Group II introns are widely believed to have been ancestors of spliceosomal introns, yet little is known about their own evolutionary history. In order to address the evolution of mobile group II introns, we have compiled 71 open reading frames (ORFs) related to group II intron reverse transcriptases and subjected their derived amino acid sequences to phylogenetic analysis. The phylogenetic tree was rooted with reverse transcriptases (RTs) of non-long terminal repeat retroelements, and the inferred phylogeny reveals two major clusters which we term the mitochondrial and chloroplast-like lineages. Bacterial ORFs are mainly positioned at the bases of the two lineages but with weak bootstrap support. The data give an overview of an apparently high degree of horizontal transfer of group II intron ORFs, mostly among related organisms but also between organelles and bacteria. The Zn domain (nuclease) and YADD motif (RT active site) were lost multiple times during evolution. Differences in domain structures suggest that the oldest ORFs were concise, while the ORF in the mitochondrial lineage subsequently expanded in three locations. The data are consistent with a bacterial origin for mobile group II introns.
Resumo:
Harmless bacteria inhabiting inner plant tissues are termed endophytes. Population fluctuations in the endophytic bacterium Pantoea agglomerans associated with two species of field cultured citrus plants were monitored over a two-year period. The results demonstrated that populations of P. agglomerans fluctuated in Citrus reticulata but not C. sinensis. A cryptic plasmid pPA3.0 (2.9 kb) was identified in 35 out of 44 endophytic isolates of P. agglomerans and was subsequently sequenced. The origins of replication were identified and nine out of 18 open reading frames (ORFs) revealed homology with described proteins. Notably, two ORFs were related to cellular transport systems and plasmid maintenance. Plasmid pPA3.0 was cloned and the gfp gene inserted to generate the pPAGFP vector. The vector was introduced into P. agglomerans isolates and revealed stability was dependent on the isolate genotype, ninety-percent stability values were reached after 60 hours of bacterial cultivation in most evaluated isolates. In order to definitively establish P. agglomerans as an endophyte, the non-transformed bacterium was reintroduced into in vitro cultivated seedlings and the density of inner tissue colonization in inoculated plants was estimated by bacterium re-isolation, while the tissue niches preferred by the bacterium were investigated by scanning electronic microscopy (SEM). Cells from P. agglomerans (strain ARB18) at similar densities were re-isolated from roots, stems and leaves and colonization of parenchyma and xylem tissues were observed. Data suggested that P. agglomerans is a ubiquitous citrus endophyte harboring cryptic plasmids. These characteristics suggest the potential to use the bacterium as a vehicle to introduce new genes in host plants via endophytic bacterial transformation.
Resumo:
The complete nucleotide sequence of the genomic RNA from the insect picorna-like virus Drosophila C virus (DCV) was determined. The DCV sequence predicts a genome organization different to that of other RNA virus families whose sequences are known. The single-stranded positive-sense genomic RNA is 9264 nucleotides in length and contains two large open reading frames (ORFs) which are separated by 191 nucleotides. The 5' ORF contains regions of similarities with the RNA-dependent RNA polymerase, helicase and protease domains of viruses from the picornavirus, comovirus and sequivirus families. The 3' ORF encodes the capsid proteins as confirmed by N-terminal sequence analysis of these proteins. The capsid protein coding region is unusual in two ways: firstly the cistron appears to lack an initiating methionine and secondly no subgenomic RNA is produced, suggesting that the proteins may be translated through internal initiation of translation from the genomic length RNA. The finding of this novel genome organization for DCV shows that this virus is not a member of the Picornaviridae as previously thought, but belongs to a distinct and hitherto unrecognized virus family.
Resumo:
We showed in 1988 that there are two strains of Chlamydia psittaci which infect the koala (Phascolarctos cinereus). In order to further investigate the role of these chlamydial strains in pathogenesis, we have attempted to identify genes of koala type I strain chlamydial which are involved in the immunogenic response, Transformation of Escherichia coli with a plasmid containing a 6.3-kb fragment (pKOC-10) of C. psittaci DNA caused the appearance of a specific chlamydial lipopolysaccharide (LPS) epitope on the host strain. The smallest DNA fragment capable of inducing the expression of chlamydial LPS was an Xbal fragment, 2.4 kb in size (pKOC-5). DNA sequence analysis of the complete fragment revealed regions of high identity, at the amino acid level, to the gseA genes of C. pneomoniae, C. psittaci 6BC and C. trachomatis, and the kdtA gene of E. coli which code for transferases catalysing the addition of 3-deoxy-D-manno-octulosonic acid (Kdo) residues to lipid A. Two open reading frames (ORFs) of 1,314 and 501 nucleotides in size, within the 2.4-kb fragment, were evident, and mRNA species corresponding to these ORFs were detected by Northern analysis. Both ORF1 and ORF2 are required for the appearance of chlamydia-specific LPS on the surface of recombinant E. coli.
Resumo:
Surrogate methods for detecting lateral gene transfer are those that do not require inference of phylogenetic trees. Herein I apply four such methods to identify open reading frames (ORFs) in the genome of Escherichia coli K12 that may have arisen by lateral gene transfer. Only two of these methods detect the same ORFs more frequently than expected by chance, whereas several intersections contain many fewer ORFs than expected. Each of the four methods detects a different non-random set of ORFs. The methods may detect lateral ORFs of different relative ages; testing this hypothesis will require rigorous inference of trees. (C) 2001 Federation of European Microbiological Societies. Published by Elsevier Science BN. All rights reserved.