850 results for Reading frames
Abstract:
If open reading frames (ORFs) have been transmitted primarily by vertical descent, the distributional profile of orthologues of each ORF should be congruent with the organismal tree or a subtree thereof. Distributional patterns that cannot be reconciled parsimoniously with tree-like descent and loss are prima facie evidence of lateral gene transfer. Herein, a rigorous criterion for recognizing ORF distributions is described and implemented; it does not require the inference of phylogenetic trees, nor does it assume any specific tree. Because lineage-specific differences in rates of sequence change can also generate unexpected distributional patterns, rate artefacts were controlled for by requiring pairwise matches between ORFs to exceed a rigorous inclusion threshold, while absence of a match was assessed against a more permissive exclusion threshold. When this dual-threshold criterion was applied to cross-domain and cross-phylum distributional patterns for ORFs in 23 bacterial genomes, ORFs that find a match in exactly seven other bacterial phyla were relatively abundant; 94-99% of these ORFs also find matches among the Archaea and/or Eukarya. In the larger (and some smaller) bacterial genomes, ORFs that find matches in exactly one other bacterial phylum are also relatively abundant, but fewer of these have non-bacterial homologues; most of their matches within the Bacteria are to the Proteobacteria and/or Firmicutes, which cannot be sister lineages to all bacteria. ORFs that are neither distributed universally among the Bacteria, nor necessarily shared with topologically adjacent lineages, are preferentially enriched in large bacterial genomes.
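The dual-threshold idea described in this abstract can be illustrated with a minimal Python sketch. The abstract does not state which match statistic or cut-offs were used, so the E-value scale, both threshold values, and all function names below are assumptions made purely for illustration.

```python
from typing import Dict, Optional

# Hypothetical cut-offs; the abstract does not give the actual values.
INCLUSION_EVALUE = 1e-10  # strict: a match must be at least this significant
EXCLUSION_EVALUE = 1e-3   # permissive: absence requires no match even this weak


def classify_match(best_evalue: Optional[float]) -> str:
    """Classify one ORF-versus-phylum comparison as present, absent or ambiguous."""
    if best_evalue is None or best_evalue > EXCLUSION_EVALUE:
        return "absent"      # nothing passes even the permissive threshold
    if best_evalue <= INCLUSION_EVALUE:
        return "present"     # passes the strict threshold
    return "ambiguous"       # falls between the two thresholds


def distribution_profile(best_evalues: Dict[str, Optional[float]]) -> Optional[Dict[str, bool]]:
    """Return a presence/absence profile, or None if any comparison is ambiguous."""
    profile = {}
    for phylum, evalue in best_evalues.items():
        call = classify_match(evalue)
        if call == "ambiguous":
            return None      # in this sketch, ambiguous ORFs are set aside
        profile[phylum] = (call == "present")
    return profile


def count_phyla_matched(profile: Dict[str, bool]) -> int:
    """Number of phyla in which the ORF finds a confident match."""
    return sum(profile.values())
```

Setting aside ORFs with any ambiguous comparison is one possible design choice: it keeps weak similarities caused by fast-evolving lineages from being read as either presence or absence.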
Abstract:
Six open reading frames (ORFs) located on chromosome VII of Saccharomyces cerevisiae (YGR205w, YGR210c, YGR211w, YGR241c, YGR243w and YGR244c) were disrupted in two different genetic backgrounds using short-flanking homology (SFH) gene replacement. Sporulation and tetrad analysis showed that YGR211w, recently identified as the yeast ZPR1 gene, is an essential gene. The other five genes are non-essential, and no phenotypes could be associated with their inactivation. Two of these genes have recently been further characterized: YGR241c (YAP1802) encodes a yeast adaptor protein and YGR244c (LSC2) encodes the beta-subunit of the succinyl-CoA ligase. For each ORF, a replacement cassette with long flanking regions homologous to the target locus was cloned in pUG7, and the cognate wild-type gene was cloned in pRS416.
Abstract:
We report the nucleotide sequence of a 17,893 bp DNA segment from the right arm of Saccharomyces cerevisiae chromosome VII. This fragment begins at 482 kb from the centromere. The sequence includes the BRF1 gene, encoding TFIIIB70; the 5' portion of the GCN5 gene; an open reading frame (ORF) previously identified as ORF MGA1, whose translation product shows similarity to heat-shock transcription factors; and five new ORFs. Among these, YGR250 encodes a polypeptide that harbours a domain present in several poly(A)-binding proteins. YGR245 is similar to a putative Schizosaccharomyces pombe gene, and YGR248 shows significant similarity to three ORFs of S. cerevisiae situated on different chromosomes, while the remaining two ORFs, YGR247 and YGR251, do not show significant similarity to sequences present in databases.
Abstract:
A 9.9 kb DNA fragment from the right arm of chromosome VII of Saccharomyces cerevisiae has been sequenced and analysed. The sequence contains four open reading frames (ORFs) longer than 100 amino acids. One gene, PFK1, has already been cloned and sequenced, and another is probably the yeast gene coding for the beta-subunit of succinyl-CoA synthetase. The two remaining ORFs share homology with the deduced amino acid sequences of the YHR161c and YHR162w ORFs from chromosome VIII, and their physical arrangement is similar to that of these two ORFs.
Abstract:
We report the sequence of a 9000 bp fragment from the right arm of Saccharomyces cerevisiae chromosome VII. Analysis of the sequence revealed four complete previously unknown open reading frames, which were named G7587, G7589, G7591 and G7594 following standard rules for provisional nomenclature. Outstanding features of some of these proteins were the homology of the putative protein coded by G7589 with proteins involved in transcription regulation and the transmembrane domains predicted in the putative protein coded by G7591.
Abstract:
Mammalian gene expression displays widespread circadian oscillations. Rhythmic transcription underlies the core clock mechanism, but it cannot explain numerous observations made at the level of protein rhythmicity. We have used ribosome profiling in mouse liver to measure the translation of mRNAs into protein around the clock and at high temporal and nucleotide resolution. We discovered, transcriptome-wide, extensive rhythms in ribosome occupancy and identified a core set of approximately 150 mRNAs subject to particularly robust daily changes in translation efficiency. Cycling proteins produced from nonoscillating transcripts revealed thus-far-unknown rhythmic regulation associated with specific pathways (notably in iron metabolism, through the rhythmic translation of transcripts containing iron responsive elements), and indicated feedback to the rhythmic transcriptome through novel rhythmic transcription factors. Moreover, estimates of relative levels of core clock protein biosynthesis that we deduced from the data explained known features of the circadian clock better than did mRNA expression alone. Finally, we identified uORF translation as a novel regulatory mechanism within the clock circuitry. Consistent with the occurrence of translated uORFs in several core clock transcripts, loss-of-function of Denr, a known regulator of reinitiation after uORF usage and of ribosome recycling, led to circadian period shortening in cells. In summary, our data offer a framework for understanding the dynamics of translational regulation, circadian gene expression, and metabolic control in a solid mammalian organ.
Abstract:
VIDA is a new virus database that organizes open reading frames (ORFs) from partial and complete genomic sequences from animal viruses. Currently VIDA includes all sequences from GenBank for Herpesviridae, Coronaviridae and Arteriviridae. The ORFs are organized into homologous protein families, which are identified on the basis of sequence similarity relationships. Conserved sequence regions of potential functional importance are identified and can be retrieved as sequence alignments. We use a controlled taxonomical and functional classification for all the proteins and protein families in the database. When available, protein structures that are related to the families have also been included. The database is available for online search and sequence information retrieval at http://www.biochem.ucl.ac.uk/bsm/virus_database/VIDA.html.
Abstract:
In late 1994 and early 1995, Ebola (EBO) virus dramatically reemerged in Africa, causing human disease in the Ivory Coast and Zaire. Analysis of the entire glycoprotein genes of these viruses and those of other EBO virus subtypes has shown that the virion glycoprotein (130 kDa) is encoded in two reading frames, which are linked by transcriptional editing. This editing results in the addition of an extra nontemplated adenosine within a run of seven adenosines near the middle of the coding region. The primary gene product is a smaller (50-70 kDa), nonstructural, secreted glycoprotein, which is produced in large amounts and has an unknown function. Phylogenetic analysis indicates that EBO virus subtypes are genetically diverse and that the recent Ivory Coast isolate represents a new (fourth) subtype of EBO virus. In contrast, the EBO virus isolate from the 1995 outbreak in Kikwit, Zaire, is virtually identical to the virus that caused a similar epidemic in Yambuku, Zaire, almost 20 years earlier. This genetic stability may indicate that EBO viruses have coevolved with their natural reservoirs and do not change appreciably in the wild.
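The effect of inserting one nontemplated adenosine into a run of seven can be sketched on a toy sequence. The RNA string below is invented for illustration and is not the Ebola glycoprotein sequence; the function names are likewise hypothetical.

```python
def insert_nontemplated_a(rna: str, run_length: int = 7) -> str:
    """Insert one extra A immediately after the first run of `run_length` adenosines."""
    run = "A" * run_length
    pos = rna.find(run)
    if pos == -1:
        raise ValueError("no adenosine run of the requested length")
    end = pos + run_length
    return rna[:end] + "A" + rna[end:]


def codons(rna: str, frame: int = 0) -> list:
    """Split an RNA string into complete codons, starting at the given frame offset."""
    return [rna[i:i + 3] for i in range(frame, len(rna) - 2, 3)]


# Toy transcript (an assumption, not real viral sequence): AUG, GCU, seven As, tail.
unedited = "AUGGCU" + "A" * 7 + "GCUGCUUGA"
edited = insert_nontemplated_a(unedited)

print(codons(unedited))  # codons read from the unedited transcript
print(codons(edited))    # downstream codons shift by one nucleotide after editing
```

Upstream of the adenosine run the two transcripts share the same codons; downstream, the single inserted base moves translation into a different reading frame, which is how one gene can encode two products.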
Abstract:
Background: Approximately 40% of mammalian mRNA sequences contain AUG trinucleotides upstream of the main coding sequence, with a quarter of these AUGs demarcating open reading frames of 20 or more codons. In order to investigate whether these open reading frames may encode functional peptides, we have carried out a comparative genomic analysis of human and mouse mRNA 'untranslated regions' using sequences from the RefSeq mRNA sequence database. Results: We have identified over 200 upstream open reading frames which are strongly conserved between the human and mouse genomes. Consensus sequences associated with efficient initiation of translation are overrepresented at the AUG trinucleotides of these upstream open reading frames, while comparative analysis of their DNA and putative peptide sequences shows evidence of purifying selection. Conclusion: The occurrence of a large number of conserved upstream open reading frames, in association with features consistent with protein translation, strongly suggests evolutionary maintenance of the coding sequence and indicates probable functional expression of the peptides encoded within these upstream open reading frames.
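As a rough illustration of what "AUGs upstream of the main coding sequence demarcating open reading frames of 20 or more codons" means operationally, the following Python sketch scans the 5' region of an mRNA. It is not the authors' pipeline: the function name, the choice not to count the stop codon, and the requirement for an in-frame stop are all assumptions.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def find_uorfs(mrna: str, cds_start: int, min_codons: int = 20) -> list:
    """Return (start, end) index pairs of ORFs whose AUG lies upstream of cds_start.

    Indices are 0-based and `end` is exclusive, pointing just past the stop codon.
    The codon count excludes the stop codon (an assumption of this sketch).
    """
    mrna = mrna.upper().replace("T", "U")
    uorfs = []
    for start in range(0, min(cds_start, len(mrna) - 2)):
        if mrna[start:start + 3] != "AUG":
            continue
        # Walk downstream in the frame set by this AUG until the first stop codon.
        for i in range(start + 3, len(mrna) - 2, 3):
            if mrna[i:i + 3] in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    uorfs.append((start, i + 3))
                break
    return uorfs
```

Note that two upstream AUGs in the same frame that share a stop codon are reported as separate entries here, and an upstream ORF is allowed to overlap the main coding sequence; a real analysis would need to decide how to treat both cases.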
Abstract:
Minor lymphocyte stimulating (Mls) antigens specifically stimulate T cell responses that are restricted to particular T cell receptor (TCR) beta chain variable domains. The Mls phenotype is genetically controlled by an open reading frame (orf) located in the 3' long terminal repeat of mouse mammary tumor virus (MMTV); however, the mechanism of action of the orf gene product is unknown. Whereas predicted orf amino acid sequences show strong overall homology, the 20-30 COOH-terminal residues are strikingly polymorphic. This polymorphic region correlates with TCR V beta specificity. We have generated monoclonal antibodies to a synthetic peptide encompassing the 19 COOH-terminal amino acid residues of Mtv-7 orf, which encodes the Mls-1a determinant. We show here that these antibodies block Mls responses in vitro and can interfere specifically with thymic clonal deletion of Mls-1a reactive V beta 6+ T cells in neonatal mice. Furthermore, the antibodies can inhibit V beta 6+ T cell responses in vivo to an infectious MMTV that shares orf sequence homology and TCR specificity with Mtv-7. These results confirm the predicted extracellular localization of the orf COOH terminus and imply that the orf proteins of both endogenous and exogenous MMTV interact directly with TCR V beta.
Abstract:
Background: Translational errors can result in bypassing of the main viral protein reading frames and the production of alternate reading frame (ARF) or cryptic peptides. Within HIV, there are many such ARFs in both the sense and antisense directions of transcription. These ARFs have the potential to generate immunogenic peptides called cryptic epitopes (CE). Both antiretroviral drug therapy and the immune system exert a mutational pressure on HIV-1. Immune pressure exerted by ARF CD8(+) T cells on the virus has already been observed in vitro. HAART has also been reported to select HIV-1 variants carrying drug escape mutations. Since the mutational pressure exerted on one location of the HIV-1 genome can potentially affect all three reading frames, we hypothesized that ARF responses would be affected by this drug pressure in vivo. Methodology/Principal findings: In this study we identified new ARFs derived from sense and antisense transcription of HIV-1. Many of these ARFs are detectable in circulating viral proteins. They are predominantly found in the HIV-1 env nucleotide region. We measured T cell responses to 199 HIV-1 CE encoded within 13 sense and 34 antisense HIV-1 ARFs. We observed that these ARF responses are more frequent and of greater magnitude in chronically infected individuals than in acutely infected patients, and that in patients on HAART the breadth of ARF responses increased. Conclusions/Significance: These results have implications for vaccine design and unveil the existence of potential new epitopes that could be included as vaccine targets.
Abstract:
Harmless bacteria inhabiting inner plant tissues are termed endophytes. Population fluctuations in the endophytic bacterium Pantoea agglomerans associated with two species of field-cultured citrus plants were monitored over a two-year period. The results demonstrated that populations of P. agglomerans fluctuated in Citrus reticulata but not C. sinensis. A cryptic plasmid pPA3.0 (2.9 kb) was identified in 35 out of 44 endophytic isolates of P. agglomerans and was subsequently sequenced. The origins of replication were identified and nine out of 18 open reading frames (ORFs) revealed homology with described proteins. Notably, two ORFs were related to cellular transport systems and plasmid maintenance. Plasmid pPA3.0 was cloned and the gfp gene inserted to generate the pPAGFP vector. The vector was introduced into P. agglomerans isolates, and its stability proved to depend on the isolate genotype; 90% stability values were reached after 60 hours of bacterial cultivation in most evaluated isolates. In order to definitively establish P. agglomerans as an endophyte, the non-transformed bacterium was reintroduced into in vitro cultivated seedlings and the density of inner tissue colonization in inoculated plants was estimated by bacterium re-isolation, while the tissue niches preferred by the bacterium were investigated by scanning electron microscopy (SEM). Cells from P. agglomerans (strain ARB18) at similar densities were re-isolated from roots, stems and leaves, and colonization of parenchyma and xylem tissues was observed. The data suggested that P. agglomerans is a ubiquitous citrus endophyte harboring cryptic plasmids. These characteristics suggest the potential to use the bacterium as a vehicle to introduce new genes into host plants via endophytic bacterial transformation.
Abstract:
A thrombin-like enzyme, named BjussuSP-I, isolated from Bothrops jararacussu snake venom, is an acidic single-chain glycoprotein with Mr = 61,000, pI ~ 3.8 and 6% sugar. BjussuSP-I shows high proteolytic activity upon synthetic substrates, such as S-2238 and S-2288. It also shows procoagulant and kallikrein-like activity, but is unable to act on platelets and plasmin. These activities are inhibited by specific inhibitors of this class of enzymes. The complete cDNA sequence of BjussuSP-I, with 696 bp, encodes an open reading frame of 232 amino acid residues, which conserves the common domains of thrombin-like serine proteases. BjussuSP-I shows a high structural homology with other thrombin-like enzymes from snake venoms, in which common amino acid residues are identified as those corresponding to the catalytic site and the subsites S1, S2 and S3 already reported. In this study, we also demonstrated the importance of N-linked glycans for the thrombin-like activity of BjussuSP-I toxin. (c) 2007 Elsevier Masson SAS. All rights reserved.
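The two lengths quoted in this abstract can be checked against each other with simple arithmetic, since each codon spans three nucleotides. The helper below is a hypothetical illustration; the abstract does not say whether the 696 bp figure includes a stop codon, so the sketch only confirms that the numbers divide evenly.

```python
def codon_capacity(n_nucleotides: int) -> tuple:
    """Return (complete codons, leftover nucleotides) for a coding region of this length."""
    return divmod(n_nucleotides, 3)


# 696 bp corresponds to 232 complete codons with no nucleotides left over.
print(codon_capacity(696))
```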
Abstract:
The dnaA region of Wolbachia, an intracellular bacterial parasite of insects, is unique. A glnA cognate was found upstream of the dnaA gene, while neither of the two open reading frames detected downstream of dnaA has any homologue in the database. This unusual gene arrangement may reflect requirements associated with the unique ecological niche this agent occupies.
Abstract:
The complete nucleotide sequence of the genomic RNA from the insect picorna-like virus Drosophila C virus (DCV) was determined. The DCV sequence predicts a genome organization different from that of other RNA virus families whose sequences are known. The single-stranded positive-sense genomic RNA is 9264 nucleotides in length and contains two large open reading frames (ORFs), which are separated by 191 nucleotides. The 5' ORF contains regions of similarity with the RNA-dependent RNA polymerase, helicase and protease domains of viruses from the picornavirus, comovirus and sequivirus families. The 3' ORF encodes the capsid proteins, as confirmed by N-terminal sequence analysis of these proteins. The capsid protein coding region is unusual in two ways: firstly, the cistron appears to lack an initiating methionine, and secondly, no subgenomic RNA is produced, suggesting that the proteins may be translated through internal initiation of translation from the genomic-length RNA. The finding of this novel genome organization for DCV shows that this virus is not a member of the Picornaviridae as previously thought, but belongs to a distinct and hitherto unrecognized virus family.