976 results for OPEN READING FRAMES
Abstract:
The function of many of the uncharacterized open reading frames discovered by genomic sequencing can be determined at the level of expressed gene products, the proteome. However, identifying the cognate gene from minute amounts of protein has been one of the major problems in molecular biology. Using yeast as an example, we demonstrate here that mass spectrometric protein identification is a general solution to this problem given a completely sequenced genome. As a first screen, our strategy uses automated laser desorption ionization mass spectrometry of the peptide mixtures produced by in-gel tryptic digestion of a protein. Up to 90% of proteins are identified by searching sequence data bases by lists of peptide masses obtained with high accuracy. The remaining proteins are identified by partially sequencing several peptides of the unseparated mixture by nanoelectrospray tandem mass spectrometry followed by data base searching with multiple peptide sequence tags. In blind trials, the method led to unambiguous identification in all cases. In the largest individual protein identification project to date, a total of 150 gel spots—many of them at subpicomole amounts—were successfully analyzed, greatly enlarging a yeast two-dimensional gel data base. More than 32 proteins were novel and matched to previously uncharacterized open reading frames in the yeast genome. This study establishes that mass spectrometry provides the required throughput, the certainty of identification, and the general applicability to serve as the method of choice to connect genome and proteome.
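The first-pass screen described above (matching a list of measured tryptic peptide masses against a sequence database) can be sketched as follows. This is a minimal illustration, not the search software used in the study: the function names, the 50 ppm tolerance, the simple hit-count score, and the two toy sequences are all assumptions made for the example. Masses are neutral monoisotopic values, and missed cleavages and modifications are ignored.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # added once per peptide for the free termini


def tryptic_peptides(seq):
    """Cleave after K or R, except when the next residue is P."""
    peptides, start = [], 0
    for i, aa in enumerate(seq):
        nxt = seq[i + 1] if i + 1 < len(seq) else ""
        if aa in "KR" and nxt != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    if start < len(seq):
        peptides.append(seq[start:])
    return peptides


def peptide_mass(pep):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(MONO[a] for a in pep) + WATER


def count_matches(observed, seq, tol_ppm=50.0):
    """Number of observed masses explained by a tryptic peptide of seq."""
    theo = [peptide_mass(p) for p in tryptic_peptides(seq)]
    return sum(
        1 for m in observed
        if any(abs(m - t) / t * 1e6 <= tol_ppm for t in theo)
    )


def rank_database(observed, database, tol_ppm=50.0):
    """Rank database entries by hit count (hypothetical scoring scheme)."""
    scores = {name: count_matches(observed, s, tol_ppm)
              for name, s in database.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy database: both sequences are invented for illustration.
toy_db = {"orf_a": "MAGKLLSRVVEK", "orf_b": "GGGKAAARTTTK"}
observed = [peptide_mass(p) for p in tryptic_peptides(toy_db["orf_a"])]
ranking = rank_database(observed, toy_db)
```

A real search would also have to handle protonated ions, missed cleavages, modifications, and a statistically grounded score; the hit count here only shows why a complete genome sequence makes the lookup possible.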
Abstract:
Increased histone acetylation has been correlated with increased transcription, and regions of heterochromatin are generally hypoacetylated. In investigating the cause-and-effect relationship between histone acetylation and gene activity, we have characterized two yeast histone deacetylase complexes. Histone deacetylase-A (HDA) is an ≈350-kDa complex that is highly sensitive to the deacetylase inhibitor trichostatin A. Histone deacetylase-B (HDB) is an ≈600-kDa complex that is much less sensitive to trichostatin A. The HDA1 protein (a subunit of the HDA activity) shares sequence similarity to RPD3, a factor required for optimal transcription of certain yeast genes. RPD3 is associated with the HDB activity. HDA1 also shares similarity to three new open reading frames in yeast, designated HOS1, HOS2, and HOS3. We find that both hda1 and rpd3 deletions increase acetylation levels in vivo at all sites examined in both core histones H3 and H4, with rpd3 deletions having a greater impact on histone H4 lysine positions 5 and 12. Surprisingly, both hda1 and rpd3 deletions increase repression at telomeric loci, which resemble heterochromatin, with rpd3 having a greater effect. In addition, rpd3 deletions retard full induction of the PHO5 promoter fused to the reporter lacZ. These data demonstrate that histone acetylation state has a role in regulating both heterochromatic silencing and regulated gene expression.
A computationally directed screen identifying interacting coiled coils from Saccharomyces cerevisiae
Abstract:
Computational methods can frequently identify protein-interaction motifs in otherwise uncharacterized open reading frames. However, the identification of candidate ligands for these motifs (e.g., so that partnering can be determined experimentally in a directed manner) is often beyond the scope of current computational capabilities. One exception is provided by the coiled-coil interaction motif, which consists of two or more α helices that wrap around each other: the ligands for coiled-coil sequences are generally other coiled-coil sequences, thereby greatly simplifying the motif/ligand recognition problem. Here, we describe a two-step approach to identifying protein–protein interactions mediated by two-stranded coiled coils that occur in Saccharomyces cerevisiae. Coiled coils from the yeast genome are first predicted computationally, by using the multicoil program, and associations between coiled coils are then determined experimentally by using the yeast two-hybrid assay. We report 213 unique interactions between 162 putative coiled-coil sequences. We evaluate the resulting interactions, focusing on associations identified between components of the spindle pole body (the yeast centrosome).
Abstract:
Group II introns are widely believed to have been ancestors of spliceosomal introns, yet little is known about their own evolutionary history. In order to address the evolution of mobile group II introns, we have compiled 71 open reading frames (ORFs) related to group II intron reverse transcriptases and subjected their derived amino acid sequences to phylogenetic analysis. The phylogenetic tree was rooted with reverse transcriptases (RTs) of non-long terminal repeat retroelements, and the inferred phylogeny reveals two major clusters which we term the mitochondrial and chloroplast-like lineages. Bacterial ORFs are mainly positioned at the bases of the two lineages but with weak bootstrap support. The data give an overview of an apparently high degree of horizontal transfer of group II intron ORFs, mostly among related organisms but also between organelles and bacteria. The Zn domain (nuclease) and YADD motif (RT active site) were lost multiple times during evolution. Differences in domain structures suggest that the oldest ORFs were concise, while the ORF in the mitochondrial lineage subsequently expanded in three locations. The data are consistent with a bacterial origin for mobile group II introns.
Abstract:
Using the representation difference analysis technique, we have identified a novel gene, Ian4, which is preferentially expressed in hematopoietic precursor 32D cells transfected with wild-type versus mutant forms of the Bcr/Abl oncogene. Ian4 expression was undetectable in 32D cells transfected with v-src, oncogenic Ha-ras or v-Abl. Murine Ian4 maps to chromosome 6, 25 cM from the centromere. The Ian4 mRNA contains two open reading frames (ORFs) separated by 5 nt. The first ORF has the potential to encode a polypeptide of 67 amino acids without apparent homology to known proteins. The second ORF encodes a protein of 301 amino acids with a GTP/ATP-binding site in the N-terminus and a hydrophobic domain in the extreme C-terminus. The IAN-4 protein resides in the mitochondrial outer membrane and the last 20 amino acids are necessary for this localization. The IAN-4 protein has GTP-binding activity and shares sequence homology with a novel family of putative GTP-binding proteins: the immuno-associated nucleotide (IAN) family.
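How two ORFs can sit on one mRNA with a short gap between them is easiest to see with a small forward-strand ORF scan. The sketch below is illustrative only: `find_orfs`, the `min_codons` cutoff, and the toy sequence are assumptions, and the real Ian4 sequence is not used. The toy sequence is built so that its two ORFs fall in different frames and are separated by 5 nt, mirroring the arrangement the abstract reports.

```python
STOPS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=3):
    """Scan the three forward frames for ATG...stop ORFs.

    Returns (start, end, frame) tuples with 0-based, end-exclusive
    coordinates; min_codons counts codons before the stop codon.
    """
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA letters
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens the ORF
            elif start is not None and codon in STOPS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # resume the search after the stop codon
    return sorted(orfs)


# Invented sequence: ORF 1, a 5-nt spacer, then ORF 2 in another frame.
toy_mrna = "CC" + "ATG" + "GCT" * 3 + "TAA" + "GGGGG" + "ATG" + "AAA" * 4 + "TGA"
orfs = find_orfs(toy_mrna)
gap = orfs[1][0] - orfs[0][1]  # nucleotides between the two ORFs
```

Because the spacer length is not a multiple of three, the second ORF is necessarily read in a different frame from the first.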
Abstract:
The non-coding RNAs database (http://biobases.ibch.poznan.pl/ncRNA/) contains currently available data on RNAs that do not have long open reading frames and act as riboregulators. Non-coding RNAs are involved in the specific recognition of cellular nucleic acid targets through complementary base pairing to control cell growth and differentiation. Some of them are connected with several well-known developmental and neurobehavioral disorders. We have divided them into four groups. This paper is a short introduction to the database and presents its latest, updated edition.
Abstract:
We present here the complete genome sequence of a common avian clone of Pasteurella multocida, Pm70. The genome of Pm70 is a single circular chromosome 2,257,487 base pairs in length and contains 2,014 predicted coding regions, 6 ribosomal RNA operons, and 57 tRNAs. Genome-scale evolutionary analyses based on pairwise comparisons of 1,197 orthologous sequences between P. multocida, Haemophilus influenzae, and Escherichia coli suggest that P. multocida and H. influenzae diverged ≈270 million years ago and the γ subdivision of the proteobacteria radiated about 680 million years ago. Two previously undescribed open reading frames, accounting for ≈1% of the genome, encode large proteins with homology to the virulence-associated filamentous hemagglutinin of Bordetella pertussis. Consistent with the critical role of iron in the survival of many microbial pathogens, in silico and whole-genome microarray analyses identified more than 50 Pm70 genes with a potential role in iron acquisition and metabolism. Overall, the complete genomic sequence and preliminary functional analyses provide a foundation for future research into the mechanisms of pathogenesis and host specificity of this important multispecies pathogen.
Abstract:
The flavoprotein (R)-(+)-mandelonitrile lyase (MDL; EC 4.1.2.10), which plays a key role in cyanogenesis in rosaceous stone fruits, occurs in black cherry (Prunus serotina Ehrh.) homogenates as several closely related isoforms. Biochemical and molecular biological methods were used to investigate MDL microheterogeneity and function in this species. Three novel MDL cDNAs of high sequence identity (designated MDL2, MDL4, and MDL5) were isolated. Like MDL1 and MDL3 cDNAs (Z. Hu, J.E. Poulton [1997] Plant Physiol 115: 1359–1369), they had open reading frames that predicted a flavin adenine dinucleotide-binding site, multiple N-glycosylation sites, and an N-terminal signal sequence. The N terminus of an MDL isoform purified from seedlings matched the derived amino acid sequence of the MDL4 cDNA. Genomic sequences corresponding to the MDL1, MDL2, and MDL4 cDNAs were obtained by polymerase chain reaction amplification of genomic DNA. Like the previously reported mdl3 gene, these genes are interrupted at identical positions by three short, conserved introns. Given their overall similarity, we conclude that the genes mdl1, mdl2, mdl3, mdl4, and mdl5 are derived from a common ancestral gene and constitute members of a gene family. Genomic Southern-blot analysis showed that this family has approximately eight members. Northern-blot analysis using gene-specific probes revealed differential expression of the genes mdl1, mdl2, mdl3, mdl4, and mdl5.
Abstract:
Two different RNA editing systems have been described in the kinetoplast-mitochondrion of trypanosomatid protists. The first involves the precise insertion and deletion of U residues mostly within the coding regions of maxicircle-encoded mRNAs to produce open reading frames. This editing is mediated by short overlapping complementary guide RNAs encoded in both the maxicircle and the minicircle molecules and involves a series of enzymatic cleavage-ligation steps. The second editing system is a C34 to U34 modification in the anticodon of the imported tRNATrp, thereby permitting the decoding of the UGA stop codon as tryptophan. U-insertion editing probably originated in an ancestor of the kinetoplastid lineage and appears to have evolved in some cases by the replacement of the original pan-edited cryptogene with a partially edited cDNA. The driving force for the evolutionary fixation of these retroposition events was postulated to be the stochastic loss of entire minicircle sequence classes and their encoded guide RNAs upon segregation of the single kinetoplast DNA network into daughter cells at cell division. A large plasticity in the relative abundance of minicircle sequence classes has been observed during cell culture in the laboratory. Computer simulations provide theoretical evidence for this plasticity if a random distribution and segregation model of minicircles is assumed. The possible evolutionary relationship of the C to U and U-insertion editing systems is discussed.
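The random distribution and segregation model mentioned above can be illustrated with a short simulation. This is a sketch under stated assumptions, not the authors' simulation: every minicircle is assumed to replicate exactly once per generation, the doubled pool is split by drawing half of it at random, one daughter lineage is followed, and the class names and starting copy numbers are invented.

```python
import random
from collections import Counter


def simulate_segregation(class_counts, generations, seed=0):
    """Follow one daughter lineage under random minicircle segregation.

    class_counts maps a minicircle sequence class to its copy number.
    Returns a list of per-generation count dictionaries; a class that
    drops to zero copies disappears from later dictionaries.
    """
    rng = random.Random(seed)
    counts = dict(class_counts)
    history = [dict(counts)]
    for _ in range(generations):
        # Replication: every minicircle is duplicated once.
        pool = [cls for cls, n in counts.items() for _ in range(2 * n)]
        # Segregation: one daughter receives a random half of the pool.
        daughter = rng.sample(pool, len(pool) // 2)
        counts = dict(Counter(daughter))
        history.append(dict(counts))
    return history


# Hypothetical starting network: three abundant and two rare classes.
start = {"mc1": 400, "mc2": 300, "mc3": 250, "mc4": 3, "mc5": 2}
history = simulate_segregation(start, generations=200, seed=1)
```

Under this model the network size stays constant while the relative abundance of each class drifts, and a class that reaches zero copies cannot return, which is the stochastic loss of guide RNA classes that the abstract invokes.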
Abstract:
Two distinct cDNA clones encoding the glutamate decarboxylase (GAD) isoenzymes GAD1 and GAD2 from Arabidopsis (L.) Heynh. were characterized. The open reading frames for GAD1 and GAD2 were expressed in Escherichia coli and the recombinant proteins were purified by affinity chromatography. Analysis of the recombinant proteins by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and immunoblot analysis suggest that GAD1 and GAD2 encode 58- and 56-kD peptides, respectively. The enzymatic activities of the pure recombinant GAD1 and GAD2 proteins were stimulated 35- and 13-fold, respectively, by Ca2+/calmodulin but not by Ca2+ or calmodulin alone. Southern-blot analysis of genomic DNA suggests that there is only one copy of each gene in Arabidopsis. The GAD1 transcript and a corresponding 58-kD peptide were detected in roots only. Conversely, the GAD2 transcript and a corresponding 56-kD peptide were detected in all organs tested. The specific activity, GAD2 transcript, and 56-kD peptide increased in leaves of plants treated with 10 mM NH4Cl, 5 mM NH4NO3, 5 mM glutamic acid, or 5 mM glutamine as the sole nitrogen source compared with samples from plants treated with 10 mM KNO3. The results from these experiments suggest that in leaves GAD activity is partially controlled by gene expression or RNA stability. Results from preliminary analyses of different tissues imply that these tendencies were not the same in flower stalks and flowers, suggesting that other factors may control GAD activity in these organs. The results from this investigation demonstrate that GAD activity in leaves is altered by different nitrogen treatments, suggesting that GAD2 may play a unique role in nitrogen metabolism.
Abstract:
The human cytomegalovirus (HCMV) early glycoprotein products of the US11 and US2 open reading frames cause increased turnover of major histocompatibility complex (MHC) class I heavy chains. Since US2 is homologous to another HCMV gene (US3), we hypothesized that the US3 gene product also may affect MHC class I expression. In cells constitutively expressing the HCMV US3 gene, MHC class I heavy chains formed a stable complex with β2-microglobulin. However, maturation of the N-linked glycan of MHC class I heavy chains was impaired in US3+ cells. The glycoprotein product of US3 (gpUS3) occurs mostly in a high-mannose form and coimmunoprecipitates with β2-microglobulin-associated class I heavy chains. Mature class I molecules were detected at steady state on the surface of US3+ cells, as in control cells. Substantial perinuclear accumulation of heavy chains was observed in US3+ cells. The data suggest that gpUS3 impairs egress of MHC class I heavy chains from the endoplasmic reticulum.
Abstract:
The antimycobacterial compound ethambutol [Emb; dextro-2,2'-(ethylenediimino)-di-1-butanol] is used to treat tuberculosis as well as disseminated infections caused by Mycobacterium avium. The critical target for Emb lies in the pathway for the biosynthesis of cell wall arabinogalactan, but the molecular mechanisms for drug action and resistance are unknown. The cellular target for Emb was sought using drug resistance, via target overexpression by a plasmid vector, as a selection tool. This strategy led to the cloning of the M. avium emb region which rendered the otherwise susceptible Mycobacterium smegmatis host resistant to Emb. This region contains three complete open reading frames (ORFs), embR, embA, and embB. The translationally coupled embA and embB genes are necessary and sufficient for an Emb-resistant phenotype which depends on gene copy number, and their putative novel membrane proteins are homologous to each other. The predicted protein encoded by embR, which is related to known transcriptional activators from Streptomyces, is expendable for the phenotypic expression of Emb resistance, but an intact divergent promoter region between embR and embAB is required. An Emb-sensitive cell-free assay for arabinan biosynthesis shows that overexpression of embAB is associated with high-level Emb-resistant arabinosyl transferase activity, and that embR appears to modulate the in vitro level of this activity. These data suggest that embAB encode the drug target of Emb, the arabinosyl transferase responsible for the polymerization of arabinose into the arabinan of arabinogalactan, and that overproduction of this Emb-sensitive target leads to Emb resistance.
Abstract:
The whole genome sequence (1.83 Mbp) of Haemophilus influenzae strain Rd was searched to identify tandem oligonucleotide repeat sequences. Loss or gain of one or more nucleotide repeats through a recombination-independent slippage mechanism is known to mediate phase variation of surface molecules of pathogenic bacteria, including H. influenzae. This facilitates evasion of host defenses and adaptation to the varying microenvironments of the host. We reasoned that iterative nucleotides could identify novel genes relevant to microbe-host interactions. Our search of the Rd genome sequence identified 9 novel loci with multiple (range 6–36, mean 22) tandem tetranucleotide repeats. All were found to be located within putative open reading frames and included homologues of hemoglobin-binding proteins of Neisseria, a glycosyltransferase (lgtC gene product) of Neisseria, and an adhesin of Yersinia. These tetranucleotide repeat sequences were also shown to be present in two other epidemiologically different H. influenzae type b strains, although the number and distribution of repeats differed. Further characterization of the lgtC gene showed that it was involved in phenotypic switching of a lipopolysaccharide epitope and that this variable expression was associated with changes in the number of tetranucleotide repeats. Mutation of lgtC resulted in attenuated virulence of H. influenzae in an infant rat model of invasive infection. These data indicate the rapidity, economy, and completeness with which whole genome sequences can be used to investigate the biology of pathogenic bacteria.
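A genome-wide search for tandem tetranucleotide repeats of the kind described above can be sketched with a back-referencing regular expression. This is an illustrative sketch rather than the authors' search procedure: the function name is hypothetical, the default threshold of 6 copies simply mirrors the lower end of the reported range, and the test sequence is invented.

```python
import re


def find_tetranucleotide_repeats(seq, min_copies=6):
    """Return (start, unit, copies) for each tandem 4-nt repeat tract.

    Coordinates are 0-based. Units that are really mono- or
    dinucleotide repeats (AAAA, ACAC) are skipped. The unit is
    reported in the phase in which the scan first meets the tract.
    """
    seq = seq.upper()
    # ([ACGT]{4}) captures a unit; \1{n,} demands n further exact copies.
    pattern = re.compile(r"([ACGT]{4})\1{%d,}" % (min_copies - 1))
    hits = []
    for match in pattern.finditer(seq):
        unit = match.group(1)
        if unit[:2] == unit[2:]:
            continue  # period shorter than 4, not a true tetranucleotide
        hits.append((match.start(), unit, len(match.group(0)) // 4))
    return hits


# Invented example: 8 copies of CAAT flanked by unrelated bases.
toy_genome = "GG" + "CAAT" * 8 + "TT"
hits = find_tetranucleotide_repeats(toy_genome)
```

Comparing the copy number at a locus across strains, or checking whether a change in copy number shifts the downstream reading frame, would build on the tuples returned here.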
Abstract:
Agrobacterium tumefaciens, a bacterial plant pathogen, when transformed with plasmid constructs containing greater than unit length DNA of tomato leaf curl geminivirus accumulates viral replicative form DNAs indistinguishable from those produced in infected plants. The accumulation of the viral DNA species depends on the presence of two origins of replication in the DNA constructs and is drastically reduced by introducing mutations into the viral replication-associated protein (Rep or C1) ORF, indicating that an active viral replication process is occurring in the bacterial cell. The accumulation of these viral DNA species is not affected by mutations or deletions in the other viral open reading frames. The observation that geminivirus DNA replication functions are supported by the bacterial cellular machinery provides evidence for the theory that these circular single-stranded DNA viruses have evolved from prokaryotic episomal replicons.
Abstract:
A chromosomal locus required for copper resistance and competitive fitness was cloned from a strain of Pseudomonas fluorescens isolated from copper-contaminated agricultural soil. Sequence analysis of this locus revealed six open reading frames with homology to genes involved in cytochrome c biogenesis in other bacteria, helC, cycJ, cycK, tipB, cycL, and cycH, with the closest similarity being to the aeg-46.5(yej) region of the Escherichia coli chromosome. The proposed functions of these genes in other bacteria include the binding, transport, and coupling of heme to apocytochrome c in the periplasm of these Gram-negative bacteria. Putative heme-binding motifs were present in the predicted products of cycK and cycL, and TipB contained a putative disulfide oxidoreductase active site proposed to maintain the heme-binding site of the apocytochrome in a reduced state for ligation of heme. Tn3-gus mutagenesis showed that expression of the genes was constitutive but enhanced by copper, and confirmed that the genes function both in copper resistance and production of active cytochrome c. However, two mutants in cycH were copper-sensitive and oxidase-positive, suggesting that the functions of these genes, rather than cytochrome c oxidase itself, were required for resistance to copper.