76 resultados para PROTEIN-CODING GENES
em National Center for Biotechnology Information - NCBI
Resumo:
The fate of redundant genes resulting from genome duplication is poorly understood. Previous studies indicated that ribosomal RNA genes from one parental origin are epigenetically silenced during interspecific hybridization or polyploidization. Regulatory mechanisms for protein-coding genes in polyploid genomes are unknown, partly because of difficulty in studying expression patterns of homologous genes. Here we apply amplified fragment length polymorphism (AFLP)–cDNA display to perform a genome-wide screen for orthologous genes silenced in Arabidopsis suecica, an allotetraploid derived from Arabidopsis thaliana and Cardaminopsis arenosa. We identified ten genes that are silenced from either A. thaliana or C. arenosa origin in A. suecica and located in four of the five A. thaliana chromosomes. These genes represent a variety of RNA and predicted proteins including four transcription factors such as TCP3. The silenced genes in the vicinity of TCP3 are hypermethylated and reactivated by blocking DNA methylation, suggesting epigenetic regulation is involved in the expression of orthologous genes in polyploid genomes. Compared with classic genetic mutations, epigenetic regulation may be advantageous for selection and adaptation of polyploid species during evolution and development.
Resumo:
Few promoters are active at high levels in all cells. Of these, the majority encode structural RNAs transcribed by RNA polymerases I or III and are not accessible for the expression of proteins. An exception are the small nuclear RNAs (snRNAs) transcribed by RNA polymerase II. Although snRNA biosynthesis is unique and thought not to be compatible with synthesis of functional mRNA, we have tested these promoters for their ability to express functional mRNAs. We have used the murine U1a and U1b snRNA gene promoters to express the Escherichia coli lacZ gene and the human alpha-globin gene from either episomal or integrated templates by transfection, or infection into a variety of mammalian cell types. Equivalent expression of beta-galactosidase was obtained from < 250 nucleotides of 5'-flanking sequence containing the complete promoter of either U1 snRNA gene or from the 750-nt cytomegalovirus promoter and enhancer regions. The mRNA was accurately initiated at the U1 start site, efficiently spliced and polyadenylylated, and localized to polyribosomes. Recombinant adenovirus containing the U1b-lacZ chimeric gene transduced and expressed beta-galactosidase efficiently in human 293 cells and airway epithelial cells in culture. Viral vectors containing U1 snRNA promoters may be an attractive alternative to vectors containing viral promoters for persistent high-level expression of therapeutic genes or proteins.
Resumo:
The nucleotide sequences of four genes encoding Trimeresurus gramineus (green habu snake, crotalinae) venom gland phospholipase A2 (PLA2; phosphatidylcholine 2-acylhydrolase, EC 3.1.1.4) isozymes were compared internally and externally with those of six genes encoding Trimeresurus flavoviridis (habu snake, crotalinae) venom gland PLA2 isozymes. The numbers of nucleotide substitutions per site (KN) for the noncoding regions including introns were one-third to one-eighth of the numbers of nucleotide substitutions per synonymous site (KS) for the protein-coding regions of exons, indicating that the noncoding regions are much more conserved than the protein-coding regions. The KN values for the introns were found to be nearly equivalent to those of introns of T. gramineus and T. flavoviridis TATA box-binding protein genes, which are assumed to be a general (nonvenomous) gene. Thus, it is evident that the introns of venom gland PLA2 isozyme genes have evolved at a similar rate to those of nonvenomous genes. The numbers of nucleotide substitutions per nonsynonymous site (KA) were close to or larger than the KS values for the protein-coding regions in venom gland PLA2 isozyme genes. All of the data combined reveal that Darwinian-type accelerated evolution has universally occurred only in the protein-coding regions of crotalinae snake venom PLA2 isozyme genes.
Resumo:
Cells of several major algal groups are evolutionary chimeras of two radically different eukaryotic cells. Most of these “cells within cells” lost the nucleus of the former algal endosymbiont. But after hundreds of millions of years cryptomonads still retain the nucleus of their former red algal endosymbiont as a tiny relict organelle, the nucleomorph, which has three minute linear chromosomes, but their function and the nature of their ends have been unclear. We report extensive cryptomonad nucleomorph sequences (68.5 kb), from one end of each of the three chromosomes of Guillardia theta. Telomeres of the nucleomorph chromosomes differ dramatically from those of other eukaryotes, being repeats of the 23-mer sequence (AG)7AAG6A, not a typical hexamer (commonly TTAGGG). The subterminal regions comprising the rRNA cistrons and one protein-coding gene are exactly repeated at all three chromosome ends. Gene density (one per 0.8 kb) is the highest for any cellular genome. None of the 38 protein-coding genes has spliceosomal introns, in marked contrast to the chlorarachniophyte nucleomorph. Most identified nucleomorph genes are for gene expression or protein degradation; histone, tubulin, and putatively centrosomal ranbpm genes are probably important for chromosome segregation. No genes for primary or secondary metabolism have been found. Two of the three tRNA genes have introns, one in a hitherto undescribed location. Intergenic regions are exceptionally short; three genes transcribed by two different RNA polymerases overlap their neighbors. The reported sequences encode two essential chloroplast proteins, FtsZ and rubredoxin, thus explaining why cryptomonad nucleomorphs persist.
Resumo:
The phenomenon of RNA editing has been found to occur in chloroplasts of several angiosperm plants. Comparative analysis of the entire nucleotide sequence of a gymnosperm [Pinus thunbergii (black pine)] chloroplast genome allowed us to predict several potential editing sites in its transcripts. Forty-nine such sites from 14 genes/ORFs were analyzed by sequencing both cDNAs from the transcripts and the corresponding chloroplast DNA regions, and 26 RNA editing sites were identified in the transcripts from 12 genes/ORFs, indicating that chloroplast RNA editing is not restricted to angiosperms but occurs in the gymnosperm, too. All the RNA editing events are C-to-U conversions; however, many new codon substitutions and creation of stop codons that have not so far been reported in angiosperm chloroplasts were observed. The most striking is that two editing events result in the creation of an initiation and a stop codon within a single transcript, leading to the formation of a new reading frame of 33 codons. The predicted product is highly homologous to that deduced from the ycf7 gene (ORF31), which is conserved in the chloroplast genomes of many other plant species.
Resumo:
Intracellular transport is essential for morphogenesis and functioning of the cell. The kinesin superfamily proteins (KIFs) have been shown to transport membranous organelles and protein complexes in a microtubule- and ATP-dependent manner. More than 30 KIFs have been reported in mice. However, the nomenclature of KIFs has not been clearly established, resulting in various designations and redundant names for a single KIF. Here, we report the identification and classification of all KIFs in mouse and human genome transcripts. Previously unidentified murine KIFs were found by a PCR-based search. The identification of all KIFs was confirmed by a database search of the total human genome. As a result, there are a total of 45 KIFs. The nomenclature of all KIFs is presented. To understand the function of KIFs in intracellular transport in a single tissue, we focused on the brain. The expression of 38 KIFs was detected in brain tissue by Northern blotting or PCR using cDNA. The brain, mainly composed of highly differentiated and polarized cells such as neurons and glia, requires a highly complex intracellular transport system as indicated by the increased number of KIFs for their sophisticated functions. It is becoming increasingly clear that the cell uses a number of KIFs and tightly controls the direction, destination, and velocity of transportation of various important functional molecules, including mRNA. This report will set the foundation of KIF and intracellular transport research.
Resumo:
Gene number can be considered a pragmatic measure of biological complexity, but reliable data is scarce. Estimates for vertebrates are 50-100,000 genes per haploid genome, whereas invertebrate estimates fall below 25,000. We wished to test the hypothesis that the origin of vertebrates coincided with extensive gene creation. A prediction is that gene number will differ sharply between invertebrate and vertebrate members of the chordate phylum. A gene number estimation method requiring limited sequence sampling of genomic DNA was developed and validated by using data for Caenorhabditis elegans. Using the method, we estimated that the invertebrate chordate Ciona intestinalis has 15,500 protein-coding genes (±3,700). This number is significantly lower than gene numbers of vertebrate chordates, but similar to those of invertebrates in distantly related phyla. The data indicate that evolution of vertebrates was accompanied by a dramatic increase in protein-coding capacity of the genome.
Resumo:
Pre-mRNA splicing is among the last known nuclear events before export of mature mRNA to the cytoplasm. At present, it is not known whether splicing and mRNA export are biochemically coupled processes. In this study, we have injected pre-mRNAs containing a single intron or the same mRNAs lacking an intron (Δi-mRNAs) into Xenopus oocyte nuclei. We find that the spliced mRNAs are exported much more rapidly and efficiently than the identical Δi-mRNAs. Moreover, competition studies using excess Δi-mRNA indicate that different factor(s) are involved in the inefficient export of Δi-mRNA vs. the efficient export of spliced mRNA. Consistent with this conclusion, spliced mRNA and Δi-mRNA, though identical in sequence, are assembled into different messenger ribonucleoprotein particles (mRNP) in vitro. Strikingly, the mRNA in the spliced mRNP, but not in the Δi-mRNP, is exported rapidly and efficiently. We conclude that splicing generates a specific nucleoprotein complex that targets mRNA for export. Our results, revealing a link between splicing and efficient mRNA export, may explain the reports that an intron is required for efficient expression of many protein-coding genes in metazoans.
Resumo:
The determination of complete genome sequences provides us with an opportunity to describe and analyze evolution at the comprehensive level of genomes. Here we compare nine genomes with respect to their protein coding genes at two levels: (i) we compare genomes as “bags of genes” and measure the fraction of orthologs shared between genomes and (ii) we quantify correlations between genes with respect to their relative positions in genomes. Distances between the genomes are related to their divergence times, measured as the number of amino acid substitutions per site in a set of 34 orthologous genes that are shared among all the genomes compared. We establish a hierarchy of rates at which genomes have changed during evolution. Protein sequence identity is the most conserved, followed by the complement of genes within the genome. Next is the degree of conservation of the order of genes, whereas gene regulation appears to evolve at the highest rate. Finally, we show that some genomes are more highly organized than others: they show a higher degree of the clustering of genes that have orthologs in other genomes.
Resumo:
The recently sequenced genome of the parasitic bacterium Mycoplasma genitalium contains only 468 identified protein-coding genes that have been dubbed a minimal gene complement [Fraser, C.M., Gocayne, J.D., White, O., Adams, M.D., Clayton, R.A., et al. (1995) Science 270, 397-403]. Although the M. genitalium gene complement is indeed the smallest among known cellular life forms, there is no evidence that it is the minimal self-sufficient gene set. To derive such a set, we compared the 468 predicted M. genitalium protein sequences with the 1703 protein sequences encoded by the other completely sequenced small bacterial genome, that of Haemophilus influenzae. M. genitalium and H. influenzae belong to two ancient bacterial lineages, i.e., Gram-positive and Gram-negative bacteria, respectively. Therefore, the genes that are conserved in these two bacteria are almost certainly essential for cellular function. It is this category of genes that is most likely to approximate the minimal gene set. We found that 240 M. genitalium genes have orthologs among the genes of H. influenzae. This collection of genes falls short of comprising the minimal set as some enzymes responsible for intermediate steps in essential pathways are missing. The apparent reason for this is the phenomenon that we call nonorthologous gene displacement when the same function is fulfilled by nonorthologous proteins in two organisms. We identified 22 nonorthologous displacements and supplemented the set of orthologs with the respective M. genitalium genes. After examining the resulting list of 262 genes for possible functional redundancy and for the presence of apparently parasite-specific genes, 6 genes were removed. We suggest that the remaining 256 genes are close to the minimal gene set that is necessary and sufficient to sustain the existence of a modern-type cell. Most of the proteins encoded by the genes from the minimal set have eukaryotic or archaeal homologs but seven key proteins of DNA replication do not. We speculate that the last common ancestor of the three primary kingdoms had an RNA genome. Possibilities are explored to further reduce the minimal set to model a primitive cell that might have existed at a very early stage of life evolution.
Resumo:
We have isolated a novel cDNA, that appears to represent a new class of ion channels, by using the yeast two-hybrid system and the SH3 domain of the neural form of Src (N-src) as a bait. The encoded polypeptide, BCNG-1, is distantly related to cyclic nucleotide-gated channels and the voltage-gated channels, Eag and H-erg. BCNG-1 is expressed exclusively in the brain, as a glycosylated protein of ≈132 kDa. Immunohistochemical analysis indicates that BCNG-1 is preferentially expressed in specific subsets of neurons in the neocortex, hippocampus, and cerebellum, in particular pyramidal neurons and basket cells. Within individual neurons, the BCNG-1 protein is localized to either the dendrites or the axon terminals depending on the cell type. Southern blot analysis shows that several other BCNG-related sequences are present in the mouse genome, indicating the emergence of an entire subfamily of ion channel coding genes. These findings suggest the existence of a new type of ion channel, which is potentially able to modulate membrane excitability in the brain and could respond to regulation by cyclic nucleotides.
Resumo:
Typical general transcription factors, such as TATA binding protein and TFII B, have not yet been identified in any member of the Trypanosomatidae family of parasitic protozoa. Interestingly, mRNA coding genes do not appear to have discrete transcriptional start sites, although in most cases they require an RNA polymerase that has the biochemical properties of eukaryotic RNA polymerase II. A discrete transcription initiation site may not be necessary for mRNA synthesis since the sequences upstream of each transcribed coding region are trimmed from the nascent transcript when a short m7G-capped RNA is added during mRNA maturation. This short 39 nt m7G-capped RNA, the spliced leader (SL) sequence, is expressed as an ∼100 nt long RNA from a set of reiterated, though independently transcribed, genes in the trypanosome genome. Punctuation of the 5′ end of mRNAs by a m7G cap-containing spliced leader is a developing theme in the lower eukaryotic world; organisms as diverse as Euglena and nematode worms, including Caenorhabditis elegans, utilize SL RNA in their mRNA maturation programs. Towards understanding the coordination of SL RNA and mRNA expression in trypanosomes, we have begun by characterizing SL RNA gene expression in the model trypanosome Leptomonas seymouri. Using a homologous in vitro transcription system, we demonstrate in this study that the SL RNA is transcribed by RNA polymerase II. During SL RNA transcription, accurate initiation is determined by an initiator element with a loose consensus of CYAC/AYR(+1). This element, as well as two additional basal promoter elements, is divergent in sequence from the basal transcription elements seen in other eukaryotic gene promoters. We show here that the in vitro transcription extract contains a binding activity that is specific for the initiator element and thus may participate in recruiting RNA polymerase II to the SL RNA gene promoter.
Resumo:
Chlorarachniophyte algae contain a complex, multi-membraned chloroplast derived from the endosymbiosis of a eukaryotic alga. The vestigial nucleus of the endosymbiont, called the nucleomorph, contains only three small linear chromosomes with a haploid genome size of 380 kb and is the smallest known eukaryotic genome. Nucleotide sequence data from a subtelomeric fragment of chromosome III were analyzed as a preliminary investigation of the coding capacity of this vestigial genome. Several housekeeping genes including U6 small nuclear RNA (snRNA), ribosomal proteins S4 and S13, a core protein of the spliceosome [small nuclear ribonucleoprotein (snRNP) E], and a cip-like protease (clpP) were identified. Expression of these genes was confirmed by combinations of Northern blot analysis, in situ hybridization, immunocytochemistry, and cDNA analysis. The protein-encoding genes are typically eukaryotic in overall structure and their messenger RNAs are polyadenylylated. A novel feature is the abundance of 18-, 19-, or 20-nucleotide introns; the smallest spliceosomal introns known. Two of the genes, U6 and S13, overlap while another two genes, snRNP E and clpP, are cotranscribed in a single mRNA. The overall gene organization is extraordinarily compact, making the nucleomorph a unique model for eukaryotic genomics.
Resumo:
We report that promoters for two murine acute-phase protein (APP) genes, complement factor 3 (C3) and serum amyloid A3 (SAA3), can increase recombinant protein expression in response to inflammatory stimuli in vivo. To deliver APP promoter-luciferase reporter gene constructs to the liver, where most endogenous APP synthesis occurs, we introduced them into a nonreplicating adenovirus vector and injected the purified viruses intravenously into mice. When compared with the low levels of basal luciferase expression observed prior to inflammatory challenge, markedly increased expression from the C3 promoter was detected in liver in response to both lipopolysaccharide (LPS) and turpentine, and lower-level inducible expression was also found in lung. In contrast, expression from the SAA3 promoter was found only in liver and was much more responsive to LPS than to turpentine. After LPS challenge, hepatic luciferase expression increased rapidly and in proportion to the LPS dose. Use of cytokine-inducible promoters in gene transfer vectors may make it possible to produce antiinflammatory proteins in vivo in direct relationship to the intensity and duration of an individual's inflammatory response. By providing endogenously controlled production of recombinant antiinflammatory proteins, this approach might limit the severity of the inflammatory response without interfering with the beneficial components of host defense and immunity.
Resumo:
In Trypanosoma brucei, transcription by RNA polymerase II and 5′ capping of messenger RNA are uncoupled: a capped spliced leader is trans spliced to every RNA. This decoupling makes it possible to have protein-coding gene transcription driven by RNA polymerase I. Indeed, indirect evidence suggests that the genes for the major surface glycoproteins, variant surface glycoproteins (VSGs) in bloodstream-form trypanosomes, are transcribed by RNA polymerase I. In a single trypanosome, only one VSG expression site is maximally transcribed at any one time, and it has been speculated that transcription takes place at a unique site within the nucleus, perhaps in the nucleolus. We tested this by using fluorescence in situ hybridization. With probes that cover about 50 kb of the active 221 expression site, we detected nuclear transcripts of this site in a single fluorescent spot, which did not colocalize with the nucleolus. Analysis of marker gene-tagged active expression site DNA by fluorescent DNA in situ hybridization confirmed the absence of association with the nucleolus. Even an active expression site in which the promoter had been replaced by an rDNA promoter did not colocalize with the nulceolus. As expected, marker genes inserted in the rDNA array predominantly colocalize with the nucleolus, whereas the tubulin gene arrays do not. We conclude that transcription of the active VSG expression site does not take place in the nucleolus.