958 resultados para PROTEIN-CODING GENES


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Gene number can be considered a pragmatic measure of biological complexity, but reliable data is scarce. Estimates for vertebrates are 50-100,000 genes per haploid genome, whereas invertebrate estimates fall below 25,000. We wished to test the hypothesis that the origin of vertebrates coincided with extensive gene creation. A prediction is that gene number will differ sharply between invertebrate and vertebrate members of the chordate phylum. A gene number estimation method requiring limited sequence sampling of genomic DNA was developed and validated by using data for Caenorhabditis elegans. Using the method, we estimated that the invertebrate chordate Ciona intestinalis has 15,500 protein-coding genes (±3,700). This number is significantly lower than gene numbers of vertebrate chordates, but similar to those of invertebrates in distantly related phyla. The data indicate that evolution of vertebrates was accompanied by a dramatic increase in protein-coding capacity of the genome.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Pre-mRNA splicing is among the last known nuclear events before export of mature mRNA to the cytoplasm. At present, it is not known whether splicing and mRNA export are biochemically coupled processes. In this study, we have injected pre-mRNAs containing a single intron or the same mRNAs lacking an intron (Δi-mRNAs) into Xenopus oocyte nuclei. We find that the spliced mRNAs are exported much more rapidly and efficiently than the identical Δi-mRNAs. Moreover, competition studies using excess Δi-mRNA indicate that different factor(s) are involved in the inefficient export of Δi-mRNA vs. the efficient export of spliced mRNA. Consistent with this conclusion, spliced mRNA and Δi-mRNA, though identical in sequence, are assembled into different messenger ribonucleoprotein particles (mRNP) in vitro. Strikingly, the mRNA in the spliced mRNP, but not in the Δi-mRNP, is exported rapidly and efficiently. We conclude that splicing generates a specific nucleoprotein complex that targets mRNA for export. Our results, revealing a link between splicing and efficient mRNA export, may explain the reports that an intron is required for efficient expression of many protein-coding genes in metazoans.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The determination of complete genome sequences provides us with an opportunity to describe and analyze evolution at the comprehensive level of genomes. Here we compare nine genomes with respect to their protein coding genes at two levels: (i) we compare genomes as “bags of genes” and measure the fraction of orthologs shared between genomes and (ii) we quantify correlations between genes with respect to their relative positions in genomes. Distances between the genomes are related to their divergence times, measured as the number of amino acid substitutions per site in a set of 34 orthologous genes that are shared among all the genomes compared. We establish a hierarchy of rates at which genomes have changed during evolution. Protein sequence identity is the most conserved, followed by the complement of genes within the genome. Next is the degree of conservation of the order of genes, whereas gene regulation appears to evolve at the highest rate. Finally, we show that some genomes are more highly organized than others: they show a higher degree of the clustering of genes that have orthologs in other genomes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The recently sequenced genome of the parasitic bacterium Mycoplasma genitalium contains only 468 identified protein-coding genes that have been dubbed a minimal gene complement [Fraser, C.M., Gocayne, J.D., White, O., Adams, M.D., Clayton, R.A., et al. (1995) Science 270, 397-403]. Although the M. genitalium gene complement is indeed the smallest among known cellular life forms, there is no evidence that it is the minimal self-sufficient gene set. To derive such a set, we compared the 468 predicted M. genitalium protein sequences with the 1703 protein sequences encoded by the other completely sequenced small bacterial genome, that of Haemophilus influenzae. M. genitalium and H. influenzae belong to two ancient bacterial lineages, i.e., Gram-positive and Gram-negative bacteria, respectively. Therefore, the genes that are conserved in these two bacteria are almost certainly essential for cellular function. It is this category of genes that is most likely to approximate the minimal gene set. We found that 240 M. genitalium genes have orthologs among the genes of H. influenzae. This collection of genes falls short of comprising the minimal set as some enzymes responsible for intermediate steps in essential pathways are missing. The apparent reason for this is the phenomenon that we call nonorthologous gene displacement when the same function is fulfilled by nonorthologous proteins in two organisms. We identified 22 nonorthologous displacements and supplemented the set of orthologs with the respective M. genitalium genes. After examining the resulting list of 262 genes for possible functional redundancy and for the presence of apparently parasite-specific genes, 6 genes were removed. We suggest that the remaining 256 genes are close to the minimal gene set that is necessary and sufficient to sustain the existence of a modern-type cell. Most of the proteins encoded by the genes from the minimal set have eukaryotic or archaeal homologs but seven key proteins of DNA replication do not. We speculate that the last common ancestor of the three primary kingdoms had an RNA genome. Possibilities are explored to further reduce the minimal set to model a primitive cell that might have existed at a very early stage of life evolution.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Petunia hybrida is a popular bedding plant that has a long history as a genetic model system. We report the whole-genome sequencing and assembly of inbred derivatives of its two wild parents, P. axillaris N and P. inflata S6. The assemblies include 91.3% and 90.2% coverage of their diploid genomes (1.4 Gb; 2n = 14) containing 32,928 and 36,697 protein-coding genes, respectively. The genomes reveal that the Petunia lineage has experienced at least two rounds of hexaploidization: the older gamma event, which is shared with most Eudicots, and a more recent Solanaceae event that is shared with tomato and other solanaceous species. Transcription factors involved in the shift from bee to moth pollination reside in particularly dynamic regions of the genome, which may have been key to the remarkable diversity of floral colour patterns and pollination systems. The high-quality genome sequences will enhance the value of Petunia as a model system for research on unique biological phenomena such as small RNAs, symbiosis, self-incompatibility and circadian rhythms.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Do non-coding RNAs that are derived from the introns and exons of protein-coding and non-protein-coding genes represent a fundamental advance in the genetic operating system of higher organisms? Recent evidence from comparative genomics and molecular genetics indicates that this might be the case. If so, there will be profound consequences for our understanding of the genetics of these organisms, and in particular how the trajectories of differentiation and development and the differences among individuals and species are genomically programmed. But how might this hypothesis be tested?

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Mammalian promoters can be separated into two classes, conserved TATA box-enriched promoters, which initiate at a welldefined site, and more plastic, broad and evolvable CpG-rich promoters. We have sequenced tags corresponding to several hundred thousand transcription start sites (TSSs) in the mouse and human genomes, allowing precise analysis of the sequence architecture and evolution of distinct promoter classes. Different tissues and families of genes differentially use distinct types of promoters. Our tagging methods allow quantitative analysis of promoter usage in different tissues and show that differentially regulated alternative TSSs are a common feature in protein-coding genes and commonly generate alternative N termini. Among the TSSs, we identified new start sites associated with the majority of exons and with 3' UTRs. These data permit genome-scale identification of tissue-specific promoters and analysis of the cis-acting elements associated with them.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

The mammalian transcriptome harbours shadowy entities that resist classification and analysis. In analogy with pseudogenes, we define pseudo-messenger RNA to be RNA molecules that resemble protein- coding mRNA, but cannot encode full-length proteins owing to disruptions of the reading frame. Using a rigorous computational pipeline, which rules out sequencing errors, we identify 10,679 pseudo - messenger RNAs ( approximately half of which are transposonassociated) among the 102,801 FANTOM3 mouse cDNAs: just over 10% of the FANTOM3 transcriptome. These comprise not only transcribed pseudogenes, but also disrupted splice variants of otherwise protein- coding genes. Some may encode truncated proteins, only a minority of which appear subject to nonsense- mediated decay. The presence of an excess of transcripts whose only disruptions are opal stop codons suggests that there are more selenoproteins than currently estimated. We also describe compensatory frameshifts, where a segment of the gene has changed frame but remains translatable. In summary, we survey a large class of non- standard but potentially functional transcripts that are likely to encode genetic information and effect biological processes in novel ways. Many of these transcripts do not correspond cleanly to any identifiable object in the genome, implying fundamental limits to the goal of annotating all functional elements at the genome sequence level.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

T he international FANTOM consortium aims to produce a comprehensive picture of the mammalian transcriptome, based upon an extensive cDNA collection and functional annotation of full-length enriched cDNAs. The previous dataset, FANTOM(2), comprised 60,770 full- length enriched cDNAs. Functional annotation revealed that this cDNA dataset contained only about half of the estimated number of mouse protein- coding genes, indicating that a number of cDNAs still remained to be collected and identified. To pursue the complete gene catalog that covers all predicted mouse genes, cloning and sequencing of full- length enriched cDNAs has been continued since FANTOM2. In FANTOM3, 42,031 newly isolated cDNAs were subjected to functional annotation, and the annotation of 4,347 FANTOM2 cDNAs was updated. To accomplish accurate functional annotation, we improved our automated annotation pipeline by introducing new coding sequence prediction programs and developed a Web- based annotation interface for simplifying the annotation procedures to reduce manual annotation errors. Automated coding sequence and function prediction was followed with manual curation and review by expert curators. A total of 102,801 full- length enriched mouse cDNAs were annotated. Out of 102,801 transcripts, 56,722 were functionally annotated as protein coding ( including partial or truncated transcripts), providing to our knowledge the greatest current coverage of the mouse proteome by full- length cDNAs. The total number of distinct non- protein- coding transcripts increased to 34,030. The FANTOM3 annotation system, consisting of automated computational prediction, manual curation, and. nal expert curation, facilitated the comprehensive characterization of the mouse transcriptome, and could be applied to the transcriptomes of other species.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Novel, low-abundance microbial species can be easily overlooked in standard polymerase chain reaction (PCR)-based surveys. We used community genomic data obtained without PCR or cultivation to reconstruct DNA fragments bearing unusual 16S ribosomal RNA ( rRNA) and protein-coding genes from organisms belonging to novel archaeal lineages. The organisms are minor components of all biofilms growing in pH 0.5 to 1.5 solutions within the Richmond Mine, California. Probes specific for 16S rRNA showed that the fraction less than 0.45 micrometers in diameter is dominated by these organisms. Transmission electron microscope images revealed that the cells are pleomorphic with unusual folded membrane protrusions and have apparent volumes of < 0.006 cubic micrometer.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Protein coding genes are comprised of protein-coding exons and non-protein-coding introns. The process of splicing involves removal of the introns and joining of the exons to form a mature messenger RNA, which subsequently undergoes translation into polypeptide. The spliceosome is a large, RNA/protein assembly of five small nuclear RNAs as well as over 300 proteins, which catalyzes intron removal and exon ligation. The selection of specific exons for inclusion in the mature messenger RNA is spatiotemporally regulated and results in production of an enormous diversity of polypeptides from a single gene locus. This phenomenon, known as alternative splicing, is regulated, in part, by protein splicing factors, which target the spliceosome to exon/intron boundaries. The first part of my dissertation (Chapters II and III) focuses on the discovery and characterization of the 45 kilodalton FK506 binding protein (FKBP45), which I discovered in the silk moth, Bombyx mori, as a U1 small nuclear RNA binding protein. This protein family binds the immunosuppressants FK506 and rapamycin and contains peptidyl-prolyl cis-trans isomerase activity, which converts polypeptides from cis to trans about a proline residue. This is the first time that an FKBP has been identified in the spliceosome. The second section of my dissertation (Chapters IV, V, VI and VII) is an investigation of the potential role of small nuclear RNA sequence variants in the control of splicing. I identified 46 copies of small nuclear RNAs in the 6X whole genome shotgun of the Bombyx mori p50T strain. These variants may play a role in differential binding of specific proteins that mediate alternative splicing. Along these lines, further investigation of U2 snRNA sequence variants in Bombyx mori demonstrated that some U2 snRNAs preferentially assemble into high molecular weight spliceosomal complexes over others. Expression of snRNA variants may represent another mechanism by which the cell is able to fine tune the splicing process.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Protein coding genes are comprised of protein-coding exons and non-protein-coding introns. The process of splicing involves removal of the introns and joining of the exons to form a mature messenger RNA, which subsequently undergoes translation into polypeptide. The spliceosome is a large, RNA/protein assembly of five small nuclear RNAs as well as over 300 proteins, which catalyzes intron removal and exon ligation. The selection of specific exons for inclusion in the mature messenger RNA is spatio-temporally regulated and results in production of an enormous diversity of polypeptides from a single gene locus. This phenomenon, known as alternative splicing, is regulated, in part, by protein splicing factors, which target the spliceosome to exon/intron boundaries. The first part of my dissertation (Chapters II and III) focuses on the discovery and characterization of the 45 kilodalton FK506 binding protein (FKBP45), which I discovered in the silk moth, Bombyx mori, as a U1 small nuclear RNA binding protein. This protein family binds the immunosuppressants FK506 and rapamycin and contains peptidyl-prolyl cis-trans isomerase activity, which converts polypeptides from cis to trans about a proline residue. This is the first time that an FKBP has been identified in the spliceosome. The second section of my dissertation (Chapters IV, V, VI and VII) is an investigation of the potential role of small nuclear RNA sequence variants in the control of splicing. I identified 46 copies of small nuclear RNAs in the 6X whole genome shotgun of the Bombyx mori p50T strain. These variants may play a role in differential binding of specific proteins that mediate alternative splicing. Along these lines, further investigation of U2 snRNA sequence variants in Bombyx mori demonstrated that some U2 snRNAs preferentially assemble into high molecular weight spliceosomal complexes over others. Expression of snRNA variants may represent another mechanism by which the cell is able to fine tune the splicing process.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Gene regulation is a complex and tightly controlled process that defines cell function in physiological and abnormal states. Programmable gene repression technologies enable loss-of-function studies for dissecting gene regulation mechanisms and represent an exciting avenue for gene therapy. Established and recently developed methods now exist to modulate gene sequence, epigenetic marks, transcriptional activity, and post-transcriptional processes, providing unprecedented genetic control over cell phenotype. Our objective was to apply and develop targeted repression technologies for regenerative medicine, genomics, and gene therapy applications. We used RNA interference to control cell cycle regulation in myogenic differentiation and enhance the proliferative capacity of tissue engineered cartilage constructs. These studies demonstrate how modulation of a single gene can be used to guide cell differentiation for regenerative medicine strategies. RNA-guided gene regulation with the CRISPR/Cas9 system has rapidly expanded the targeted repression repertoire from silencing single protein-coding genes to modulation of genes, promoters, and other distal regulatory elements. In order to facilitate its adaptation for basic research and translational applications, we demonstrated the high degree of specificity for gene targeting, gene silencing, and chromatin modification possible with Cas9 repressors. The specificity and effectiveness of RNA-guided transcriptional repressors for silencing endogenous genes are promising characteristics for mechanistic studies of gene regulation and cell phenotype. Furthermore, our results support the use of Cas9-based repressors as a platform for novel gene therapy strategies. We developed an in vivo AAV-based gene repression system for silencing endogenous genes in a mouse model. Together, these studies demonstrate the utility of gene repression tools for guiding cell phenotype and the potential of the RNA-guided CRISPR/Cas9 platform for applications such as causal studies of gene regulatory mechanisms and gene therapy.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

There is growing evidence that the complexity of higher organisms does not correlate with the ‘complexity’ of the genome (the human genome contains fewer protein coding genes than corn, and many genes are preserved across species). Rather, complexity is associated with the complexity of the pathways and processes whereby the cell utilises the deoxyribonucleic acid molecule, and much else, in the process of phenotype formation. These pro- cesses include the activity of the epigenome, noncoding ribonucleic acids, alternative splicing and post-transla- tional modifications. Not accidentally, all of these pro- cesses appear to be of particular importance for the human brain, the most complex organ in nature. Because these processes can be highly environmentally reactive, they are a key to understanding behavioural plasticity and highlight the importance of the developmental process in explaining behavioural outcomes.

Relevância:

100.00% 100.00%

Publicador:

Resumo:

Os microRNAs (miRNAs) são curtas cadeias de RNA não codificante, com cerca de 18 a 25 nucleotídeos, que regulam os níveis de mRNAs que são produzidos a partir de genes codificantes de proteínas. A descoberta dos miRNAs e a sua subsequente caracterização estrutural e funcional revelou a existência de um novo processo de regulação pós-transcricional da expressão génica em células eucarióticas que afeta uma grande variedade de funções celulares. A senescência acompanha o processo de evelhecimento dos organismos e é manifestada pela perda da capacidade proliferativa das células em resposta a diversos fatores de stress que desencadeiam alterações moleculares específicas. Na última década foram identificados e caracterizados vários miRNAs que participam na regulação do fenótipo da senescência celular, quer através da modulação de vias de sinalização endógenas que controlam a progressão do ciclo celular, quer através da secreção de factores de sinalização. Vários estudos têm também revelado a enorme potencialidade dos miRNAs como biomarcadores e alvos moleculares de novas abordagens terapêuticas. No futuro, é expectável que os avanços científicos possam ser transferidos para a prática clínica com vista a uma efetiva prevenção, vigilância e tratamento do envelhecimento prematuro e de doenças associadas ao envelhecimento.