43 resultados para SPLICEOSOMAL INTRONS
em Université de Lausanne, Switzerland
Resumo:
The ribonucleotide reductase gene tandem bnrdE/bnrdF in SPbeta-related prophages of different Bacillus spp. isolates presents different configurations of intervening sequences, comprising one to three of six non-homologous splicing elements. Insertion sites of group I introns and intein DNA are clustered in three relatively short segments encoding functionally important domains of the ribonucleotide reductase. Comparison of the bnrdE homologs reveals mutual exclusion of a group I intron and an intein coding sequence flanking the codon that specifies a conserved cysteine. In vivo splicing was demonstrated for all introns. However, for two of them a part of the mRNA precursor molecules remains unspliced. Intergenic bnrdE-bnrdF regions are unexpectedly long, comprising between 238 and 541 nt. The longest encodes a putative polypeptide related to HNH homing endonucleases.
Resumo:
Retroposed genes (retrogenes) originate via the reverse transcription of mature messenger RNAs from parental source genes and are therefore usually devoid of introns. Here, we characterize a particular set of mammalian retrogenes that acquired introns upon their emergence and thus represent rare cases of intron gain in mammals. We find that although a few retrogenes evolved introns in their coding or 3' untranslated regions (untranslated region, UTR), most introns originated together with untranslated exons in the 5' flanking regions of the retrogene insertion site. They emerged either de novo or through fusions with 5' UTR exons of host genes into which the retrogenes inserted. Generally, retrogenes with introns display high transcription levels and show broader spatial expression patterns than other retrogenes. Our experimental expression analyses of individual intron-containing retrogenes show that 5' UTR introns may indeed promote higher expression levels, at least in part through encoded regulatory elements. By contrast, 3' UTR introns may lead to downregulation of expression levels via nonsense-mediated decay mechanisms. Notably, the majority of retrogenes with introns in their 5' flanks depend on distant, sometimes bidirectional CpG dinucleotide-enriched promoters for their expression that may be recruited from other genes in the genomic vicinity. We thus propose a scenario where the acquisition of new 5' exon-intron structures was directly linked to the recruitment of distant promoters by these retrogenes, a process potentially facilitated by the presence of proto-splice sites in the genomic vicinity of retrogene insertion sites. Thus, the primary role and selective benefit of new 5' introns (and UTR exons) was probably initially to span the often substantial distances to potent CpG promoters driving retrogene transcription. Later in evolution, these introns then obtained additional regulatory roles in fine tuning retrogene expression levels. Our study provides novel insights regarding mechanisms underlying the origin of new introns, the evolutionary relevance of intron gain, and the origin of new gene promoters.
Resumo:
Proteins PRPF31, PRPF3 and PRPF8 (RP-PRPFs) are ubiquitously expressed components of the spliceosome, a macromolecular complex that processes nearly all pre-mRNAs. Although these spliceosomal proteins are conserved in eukaryotes and are essential for survival, heterozygous mutations in human RP-PRPF genes lead to retinitis pigmentosa, a hereditary disease restricted to the eye. Using cells from patients with 10 different mutations, we show that all clinically relevant RP-PRPF defects affect the stoichiometry of spliceosomal small nuclear RNAs (snRNAs), the protein composition of tri-small nuclear ribonucleoproteins and the kinetics of spliceosome assembly. These mutations cause inefficient splicing in vitro and affect constitutive splicing ex-vivo by impairing the removal of at least 9% of endogenously expressed introns. Alternative splicing choices are also affected when RP-PRPF defects are present. Furthermore, we show that the steady-state levels of snRNAs and processed pre-mRNAs are highest in the retina, indicating a particularly elevated splicing activity. Our results suggest a role for PRPFs defects in the etiology of PRPF-linked retinitis pigmentosa, which appears to be a truly systemic splicing disease. Although these mutations cause widespread and important splicing defects, they are likely tolerated by the majority of human tissues but are critical for retinal cell survival.
Resumo:
Cryptic exons or pseudoexons are typically activated by point mutations that create GT or AG dinucleotides of new 5' or 3' splice sites in introns, often in repetitive elements. Here we describe two cases of tetrahydrobiopterin deficiency caused by mutations improving the branch point sequence and polypyrimidine tracts of repeat-containing pseudoexons in the PTS gene. In the first case, we demonstrate a novel pathway of antisense Alu exonization, resulting from an intronic deletion that removed the poly(T)-tail of antisense AluSq. The deletion brought a favorable branch point sequence within proximity of the pseudoexon 3' splice site and removed an upstream AG dinucleotide required for the 3' splice site repression on normal alleles. New Alu exons can thus arise in the absence of poly(T)-tails that facilitated inclusion of most transposed elements in mRNAs by serving as polypyrimidine tracts, highlighting extraordinary flexibility of Alu repeats in shaping intron-exon structure. In the other case, a PTS pseudoexon was activated by an A>T substitution 9 nt upstream of its 3' splice site in a LINE-2 sequence, providing the first example of a disease-causing exonization of the most ancient interspersed repeat. These observations expand the spectrum of mutational mechanisms that introduce repetitive sequences in mature transcripts and illustrate the importance of intronic mutations in alternative splicing and phenotypic variability of hereditary disorders.
Resumo:
Electron microscopic analysis of heteroduplexes between the most distantly related Xenopus vitellogenin genes (A genes X B genes) has revealed the distribution of homologous regions that have been preferentially conserved after the duplication events that gave rise to the multigene family in Xenopus laevis. DNA sequence analysis was limited to the region downstream of the transcription initiation site of the Xenopus genes A1, B1 and B2 and a comparison with the Xenopus A2 and the major chicken vitellogenin gene is presented. Within the coding regions of the first three exons, nucleotide substitutions resulting in amino acid changes accumulate at a rate similar to that observed in globin genes. This suggests that the duplication event which led to the formation of the A and B ancestral genes in Xenopus laevis occurred about 150 million years ago. Homologous exons of the A1-A2 and B1-B2 gene pairs, which formed about 30 million years ago, show a quite similar sequence divergence. In contrast, A1-A2 homologous introns seem to have evolved much faster than their B1-B2 counterparts.
Resumo:
Lancelets ('amphioxus') are the modern survivors of an ancient chordate lineage, with a fossil record dating back to the Cambrian period. Here we describe the structure and gene content of the highly polymorphic approximately 520-megabase genome of the Florida lancelet Branchiostoma floridae, and analyse it in the context of chordate evolution. Whole-genome comparisons illuminate the murky relationships among the three chordate groups (tunicates, lancelets and vertebrates), and allow not only reconstruction of the gene complement of the last common chordate ancestor but also partial reconstruction of its genomic organization, as well as a description of two genome-wide duplications and subsequent reorganizations in the vertebrate lineage. These genome-scale events shaped the vertebrate genome and provided additional genetic variation for exploitation during vertebrate evolution.
Resumo:
BACKGROUND: Despite the continuous production of genome sequence for a number of organisms, reliable, comprehensive, and cost effective gene prediction remains problematic. This is particularly true for genomes for which there is not a large collection of known gene sequences, such as the recently published chicken genome. We used the chicken sequence to test comparative and homology-based gene-finding methods followed by experimental validation as an effective genome annotation method. RESULTS: We performed experimental evaluation by RT-PCR of three different computational gene finders, Ensembl, SGP2 and TWINSCAN, applied to the chicken genome. A Venn diagram was computed and each component of it was evaluated. The results showed that de novo comparative methods can identify up to about 700 chicken genes with no previous evidence of expression, and can correctly extend about 40% of homology-based predictions at the 5' end. CONCLUSIONS: De novo comparative gene prediction followed by experimental verification is effective at enhancing the annotation of the newly sequenced genomes provided by standard homology-based methods.
Resumo:
Islet-brain 1 (IB1), a regulator of the pancreatic beta-cell function in the rat, is homologous to JIP-1, a murine inhibitor of c-Jun amino-terminal kinase (JNK). Whether IB1 and JIP-1 are present in humans was not known. We report the sequence of the 2133-bp human IB1 cDNA, the expression, structure, and fine-mapping of the human IB1 gene, and the characterization of an IB1 pseudogene. Human IB1 is 94% identical to rat IB1. The tissue-specific expression of IB1 in human is similar to that observed in rodent. The IB1 gene contains 12 exons and maps to chromosome 11 (11p11.2-p12), a region that is deleted in DEFECT-11 syndrome. Apart from an IB1 pseudogene on chromosome 17 (17q21), no additional IB1-related gene was found in the human genome. Our data indicate that the sequence and expression pattern of IB1 are highly conserved between rodent and human and provide the necessary tools to investigate whether IB1 is involved in human diseases.
Resumo:
MicroRNAs (miRNAs) are major post-transcriptional regulators of gene expression, yet their origins and functional evolution in mammals remain little understood due to the lack of appropriate comparative data. Using RNA sequencing, we have generated extensive and comparable miRNA data for five organs in six species that represent all main mammalian lineages and birds (the evolutionary outgroup) with the aim to unravel the evolution of mammalian miRNAs. Our analyses reveal an overall expansion of miRNA repertoires in mammals, with threefold accelerated birth rates of miRNA families in placentals and marsupials, facilitated by the de novo emergence of miRNAs in host gene introns. Generally, our analyses suggest a high rate of miRNA family turnover in mammals with many newly emerged miRNA families being lost soon after their formation. Selectively preserved mammalian miRNA families gradually evolved higher expression levels, as well as altered mature sequences and target gene repertoires, and were apparently mainly recruited to exert regulatory functions in nervous tissues. However, miRNAs that originated on the X chromosome evolved high expression levels and potentially diverse functions during spermatogenesis, including meiosis, through selectively driven duplication-divergence processes. Overall, our study thus provides detailed insights into the birth and evolution of mammalian miRNA genes and the associated selective forces.
Resumo:
BACKGROUND: The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes do not differ in number in extant teleost fishes despite their additional genome duplication from the genomic situation in mammals, but they are distributed over twice as many paralogous regions in fish genomes. RESULTS: We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as it had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seems to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motives of the genes belonging to the ParaHox paralogons. CONCLUSION: There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular - but possibly clusters of genes more generally - might be linked to the presence of promoter, enhancer or inhibitor motifs that serve to regulate more than just one gene. Therefore, deletions, inversions or relocations of individual genes could destroy the regulation of the clustered genes in this region. The existence of such a regulation network might explain the evolutionary conservation of gene order and orientation over the course of hundreds of millions of years of vertebrate evolution. Another possible explanation for the highly conserved gene order might be the existence of a regulator not located immediately next to its corresponding gene but further away since a relocation or inversion would possibly interrupt this interaction. Different ParaHox clusters were found to have experienced differential gene loss in teleosts. Yet the complete set of these homeobox genes was maintained, albeit distributed over almost twice the number of chromosomes. Selection due to dosage effects and/or stoichiometric disturbance might act more strongly to maintain a modal number of homeobox genes (and possibly transcription factors more generally) per genome, yet permit the accumulation of other (non regulatory) genes associated with these homeobox gene clusters.
Resumo:
AIMS: To identify the molecular basis for a low CYP1A2 metabolic status, as determined by a caffeine phenotyping test, in a 71-year-old, nonsmoking, Caucasian woman who presented with very high clozapine concentrations despite being administered a standard dose of the drug. METHODS: The nucleotide sequence of the 7 exons, exon-intron boundaries and 5'-flanking region of the CYP1A2 gene was analysed by direct sequencing. RESULTS: Only one heterozygous point mutation was identified in the donor splice site of intron 6 (3534G > A) of CYP1A2. This mutation could cause abnormal RNA splicing and therefore lead to a truncated nonfunctional enzyme. No other carrier of this mutation was identified in a population of 100 unrelated healthy Caucasians. CONCLUSIONS: This is the first report of a splice-site mutation affecting the CYP1A2 gene. This polymorphism is a likely explanation for the low CYP1A2 activity associated with high clozapine concentrations in this patient.
Resumo:
The inv(16) and related t(16;16) are found in 10% of all cases with de novo acute myeloid leukemia. In these rearrangements the core binding factor beta (CBFB) gene on 16q22 is fused to the smooth muscle myosin heavy chain gene (MYH11) on 16p13. To gain insight into the mechanisms causing the inv(16) we have analysed 24 genomic CBFB-MYH11 breakpoints. All breakpoints in CBFB are located in a 15-Kb intron. More than 50% of the sequenced 6.2 Kb of this intron consists of human repetitive elements. Twenty-one of the 24 breakpoints in MYH11 are located in a 370-bp intron. The remaining three breakpoints in MYH11 are located more upstream. The localization of three breakpoints adjacent to a V(D)J recombinase signal sequence in MYH11 suggests a V(D)J recombinase-mediated rearrangement in these cases. V(D)J recombinase-associated characteristics (small nucleotide deletions and insertions of random nucleotides) were detected in six other cases. CBFB and MYH11 duplications were detected in four of six cases tested.
Resumo:
Hypertension is the first single modifiable cause of disease burden worldwide. Genes encoding proteins that are involved in the metabolism (CYP3A5) and transport (ABCB1) of drugs and hormones might contribute to blood pressure control in humans. Indeed, recent data have suggested that CYP3A5 and ABCB1 gene polymorphisms are associated with blood pressure in the rat as well as in humans. Interestingly, the effects of these genes on blood pressure appear to be modified by dietary salt intake. This review summarizes what is known regarding the relationships of the ABCB1 and CYP3A5 genes with blood pressure, and discusses the potential underlying mechanisms of the association. If the role of these genes in blood pressure control is confirmed in other populations and other ethnic groups, these findings would point toward a new pathway for blood pressure control in humans.
Resumo:
To elucidate the evolutionary history of snow voles, genus Chionomys, we studied the phylogeography of Chionomysnivalis across its range and investigated its relationships with two congeneric species, Chionomysgud and Chionomysroberti, using independent molecular markers. Analyses were based on mitochondrial (~940 bp cyt b) and Y-chromosomal (~2020 bp from three introns) genetic variation. Our data provide conclusive evidence for a Caucasian and Middle Eastern origin for the three species and a subsequent westward expansion of C.nivalis. In addition, we discuss the taxonomic status of the genus Chionomys in relation to the genus Microtus.
Resumo:
It is often supposed that a protein's rate of evolution and its amino acid content are determined by the function and anatomy of the protein. Here we examine an alternative possibility, namely that the requirement to specify in the unprocessed RNA, in the vicinity of intron-exon boundaries, information necessary for removal of introns (e.g., exonic splice enhancers) affects both amino acid usage and rates of protein evolution. We find that the majority of amino acids show skewed usage near intron-exon boundaries, and that differences in the trends for the 2-fold and 4-fold blocks of both arginine and leucine show this to be owing to effects mediated at the nucleotide level. More specifically, there is a robust relationship between the extent to which an amino acid is preferred/avoided near boundaries and its enrichment/paucity in splice enhancers. As might then be expected, the rate of evolution is lowest near intron-exon boundaries, at least in part owing to splice enhancers, such that domains flanking intron-exon junctions evolve on average at under half the rate of exon centres from the same gene. In contrast, the rate of evolution of intronless retrogenes is highest near the domains where intron-exon junctions previously resided. The proportion of sequence near intron-exon boundaries is one of the stronger predictors of a protein's rate of evolution in mammals yet described. We conclude that after intron insertion selection favours modification of amino acid content near intron-exon junctions, so as to enable efficient intron removal, these changes then being subject to strong purifying selection even if nonoptimal for protein function. Thus there exists a strong force operating on protein evolution in mammals that is not explained directly in terms of the biology of the protein.