120 resultados para PROTO-SPLICE SITES
em Université de Lausanne, Switzerland
Resumo:
Retroposed genes (retrogenes) originate via the reverse transcription of mature messenger RNAs from parental source genes and are therefore usually devoid of introns. Here, we characterize a particular set of mammalian retrogenes that acquired introns upon their emergence and thus represent rare cases of intron gain in mammals. We find that although a few retrogenes evolved introns in their coding or 3' untranslated regions (untranslated region, UTR), most introns originated together with untranslated exons in the 5' flanking regions of the retrogene insertion site. They emerged either de novo or through fusions with 5' UTR exons of host genes into which the retrogenes inserted. Generally, retrogenes with introns display high transcription levels and show broader spatial expression patterns than other retrogenes. Our experimental expression analyses of individual intron-containing retrogenes show that 5' UTR introns may indeed promote higher expression levels, at least in part through encoded regulatory elements. By contrast, 3' UTR introns may lead to downregulation of expression levels via nonsense-mediated decay mechanisms. Notably, the majority of retrogenes with introns in their 5' flanks depend on distant, sometimes bidirectional CpG dinucleotide-enriched promoters for their expression that may be recruited from other genes in the genomic vicinity. We thus propose a scenario where the acquisition of new 5' exon-intron structures was directly linked to the recruitment of distant promoters by these retrogenes, a process potentially facilitated by the presence of proto-splice sites in the genomic vicinity of retrogene insertion sites. Thus, the primary role and selective benefit of new 5' introns (and UTR exons) was probably initially to span the often substantial distances to potent CpG promoters driving retrogene transcription. Later in evolution, these introns then obtained additional regulatory roles in fine tuning retrogene expression levels. Our study provides novel insights regarding mechanisms underlying the origin of new introns, the evolutionary relevance of intron gain, and the origin of new gene promoters.
Resumo:
AIMS: To identify the molecular basis for a low CYP1A2 metabolic status, as determined by a caffeine phenotyping test, in a 71-year-old, nonsmoking, Caucasian woman who presented with very high clozapine concentrations despite being administered a standard dose of the drug. METHODS: The nucleotide sequence of the 7 exons, exon-intron boundaries and 5'-flanking region of the CYP1A2 gene was analysed by direct sequencing. RESULTS: Only one heterozygous point mutation was identified in the donor splice site of intron 6 (3534G > A) of CYP1A2. This mutation could cause abnormal RNA splicing and therefore lead to a truncated nonfunctional enzyme. No other carrier of this mutation was identified in a population of 100 unrelated healthy Caucasians. CONCLUSIONS: This is the first report of a splice-site mutation affecting the CYP1A2 gene. This polymorphism is a likely explanation for the low CYP1A2 activity associated with high clozapine concentrations in this patient.
Resumo:
Cryptic exons or pseudoexons are typically activated by point mutations that create GT or AG dinucleotides of new 5' or 3' splice sites in introns, often in repetitive elements. Here we describe two cases of tetrahydrobiopterin deficiency caused by mutations improving the branch point sequence and polypyrimidine tracts of repeat-containing pseudoexons in the PTS gene. In the first case, we demonstrate a novel pathway of antisense Alu exonization, resulting from an intronic deletion that removed the poly(T)-tail of antisense AluSq. The deletion brought a favorable branch point sequence within proximity of the pseudoexon 3' splice site and removed an upstream AG dinucleotide required for the 3' splice site repression on normal alleles. New Alu exons can thus arise in the absence of poly(T)-tails that facilitated inclusion of most transposed elements in mRNAs by serving as polypyrimidine tracts, highlighting extraordinary flexibility of Alu repeats in shaping intron-exon structure. In the other case, a PTS pseudoexon was activated by an A>T substitution 9 nt upstream of its 3' splice site in a LINE-2 sequence, providing the first example of a disease-causing exonization of the most ancient interspersed repeat. These observations expand the spectrum of mutational mechanisms that introduce repetitive sequences in mature transcripts and illustrate the importance of intronic mutations in alternative splicing and phenotypic variability of hereditary disorders.
Resumo:
It is often supposed that a protein's rate of evolution and its amino acid content are determined by the function and anatomy of the protein. Here we examine an alternative possibility, namely that the requirement to specify in the unprocessed RNA, in the vicinity of intron-exon boundaries, information necessary for removal of introns (e.g., exonic splice enhancers) affects both amino acid usage and rates of protein evolution. We find that the majority of amino acids show skewed usage near intron-exon boundaries, and that differences in the trends for the 2-fold and 4-fold blocks of both arginine and leucine show this to be owing to effects mediated at the nucleotide level. More specifically, there is a robust relationship between the extent to which an amino acid is preferred/avoided near boundaries and its enrichment/paucity in splice enhancers. As might then be expected, the rate of evolution is lowest near intron-exon boundaries, at least in part owing to splice enhancers, such that domains flanking intron-exon junctions evolve on average at under half the rate of exon centres from the same gene. In contrast, the rate of evolution of intronless retrogenes is highest near the domains where intron-exon junctions previously resided. The proportion of sequence near intron-exon boundaries is one of the stronger predictors of a protein's rate of evolution in mammals yet described. We conclude that after intron insertion selection favours modification of amino acid content near intron-exon junctions, so as to enable efficient intron removal, these changes then being subject to strong purifying selection even if nonoptimal for protein function. Thus there exists a strong force operating on protein evolution in mammals that is not explained directly in terms of the biology of the protein.
Resumo:
We evaluated 25 protocol variants of 14 independent computational methods for exon identification, transcript reconstruction and expression-level quantification from RNA-seq data. Our results show that most algorithms are able to identify discrete transcript components with high success rates but that assembly of complete isoform structures poses a major challenge even when all constituent elements are identified. Expression-level estimates also varied widely across methods, even when based on similar transcript models. Consequently, the complexity of higher eukaryotic genomes imposes severe limitations on transcript recall and splice product discrimination that are likely to remain limiting factors for the analysis of current-generation RNA-seq data.
Resumo:
Purpose: Retinitis pigmentosa (RP; MIM 268000) is a hereditary disease characterized by poor night vision and progressive loss of photoreceptors, eventually leading to blindness. This degenerative process primarily affects peripheral vision due to the loss of rods. Autosomal recessive RP (arRP) is clinically and genetically heterogeneous. It has been associated with mutations in different genes, including CRB1 (Crumbs homolog 1). The aim of this study was to determine the causative gene in a Tunisian patient with arRP born to non consanguineous parents.Methods: Four accessible family members were included. They underwent full ophthalmic examination with best corrected Snellen visual acuity, fundus photography and fluoroangiography. Haplotype analyses were used to test linkage in the family to 20 arRP loci, including ABCA4, LRAT, USH2A, RP29, CERKL, CNGA1, CNGB1, CRB1, EYS, RP28, MERTK, NR2E3, PDE6A, PDE6B, RGR, RHO, RLBP1, TULP1. All exons and intron-exon junctions of candidate genes not excluded by haplotype analysis were PCR amplified and directly sequenced.Results: A 39 aged affected member was individualized. Best corrected visual acuity was OR: 20/63, OS: 20/80. Visual loss began at the third decade. Funduscopic examination and FA revealed typical advanced RP changes with bone spicule-shaped pigment deposits in the posterior pole and the mild periphery along with retinal atrophy, narrowing of the vessels and waxy optic discs. Haplotypes analysis revealed homozygosity with microsatellites markers D1S412 and D1S413 on chromosome 1q31.3. These markers flanked the CRB1 gene. Our results excluded linkage of all the other arRP loci/ genes tested. Sequencing of the 12 coding exons and splice sites of CRB1 gene disclosed a homozygous missense mutation in exon 7 at nucleotide c.(2291 G>A), resulting in an Arg to Hist substitution (p.R764H).Conclusions: R764H is a novel mutation associated with CRB1-related arRP. Previously, an R764C mutation was observed. Extending the mutation spectrum of CRB1 with additional families is important for genotype-phenotype correlations.
Resumo:
Approximately 520 Wilson disease-causing mutations in the ATP7B gene have been described to date. In this study we report DNA and RNA analyses carried out for molecular characterization of a consensus sequence splicing mutation found in homozygosity in a Swiss Wilson disease patient. RNA analysis of 1946 +6 T→C in both the peripheral lymphoblasts and liver resulted in the production in the propositus of only an alternative transcript lacking exons 6, 7, and 8 resulting most likely in alterations of cell biochemistry and disease. The patient presents an early form of severe hepatic disease characterized by hepatosplenomegaly, reduced hepatic function, anemia and thrombocytopenia indicating that 1946 +6 T→C is a severe mutation. Since identical results were obtained from both peripheral lymphoblasts and liver they also suggest that RNA studies of illegitimate transcripts can be safely used for molecular characterization of ATP7B splicing mutations, thus improving genetic counseling and diagnosis of Wilson disease. Moreover these studies, contribute to reveal the exact molecular mechanisms producing Wilson disease.
Resumo:
Thyroid hormones are involved in the regulation of growth and metabolism in all vertebrates. Transthyretin is one of the extracellular proteins with high affinity for thyroid hormones which determine the partitioning of these hormones between extracellular compartments and intracellular lipids. During vertebrate evolution, both the tissue pattern of expression and the structure of the gene for transthyretin underwent characteristic changes. The purpose of this study was to characterize the position of Insectivora in the evolution of transthyretin in eutherians, a subclass of Mammalia. Transthyretin was identified by thyroxine binding and Western analysis in the blood of adult shrews, hedgehogs, and moles. Transthyretin is synthesized in the liver and secreted into the bloodstream, similar to the situation for other adult eutherians, birds, and diprotodont marsupials, but different from that for adult fish, amphibians, reptiles, monotremes, and Australian polyprotodont marsupials. For the characterization of the structure of the gene and the processing of mRNA for transthyretin, cDNA libraries were prepared from RNA from hedgehog and shrew livers, and full-length cDNA clones were isolated and sequenced. Sections of genomic DNA in the regions coding for the splice sites between exons 1 and 2 were synthesized by polymerase chain reaction and sequenced. The location of splicing was deduced from comparison of genomic with cDNA nucleotide sequences. Changes in the nucleotide sequence of the transthyretin gene during evolution are most pronounced in the region coding for the N-terminal region of the protein. Both the derived overall amino sequences and the N-terminal regions of the transthyretins in Insectivora were found to be very similar to those in other eutherians but differed from those found in marsupials, birds, reptiles, amphibians, and fish. Also, the pattern of transthyretin precursor mRNA splicing in Insectivora was more similar to that in other eutherians than to that in marsupials, reptiles, and birds. Thus, in contrast to the marsupials, with a different pattern of transthyretin gene expression in the evolutionarily "older" polyprotodonts compared with the evolutionarily "younger" diprotodonts, no separate lineages of transthyretin evolution could be identified in eutherians. We conclude that transthyretin gene expression in the liver of adult eutherians probably appeared before the branching of the lineages leading to modern eutherian species.
Resumo:
PURPOSE: Retinitis pigmentosa (RP; MIM 268000) is a hereditary disease characterized by poor night vision and progressive loss of photoreceptors, eventually leading to blindness. This degenerative process primarily affects peripheral vision due to the loss of rods. Autosomal recessive RP (arRP) is clinically and genetically heterogeneous. It has been associated with mutations in different genes, including CRB1 (crumbs homolog 1). The aim of this study was to determine the causative gene in a Tunisian patient with arRP born to non-consanguineous parents. METHODS: Four accessible family members were included. They underwent full ophthalmic examination with best-corrected Snellen visual acuity, fundus photography and fluorescein angiography. Haplotype analysis was used to evaluate homozygosity in the family to 20 arRP loci. All exons and intron-exon junctions of candidate genes not excluded by haplotype analysis were PCR amplified and directly sequenced. RESULTS: The proband was a 43-year-old female patient. Best-corrected visual acuity was 20/63 (right eye) and 20/80 (left eye). Visual loss began during the third decade. Funduscopic examination and fluorescein angiography revealed typical advanced RP changes with bone spicule-like pigment deposits in the posterior pole and the midperiphery along with retinal atrophy, narrowing of the vessels, and waxy optic discs. Haplotype analysis revealed homozygosity with microsatellite markers D1S412 and D1S413 on chromosome 1q31.3. These markers flanked CRB1. Our results excluded linkage of all the other arRP loci/genes tested. Sequencing of the 12 coding exons and splice sites of CRB1 disclosed a homozygous missense mutation in exon 7 at nucleotide c. 2291G>A, resulting in an arginine to histidine substitution (p.R764H). CONCLUSIONS: R764H is a novel mutation associated with CRB1-related arRP. Previously, an R764C mutation was reported. Extending the mutation spectrum of CRB1 with additional families is important for genotype-phenotype correlations and characterization of the scope of mutation.
Resumo:
Analyzing the type and frequency of patient-specific mutations that give rise to Duchenne muscular dystrophy (DMD) is an invaluable tool for diagnostics, basic scientific research, trial planning, and improved clinical care. Locus-specific databases allow for the collection, organization, storage, and analysis of genetic variants of disease. Here, we describe the development and analysis of the TREAT-NMD DMD Global database (http://umd.be/TREAT_DMD/). We analyzed genetic data for 7,149 DMD mutations held within the database. A total of 5,682 large mutations were observed (80% of total mutations), of which 4,894 (86%) were deletions (1 exon or larger) and 784 (14%) were duplications (1 exon or larger). There were 1,445 small mutations (smaller than 1 exon, 20% of all mutations), of which 358 (25%) were small deletions and 132 (9%) small insertions and 199 (14%) affected the splice sites. Point mutations totalled 756 (52% of small mutations) with 726 (50%) nonsense mutations and 30 (2%) missense mutations. Finally, 22 (0.3%) mid-intronic mutations were observed. In addition, mutations were identified within the database that would potentially benefit from novel genetic therapies for DMD including stop codon read-through therapies (10% of total mutations) and exon skipping therapy (80% of deletions and 55% of total mutations).
Resumo:
We report the largest international study on Glanzmann thrombasthenia (GT), an inherited bleeding disorder where defects of the ITGA2B and ITGB3 genes cause quantitative or qualitative defects of the αIIbβ3 integrin, a key mediator of platelet aggregation. Sequencing of the coding regions and splice sites of both genes in members of 76 affected families identified 78 genetic variants (55 novel) suspected to cause GT. Four large deletions or duplications were found by quantitative real-time PCR. Families with mutations in either gene were indistinguishable in terms of bleeding severity that varied even among siblings. Families were grouped into type I and the rarer type II or variant forms with residual αIIbβ3 expression. Variant forms helped identify genes encoding proteins mediating integrin activation. Splicing defects and stop codons were common for both ITGA2B and ITGB3 and essentially led to a reduced or absent αIIbβ3 expression; included was a heterozygous c.1440-13_c.1440-1del in intron 14 of ITGA2B causing exon skipping in seven unrelated families. Molecular modeling revealed how many missense mutations induced subtle changes in αIIb and β3 domain structure across both subunits, thereby interfering with integrin maturation and/or function. Our study extends knowledge of GT and the pathophysiology of an integrin.
Resumo:
Alternative splicing produces multiple isoforms from the same gene, thus increasing the number of transcripts of the species. Alternative splicing is a virtually ubiquitous mechanism in eukaryotes, for example more than 90% of protein-coding genes in human are alternatively spliced. Recent evolutionary studies showed that alternative splicing is a fast evolving and highly species- specific mechanism. The rapid evolution of alternative splicing was considered as a contribution to the phenotypic diversity between species. However, the function of many isoforms produced by alternative splicing remains unclear and they might be the result of noisy splicing. Thus, the functional relevance of alternative splicing and the evolutionary mechanisms of its rapid divergence among species are still poorly understood. During my thesis, I performed a large-scale analysis of the regulatory mechanisms that drive the rapid evolution of alternative splicing. To study the evolution of alternative splicing regulatory mechanisms, I used an extensive RNA-sequencing dataset comprising 12 tetrapod species (human, chimpanzee and bonobo, gorilla, orangutan, macaque, marmoset, mouse, opossum, platypus, chicken and frog) and 8 tissues (cerebellum, brain, heart, kidney, liver, testis, placenta and ovary). To identify the catalogue of alternative splicing eis-acting regulatory elements in the different tetrapod species, I used a previously defined computational approach. This approach is a statistical analysis of exons/introns and splice sites composition and relies on a principle of compensation between splice sites strength and the presence of additional regulators. With an evolutionary comparative analysis of the exonic eis-acting regulators, I showed that these regulatory elements are generally shared among primates and more conserved than non-regulatory elements. In addition, I showed that the usage of these regulatory elements is also more conserved than expected by chance. In addition to the identification of species- specific eis-acting regulators, these results may explain the rapid evolution of alternative splicing. I also developed a new approach based on evolutionary sequence changes and corresponding alternative splicing changes to identify potential splicing eis-acting regulators in primates. The identification of lineage-specific substitutions and corresponding lineage-specific alternative splicing changes, allowed me to annotate the genomic sequences that might have played a role in the alternative splicing pattern differences among primates. Finally, I showed that the identified splicing eis-acting regulator datasets are enriched in human disease-causing mutations, thus confirming their biological relevance.
Resumo:
BACKGROUND AND PURPOSE: Transgenic mice overexpressing Notch2 in the uvea exhibit a hyperplastic ciliary body leading to increased IOP and glaucoma. The aim of this study was to investigate the possible presence of NOTCH2 variants in patients with primary open-angle glaucoma (POAG). METHODS: We screened DNA samples from 130 patients with POAG for NOTCH2 variants by denaturing high-performance liquid chromatography after PCR amplification and validated our data by direct Sanger sequencing. RESULTS: No mutations were observed in the coding regions of NOTCH2 or in the splice sites. 19 known SNPs (single nucleotide polymorphisms) were detected. An SNP located in intron 24, c.[4005+45A>G], was seen in 28.5% of the patients (37/130 patients). As this SNP is reported to have a minor allele frequency of 7% in the 1000 genomes database, it could be associated with POAG. However, we evaluated its frequency in an ethnic-matched control group of 96 subjects unaffected by POAG and observed a frequency of 29%, indicating that it was not related to POAG. CONCLUSION: NOTCH2 seemed to be a good candidate for POAG as it is expressed in the anterior segment in the human eye. However, mutational analysis did not show any causative mutation. This study also shows that proper ethnic-matched control groups are essential in association studies and that values given in databases are sometimes misleading.
Resumo:
This report presents systematic empirical annotation of transcript products from 399 annotated protein-coding loci across the 1% of the human genome targeted by the Encyclopedia of DNA elements (ENCODE) pilot project using a combination of 5' rapid amplification of cDNA ends (RACE) and high-density resolution tiling arrays. We identified previously unannotated and often tissue- or cell-line-specific transcribed fragments (RACEfrags), both 5' distal to the annotated 5' terminus and internal to the annotated gene bounds for the vast majority (81.5%) of the tested genes. Half of the distal RACEfrags span large segments of genomic sequences away from the main portion of the coding transcript and often overlap with the upstream-annotated gene(s). Notably, at least 20% of the resultant novel transcripts have changes in their open reading frames (ORFs), most of them fusing ORFs of adjacent transcripts. A significant fraction of distal RACEfrags show expression levels comparable to those of known exons of the same locus, suggesting that they are not part of very minority splice forms. These results have significant implications concerning (1) our current understanding of the architecture of protein-coding genes; (2) our views on locations of regulatory regions in the genome; and (3) the interpretation of sequence polymorphisms mapping to regions hitherto considered to be "noncoding," ultimately relating to the identification of disease-related sequence alterations.
Resumo:
Variation in cellular gene expression levels has been shown to be inherited. Expression is controlled at transcriptional and post-transcriptional levels. Internal ribosome entry sites (IRES) are used by viruses to bypass inhibition of cap-dependent translation, and by eukaryotic cells to control translation under conditions when protein synthesis is inhibited. We aimed at identifying genomic determinants of variability in IRES-mediated translation of viral [Encephalomyocarditis virus (EMCV)] and cellular IRES [X-linked inhibitor-of-apoptosis (XIAP) and c-myc]. Bicistronic lentiviral constructs expressing two fluorescent reporters were used to transduce laboratory and B lymphoblastoid cell lines [15 CEPH pedigrees (n = 205) and 50 unrelated individuals]. IRES efficiency varied according to cell type and among individuals. Control of IRES activity has a significant genetic component (h(2) of 0.47 and 0.36 for EMCV and XIAP, respectively). Quantitative linkage analysis identified a suggestive locus (LOD 2.35) on chromosome 18q21.2, and genome-wide association analysis revealed of a cluster of SNPs on chromosome 3, intronic to the FHIT gene, marginally associated (P = 5.9E-7) with XIAP IRES function. This study illustrates the in vitro generation of intermediate phenotypes by using cell lines for the evaluation of genetic determinants of control of elements such as IRES.