238 results for INTRONS


Relevance: 10.00%

Abstract:

Skipping of internal exons during removal of introns from pre-mRNA must be avoided for proper expression of most eukaryotic genes. Despite significant understanding of the mechanics of intron removal, mechanisms that ensure inclusion of internal exons in multi-intron pre-mRNAs remain mysterious. Using a natural two-intron yeast gene, we have identified distinct RNA–RNA complementarities within each intron that prevent exon skipping and ensure inclusion of internal exons. We show that these complementarities are positioned to act as intron identity elements, bringing together only the appropriate 5′ splice sites and branchpoints. Destroying either intron self-complementarity allows exon skipping to occur, and restoring the complementarity using compensatory mutations rescues exon inclusion, indicating that the elements act through formation of RNA secondary structure. Introducing new pairing potential between regions near the 5′ splice site of intron 1 and the branchpoint of intron 2 dramatically enhances exon skipping. Similar elements identified in single intron yeast genes contribute to splicing efficiency. Our results illustrate how intron secondary structure serves to coordinate splice site pairing and enforce exon inclusion. We suggest that similar elements in vertebrate genes could assist in the splicing of very large introns and in the evolution of alternative splicing.

Relevance: 10.00%

Abstract:

The efficient expression of therapeutic genes in target cells or tissues is an important component of efficient and safe gene therapy. Utilizing regulatory elements from the human cytokeratin 18 (K18) gene, including 5′ genomic sequences and one of its introns, we have developed a novel expression cassette that can efficiently express reporter genes, as well as the human cystic fibrosis transmembrane conductance regulator (CFTR) gene, in cultured lung epithelial cells. CFTR transcripts expressed from the native K18 enhancer/promoter include two alternative splicing products, due to the activation of two cryptic splice sites in the CFTR coding region. Modification of the K18 intron and CFTR cDNA sequences eliminated the cryptic splice sites without changing the CFTR amino acid sequence, and led to enhanced CFTR mRNA and protein expression as well as biological function. Transgenic expression analysis in mice showed that the modified expression cassette can direct efficient and epithelium-specific expression of the Escherichia coli LacZ gene in the airways of fetal lungs, with no detectable expression in lung fibroblasts or endothelial cells. This is the first expression cassette which selectively directs lung transgene expression for CFTR gene therapy to airway epithelia.

Relevance: 10.00%

Abstract:

To understand the factors specifically affecting tRNA nuclear export, we adapted in situ hybridization procedures to locate endogenous levels of individual tRNA families in wild-type and mutant yeast cells. Our studies of tRNAs encoded by genes lacking introns show that nucleoporin Nup116p affects both poly(A) RNA and tRNA export, whereas Nup159p affects only poly(A) RNA export. Los1p is similar to exportin-t, which facilitates vertebrate tRNA export. A los1 deletion mutation affects tRNA but not poly(A) RNA export. The data support the notion that Los1p and exportin-t are functional homologues. Because LOS1 is nonessential, tRNA export in vertebrate and yeast cells likely involves factors in addition to exportin-t. Mutation of RNA1, which encodes RanGAP, causes nuclear accumulation of tRNAs and poly(A) RNA. Many yeast mutants, including those with the rna1-1 mutation, affect both pre-tRNA splicing and RNA export. Our studies of the location of intron-containing pre-tRNAs in the rna1-1 mutant rule out the possibility that this results from tRNA export occurring before splicing. Our results also argue against inappropriate subnuclear compartmentalization causing defects in pre-tRNA splicing. Rather, the data support “feedback” of nucleus/cytosol exchange to the pre-tRNA splicing machinery.

Relevance: 10.00%

Abstract:

We analyze the three-dimensional structure of proteins by a computer program that finds regions of sequence that contain module boundaries, defining a module as a segment of polypeptide chain bounded in space by a specific given distance. The program defines a set of “linker regions” that have the property that if an intron were to be placed into each linker region, the protein would be dissected into a set of modules all less than the specified diameter. We test a set of 32 proteins, all of ancient origin, and a corresponding set of 570 intron positions, to ask if there is a statistically significant excess of intron positions within the linker regions. For 28-Å modules, a standard size used historically, we find such an excess, with P < 0.003. This correlation is neither due to a compositional or sequence bias in the linker regions nor to a surface bias in intron positions. Furthermore, a subset of 20 introns, which can be putatively identified as old, lies even more explicitly within the linker regions, with P < 0.0003. Thus, there is a strong correlation between intron positions and three-dimensional structural elements of ancient proteins as expected by the introns-early approach. We then study a range of module diameters and show that, as the diameter varies, significant peaks of correlation appear for module diameters centered at 21.7, 27.6, and 32.9 Å. These preferred module diameters roughly correspond to predicted exon sizes of 15, 22, and 30 residues. Thus, there are significant correlations between introns, modules, and a quantized pattern of the lengths of polypeptide chains, which is the prediction of the “Exon Theory of Genes.”
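The kind of excess test described above can be illustrated with a short sketch. The abstract does not state which statistic the authors used, so the one-sided binomial test below is an assumption, and the function names and toy numbers are hypothetical. It asks: if intron positions fell at random along the sequence, how likely is it that at least the observed number would land inside linker regions?

```python
from math import comb


def binomial_upper_tail(k, n, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def linker_fraction(linker_regions, total_length):
    """Fraction of residues covered by (start, end) linker regions, inclusive."""
    covered = sum(end - start + 1 for start, end in linker_regions)
    return covered / total_length


def count_in_linkers(intron_positions, linker_regions):
    """Number of intron positions that fall inside any linker region."""
    return sum(
        any(start <= pos <= end for start, end in linker_regions)
        for pos in intron_positions
    )


# Toy data (hypothetical, not taken from the study).
linkers = [(10, 19), (50, 59)]   # two linker regions on a 100-residue protein
introns = [12, 15, 55, 80]       # intron positions mapped onto the protein

p_random = linker_fraction(linkers, 100)
observed = count_in_linkers(introns, linkers)
p_value = binomial_upper_tail(observed, len(introns), p_random)
print(observed, p_random, p_value)
```

A real analysis would pool positions over all proteins in the set and would need the controls the abstract mentions (compositional, sequence, and surface bias), which this sketch does not attempt.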

Relevance: 10.00%

Abstract:

Exon/intron architecture varies across the eukaryotic kingdom with large introns and small exons the rule in vertebrates and the opposite in lower eukaryotes. To investigate the relationship between exon and intron size in pre-mRNA processing, internally expanded exons were placed in vertebrate genes with small and large introns. Both exon and intron size influenced splicing phenotype. Intron size dictated if large exons were efficiently recognized. When introns were large, large exons were skipped; when introns were small, the same large exons were included. Thus, large exons were incompatible for splicing if and only if they were flanked by large introns. Both intron and exon size became problematic at ≈500 nt, although both exon and intron sequence influenced the size at which exons and introns failed to be recognized. These results indicate that present-day gene architecture reflects at least in part limitations on exon recognition. Furthermore, these results strengthen models that invoke pairing of splice sites during recognition of pre-mRNAs, and suggest that vertebrate consensus sequences support pairing across either introns or exons.

Relevance: 10.00%

Abstract:

We identified a viral IL-10 homolog encoded by an ORF (UL111a) within the human cytomegalovirus (CMV) genome, which we designated cmvIL-10. cmvIL-10 can bind to the human IL-10 receptor and can compete with human IL-10 for binding sites, despite the fact that these two proteins are only 27% identical. cmvIL-10 requires both subunits of the IL-10 receptor complex to induce signal transduction events and biological activities. The structure of the cmvIL-10 gene is itself unique: the gene has retained two of the four introns of the IL-10 gene, but the introns are shorter. We demonstrated that cmvIL-10 is expressed in CMV-infected cells. Thus, expression of cmvIL-10 extends the range of countermeasures developed by CMV to circumvent detection and destruction by the host immune system.

Relevance: 10.00%

Abstract:

Cells of several major algal groups are evolutionary chimeras of two radically different eukaryotic cells. Most of these “cells within cells” lost the nucleus of the former algal endosymbiont. After hundreds of millions of years, however, cryptomonads still retain the nucleus of their former red algal endosymbiont as a tiny relict organelle, the nucleomorph, which has three minute linear chromosomes; the function of these chromosomes and the nature of their ends have been unclear. We report extensive cryptomonad nucleomorph sequences (68.5 kb), from one end of each of the three chromosomes of Guillardia theta. Telomeres of the nucleomorph chromosomes differ dramatically from those of other eukaryotes, being repeats of the 23-mer sequence (AG)₇AAG₆A, not a typical hexamer (commonly TTAGGG). The subterminal regions comprising the rRNA cistrons and one protein-coding gene are exactly repeated at all three chromosome ends. Gene density (one per 0.8 kb) is the highest for any cellular genome. None of the 38 protein-coding genes has spliceosomal introns, in marked contrast to the chlorarachniophyte nucleomorph. Most identified nucleomorph genes are for gene expression or protein degradation; histone, tubulin, and putatively centrosomal ranbpm genes are probably important for chromosome segregation. No genes for primary or secondary metabolism have been found. Two of the three tRNA genes have introns, one in a hitherto undescribed location. Intergenic regions are exceptionally short; three genes transcribed by two different RNA polymerases overlap their neighbors. The reported sequences encode two essential chloroplast proteins, FtsZ and rubredoxin, thus explaining why cryptomonad nucleomorphs persist.

Relevance: 10.00%

Abstract:

The structures of the genes encoding the α1 and β1 subunits of murine soluble guanylyl cyclase (sGC) were determined. Full-length cDNAs isolated from mouse lungs encoding the α1 (2.5 kb) and β1 (3.3 kb) subunits are presented in this report. The α1 sGC gene is approximately 26.4 kb and contains nine exons, whereas the β1 sGC gene spans 22 kb and consists of 14 exons. The positions of exon/intron boundaries and the sizes of introns for both genes are described. Comparison of mouse genomic organization with the Human Genome Database predicted the exon/intron boundaries of the human genes and revealed that human and mouse α1 and β1 sGC genes have similar structures. Both mouse genes are localized on the third chromosome, band 3E3-F1, and are separated by a fragment that is 2% of the chromosomal length. The 5′ untranscribed regions of α1 and β1 subunit genes were subcloned into luciferase reporter constructs, and the functional analysis of promoter activity was performed in murine neuroblastoma N1E-115 cells. Our results indicate that the 5′ untranscribed regions for both genes possess independent promoter activities and, together with the data on chromosomal localization, suggest independent regulation of both genes.

Relevance: 10.00%

Abstract:

Cloning and sequencing of the upstream region of the gene of the CC chemokine HCC-1 led to the discovery of an adjacent gene coding for a CC chemokine that was named “HCC-2.” The two genes are separated by 12-kbp and reside in a head-to-tail orientation on chromosome 17. At variance with the genes for HCC-1 and other human CC chemokines, which have a three-exon-two-intron structure, the HCC-2 gene consists of four exons and three introns. Expression of HCC-2 and HCC-1 as studied by Northern analysis revealed, in addition to the regular, monocistronic mRNAs, a common, bicistronic transcript. In contrast to HCC-1, which is expressed constitutively in numerous human tissues, HCC-2 is expressed only in the gut and the liver. HCC-2 shares significant sequence homology with CKβ8 and the murine chemokines C10, CCF18/MRP-2, and macrophage inflammatory protein 1γ, which all contain six instead of four conserved cysteines. The two additional cysteines of HCC-2 form a third disulfide bond, which anchors the COOH-terminal domain to the core of the molecule. Highly purified recombinant HCC-2 was tested on neutrophils, eosinophils, monocytes, and lymphocytes and was found to exhibit marked functional similarities to macrophage inflammatory protein 1α. It is a potent chemoattractant and inducer of enzyme release in monocytes and a moderately active attractant for eosinophils. Desensitization studies indicate that HCC-2 acts mainly via CC chemokine receptor CCR1.

Relevance: 10.00%

Abstract:

5′-End fragments of two genes encoding plastid-localized acetyl-CoA carboxylase (ACCase; EC 6.4.1.2) of wheat (Triticum aestivum) were cloned and sequenced. The sequences of the two genes, Acc-1,1 and Acc-1,2, are 89% identical. Their exon sequences are 98% identical. The amino acid sequence of the biotin carboxylase domain encoded by Acc-1,1 and Acc-1,2 is 93% identical with the maize plastid ACCase but only 80–84% identical with the cytosolic ACCases from other plants and from wheat. Four overlapping fragments of cDNA covering the entire coding region were cloned by PCR and sequenced. The wheat plastid ACCase ORF contains 2,311 amino acids with a predicted molecular mass of 255 kDa. A putative transit peptide is present at the N terminus. Comparison of the genomic and cDNA sequences revealed introns at conserved sites found in the genes of other plant multifunctional ACCases, including two introns absent from the wheat cytosolic ACCase genes. Transcription start sites of the plastid ACCase genes were estimated from the longest cDNA clones obtained by 5′-RACE (rapid amplification of cDNA ends). The untranslated leader sequence encoded by the Acc-1 genes is at least 130–170 nucleotides long and is interrupted by an intron. Southern analysis indicates the presence of only one copy of the gene in each ancestral chromosome set. The gene maps near the telomere on the short arm of chromosomes 2A, 2B, and 2D. Identification of three different cDNAs, two corresponding to genes Acc-1,1 and Acc-1,2, indicates that all three genes are transcriptionally active.

Relevance: 10.00%

Abstract:

Sequence analysis of chloroplast and mitochondrial large subunit rRNA genes from over 75 green algae disclosed 28 new group I intron-encoded proteins carrying a single LAGLIDADG motif. These putative homing endonucleases form four subfamilies of homologous enzymes, with the members of each subfamily being encoded by introns sharing the same insertion site. We showed that four divergent endonucleases from the I-CreI subfamily cleave the same DNA substrates. Mapping of the 66 amino acids that are conserved among the members of this subfamily on the 3-dimensional structure of I-CreI bound to its recognition sequence revealed that these residues participate in protein folding, homodimerization, DNA recognition and catalysis. Surprisingly, only seven of the 21 I-CreI amino acids interacting with DNA are conserved, suggesting that I-CreI and its homologs use different subsets of residues to recognize the same DNA sequence. Our sequence comparison of all 45 single-LAGLIDADG proteins identified so far suggests that these proteins share related structures and that there is a weak pressure in each subfamily to maintain identical protein–DNA contacts. The high sequence variability we observed in the DNA-binding site of homologous LAGLIDADG endonucleases provides insight into how these proteins evolve new DNA specificity.

Relevance: 10.00%

Abstract:

Taking advantage of the ongoing Dictyostelium genome sequencing project, we have assembled >73 kb of genomic DNA in 15 contigs harbouring 15 genes and one pseudogene of Rho-related proteins. Comparison with EST sequences revealed that every gene is interrupted by at least one and up to four introns. For racC extensive alternative splicing was identified. Northern blot analysis showed that mRNAs for racA, racE, racG, racH and racI were present at all stages of development, whereas racJ and racL were expressed only at late stages. Amino acid sequences have been analysed in the context of Rho-related proteins of other organisms. Rac1a/1b/1c, RacF1/F2 and to a lesser extent RacB and the GTPase domain of RacA can be grouped in the Rac subfamily. None of the additional Dictyostelium Rho-related proteins belongs to any of the well-defined subfamilies, like Rac, Cdc42 or Rho. RacD and RacA are unique in that they lack the prenylation motif characteristic of Rho proteins. RacD possesses a 50 residue C-terminal extension and RacA a 400 residue C-terminal extension that contains a proline-rich region, two BTB domains and a novel C-terminal domain. We have also identified homologues for RacA in Drosophila and mammals, thus defining a new subfamily of Rho proteins, RhoBTB.

Relevance: 10.00%

Abstract:

Group II introns are widely believed to have been ancestors of spliceosomal introns, yet little is known about their own evolutionary history. In order to address the evolution of mobile group II introns, we have compiled 71 open reading frames (ORFs) related to group II intron reverse transcriptases and subjected their derived amino acid sequences to phylogenetic analysis. The phylogenetic tree was rooted with reverse transcriptases (RTs) of non-long terminal repeat retroelements, and the inferred phylogeny reveals two major clusters which we term the mitochondrial and chloroplast-like lineages. Bacterial ORFs are mainly positioned at the bases of the two lineages but with weak bootstrap support. The data give an overview of an apparently high degree of horizontal transfer of group II intron ORFs, mostly among related organisms but also between organelles and bacteria. The Zn domain (nuclease) and YADD motif (RT active site) were lost multiple times during evolution. Differences in domain structures suggest that the oldest ORFs were concise, while the ORF in the mitochondrial lineage subsequently expanded in three locations. The data are consistent with a bacterial origin for mobile group II introns.

Relevance: 10.00%

Abstract:

Defects in the XPG DNA repair endonuclease gene can result in the cancer-prone disorders xeroderma pigmentosum (XP) or the XP–Cockayne syndrome complex. While the XPG cDNA sequence was known, determination of the genomic sequence was required to understand its different functions. In cells from normal donors, we found that the genomic sequence of the human XPG gene spans 30 kb, contains 15 exons that range from 61 to 1074 bp and 14 introns that range from 250 to 5763 bp. Analysis of the splice donor and acceptor sites using an information theory-based approach revealed three splice sites with low information content, which are components of the minor (U12) spliceosome. We identified six alternatively spliced XPG mRNA isoforms in cells from normal donors and from XPG patients: partial deletion of exon 8, partial retention of intron 8, two with alternative exons (in introns 1 and 6) and two that retained complete introns (introns 3 and 9). The amount of alternatively spliced XPG mRNA isoforms varied in different tissues. Most alternative splice donor and acceptor sites had a relatively high information content, but one has the U12 spliceosome sequence. A single nucleotide polymorphism has allele frequencies of 0.74 for 3507G and 0.26 for 3507C in 91 donors. The human XPG gene contains multiple splice sites with low information content in association with multiple alternatively spliced isoforms of XPG mRNA.
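As a rough illustration of what "information content" of a splice site means, the sketch below scores a site against a weight matrix built from aligned example sites, in the general style of individual-information analysis. It is a simplified assumption rather than the procedure used in the study: it omits the small-sample correction, and the function names and toy sites are hypothetical.

```python
from math import log2

BASES = "ACGT"


def weight_matrix(aligned_sites):
    """Per-position weights 2 + log2(f) from equal-length aligned DNA sites.

    A base never seen at a position gets weight -inf (no pseudocounts here).
    """
    n = len(aligned_sites)
    length = len(aligned_sites[0])
    matrix = []
    for pos in range(length):
        column = [site[pos] for site in aligned_sites]
        weights = {}
        for base in BASES:
            freq = column.count(base) / n
            weights[base] = 2 + log2(freq) if freq > 0 else float("-inf")
        matrix.append(weights)
    return matrix


def site_information(site, matrix):
    """Information content (bits) of one site under the weight matrix."""
    return sum(weights[base] for base, weights in zip(site, matrix))


# Toy alignment (hypothetical): the first two intron bases of four donor sites.
donors = ["GT", "GT", "AT", "GT"]
matrix = weight_matrix(donors)
print(site_information("GT", matrix))  # common variant, higher score
print(site_information("AT", matrix))  # rare variant, lower score
```

Sites that score low under a matrix built from major-class sites are the ones such an approach would flag, as the abstract describes for the candidate U12-type sites.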

Relevance: 10.00%

Abstract:

Only a fraction of the rules used by the splicing machinery to precisely determine intron–exon boundaries is known. Recent evidence suggests that specific short sequences within exons help in defining these boundaries. Such sequences are known as exonic splicing enhancers (ESE). A possible bioinformatic approach to studying ESE sequences is to compare genes that harbor introns with genes that do not. For this purpose two non-redundant samples of 719 intron-containing and 63 intron-lacking human genes were created. We performed a statistical analysis on these datasets of intron-containing and intron-lacking human coding sequences and found a statistically significant difference (P = 0.01) between these samples in terms of 5–6mer oligonucleotide distributions. The difference is not created by a few strong signals present in the majority of exons, but rather by the accumulation of multiple weak signals through small variations in codon frequencies, codon biases and context-dependent codon biases between the samples. A list of putative novel human splicing regulation sequences has been identified by our analysis.
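The comparison of oligonucleotide distributions can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the abstract does not specify the statistical test, so the code only counts overlapping k-mers in two sets of coding sequences and reports the difference in relative frequency per k-mer. The function names and toy sequences are hypothetical.

```python
from collections import Counter


def kmer_counts(sequences, k):
    """Count all overlapping k-mers across a collection of sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def frequency_differences(counts_a, counts_b):
    """Relative frequency in set A minus relative frequency in set B, per k-mer."""
    total_a = sum(counts_a.values()) or 1  # avoid division by zero on empty input
    total_b = sum(counts_b.values()) or 1
    kmers = set(counts_a) | set(counts_b)
    return {
        kmer: counts_a[kmer] / total_a - counts_b[kmer] / total_b
        for kmer in kmers
    }


# Toy sequences (hypothetical) standing in for the two gene samples.
with_introns = kmer_counts(["AAAAAA"], 5)
without_introns = kmer_counts(["AAAAAC"], 5)
diffs = frequency_differences(with_introns, without_introns)

# K-mers ranked from most enriched in the first set to most depleted.
for kmer, diff in sorted(diffs.items(), key=lambda item: -item[1]):
    print(kmer, diff)
```

Since the abstract attributes the signal to many weak differences rather than a few strong ones, a ranking like this would be a starting point only; significance would have to be assessed over the whole distribution.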