260 resultados para Introns


Relevância:

10.00% 10.00%

Publicador:

Resumo:

Linkage disequilibrium between polymorphisms in a natural population may result from various evolutionary forces, including random genetic drift due to sampling of gametes during reproduction, restricted migration between subpopulations in a subdivided population, or epistatic selection. In this report, we present evidence that the majority of significant linkage disequilibria observed in introns of the alcohol dehydrogenase locus (Adh) of Drosophila pseudoobscura are due to epistatic selection maintaining secondary structure of precursor mRNA (pre-mRNA). Based on phylogenetic-comparative analysis and a likelihood approach, we propose secondary structure models of Adh pre-mRNA for the regions of the adult intron and intron 2 where clustering of linkage disequilibria has been observed. Furthermore, we applied the likelihood ratio test to the phylogenetically predicted secondary structure in intron 1. In contrast to the other two structures, polymorphisms associated with the more conserved stem-loop structure of intron 1 are in low frequency, and linkage disequilibria have not been observed. These findings are qualitatively consistent with a model of compensatory fitness interactions. This model assumes that mutations disrupting pairing in a secondary structural element are individually deleterious if they destabilize a functionally important structure; a second "compensatory" mutation, however, may restabilize the structure and restore fitness.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Genes for glycolytic and Calvin-cycle glyceraldehyde-3-phosphate dehydrogenase (GAPDH) of higher eukaryotes derive from ancient gene duplications which occurred in eubacterial genomes; both were transferred to the nucleus during the course of endosymbiosis. We have cloned cDNAs encoding chloroplast and cytosolic GAPDH from the early-branching photosynthetic protist Euglena gracilis and have determined the structure of its nuclear gene for cytosolic GAPDH. The gene contains four introns which possess unusual secondary structures, do not obey the GT-AG rule, and are flanked by 2- to 3-bp direct repeats. A gene phylogeny for these sequences in the context of eubacterial homologues indicates that euglenozoa, like higher eukaryotes, have obtained their GAPDH genes from eubacteria via endosymbiotic (organelle-to-nucleus) gene transfer. The data further suggest that the early-branching protists Giardia lamblia and Entamoeba histolytica--which lack mitochondria--and portions of the trypanosome lineage have acquired GAPDH genes from eubacterial donors which did not ultimately give rise to contemporary membrane-bound organelles. Evidence that "cryptic" (possibly ephemeral) endosymbioses during evolution may have entailed successful gene transfer is preserved in protist nuclear gene sequences.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The exon theory of genes proposes that the introns of protein-encoding nuclear genes are remnants of the DNA spacers between ancient minigenes. The discovery of an intron at a predicted position in the triose-phosphate isomerase (EC 5.3.1.1) gene of Culex mosquitoes has been hailed as an evidential pillar of the theory. We have found that that intron is also present in Aedes mosquitoes, which are closely related to Culex, but not in the phylogenetically more distant Anopheles, nor in the fly Calliphora vicina, nor in the moth Spodoptera littoralis. The presence of this intron in Culex and Aedes is parsimoniously explained as the result of an insertion in a recent common ancestor of these two species rather than as the remnant of an ancient intron. The absence of the intron in 19 species of very diverse organisms requires at least 10 independent evolutionary losses in order to be consistent with the exon theory.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The biosynthesis of gibberellins (GAs) after GA12-aldehyde involves a series of oxidative steps that lead to the formation of bioactive GAs. Previously, a cDNA clone encoding a GA 20-oxidase [gibberellin, 2-oxoglutarate:oxygen oxidoreductase (20-hydroxylating, oxidizing), EC 1.14.11.-] was isolated by immunoscreening a cDNA library from liquid endosperm of pumpkin (Cucurbita maxima L.) with antibodies against partially purified GA 20-oxidase. Here, we report isolation of a genomic clone for GA 20-oxidase from a genomic library of the long-day species Arabidopsis thaliana Heynh., strain Columbia, by using the pumpkin cDNA clone as a heterologous probe. This genomic clone contains a GA 20-oxidase gene that consists of three exons and two introns. The three exons are 1131-bp long and encode 377 amino acid residues. A cDNA clone corresponding to the putative GA 20-oxidase genomic sequence was constructed with the reverse transcription-PCR method, and the identity of the cDNA clone was confirmed by analyzing the capability of the fusion protein expressed in Escherichia coli to convert GA53 to GA44 and GA19 to GA20. The Arabidopsis GA 20-oxidase shares 55% identity and > 80% similarity with the pumpkin GA 20-oxidase at the derived amino acid level. Both GA 20-oxidases share high homology with other 2-oxoglutarate-dependent dioxygenases (2-ODDs), but the highest homology was found between the two GA 20-oxidases. Mapping results indicated tight linkage between the cloned GA 20-oxidase and the GA5 locus of Arabidopsis. The ga5 semidwarf mutant contains a G-->A point mutation that inserts a translational stop codon in the protein-coding sequence, thus confirming that the GA5 locus encodes GA 20-oxidase. Expression of the GA5 gene in Ara-bidopsis leaves was enhanced after plants were transferred from short to long days; it was reduced by GA4 treatment, suggesting end-product repression in the GA biosynthetic pathway.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The nucleotide sequences of four genes encoding Trimeresurus gramineus (green habu snake, crotalinae) venom gland phospholipase A2 (PLA2; phosphatidylcholine 2-acylhydrolase, EC 3.1.1.4) isozymes were compared internally and externally with those of six genes encoding Trimeresurus flavoviridis (habu snake, crotalinae) venom gland PLA2 isozymes. The numbers of nucleotide substitutions per site (KN) for the noncoding regions including introns were one-third to one-eighth of the numbers of nucleotide substitutions per synonymous site (KS) for the protein-coding regions of exons, indicating that the noncoding regions are much more conserved than the protein-coding regions. The KN values for the introns were found to be nearly equivalent to those of introns of T. gramineus and T. flavoviridis TATA box-binding protein genes, which are assumed to be a general (nonvenomous) gene. Thus, it is evident that the introns of venom gland PLA2 isozyme genes have evolved at a similar rate to those of nonvenomous genes. The numbers of nucleotide substitutions per nonsynonymous site (KA) were close to or larger than the KS values for the protein-coding regions in venom gland PLA2 isozyme genes. All of the data combined reveal that Darwinian-type accelerated evolution has universally occurred only in the protein-coding regions of crotalinae snake venom PLA2 isozyme genes.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Domain 5 (D5) is a small hairpin structure within group II introns. A bimolecular assay system depends on binding by D5 to an intron substrate for self-splicing activity. In this study, mutations in D5 identify two among six nearly invariant nucleotides as being critical for 5' splice junction hydrolysis but unimportant for binding. A mutation at another site in D5 blocks binding. Thus, mutations can distinguish two D5 functions: substrate binding and catalysis. The secondary structure of D5 may resemble helix I formed by the U2 and U6 small nuclear RNAs in the eukaryotic spliceosome. Our results support a revision of the previously proposed correspondence between D5 and helix I on the basis of the critical trinucleotide 5'-AGC-3' present in both. We suggest that this trinucleotide plays a similar role in promoting the chemical reactions for both splicing systems.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Clusterina (CLU) è una proteina ubiquitaria, presente nella maggior parte dei fluidi corporei e implicata in svariati processi fisiologici. Dalla sua scoperta fino ad oggi, CLU è risultata essere una proteina enigmatica, la cui funzione non è ancora stata compresa appieno. Il gene codifica per 3 varianti trascrizionali identificate nel database NCBI con i codici: NM_001831 (CLU 1 in questo lavoro di tesi), NR_038335 (CLU 2 in questo lavoro di tesi) e NR_045494 (CLU 3 in questo lavoro di tesi). Tutte le varianti sono trascritte come pre-mRNA contenenti 9 esoni e 8 introni e si differenziano per l’esone 1, la cui sequenza è unica e caratteristica di ogni variante. Sebbene in NCBI sia annotato che le varianti CLU 2 e CLU 3 non sono codificanti, tramite analisi bioinformatica è stato predetto che da tutti e tre i trascritti possono generarsi proteine di differente lunghezza e localizzazione cellulare. Tra tutte le forme proteiche ipotizzate, l’unica a essere stata isolata e sequenziata è quella tradotta dall’AUG presente sull’esone 2 che dà origine a una proteina di 449 aminoacidi. Il processo di maturazione prevede la formazione di un precursore citoplasmatico (psCLU) che subisce modificazioni post-traduzionali tra cui formazione di ponti disolfuro, glicosilazioni, taglio in due catene denominate β e α prima di essere secreta come eterodimero βα (sCLU) nell’ambiente extracellulare, dove esercita la sua funzione di chaperone ATP-indipendente. Oltre alla forma extracellulare, è possibile osservare una forma intracellulare con localizzazione citosolica la cui funzione non è stata ancora completamente chiarita. Questo lavoro di tesi si è prefissato lo scopo di incrementare le conoscenze in merito ai trascritti CLU 1 e CLU 2 e alla loro regolazione, oltre ad approfondire il ruolo della forma citosolica della proteina in relazione al signaling di NF-kB che svolge un ruolo importante nel processo di sviluppo e metastatizzazione del tumore. Nella prima parte, uno screening di differenti linee cellulari, quali cellule epiteliali di prostata e di mammella, sia normali sia tumorali, fibroblasti di origine polmonare e linfociti di tumore non-Hodgkin, ha permesso di caratterizzare i trascritti CLU 1 e CLU 2. Dall’analisi è emerso che la sequenza di CLU 1 è più corta al 5’ rispetto a quella depositata in NCBI con l’identificativo NM_001831 e il primo AUG disponibile per l’inizio della traduzione è localizzato sull’esone 2. È stato dimostrato che CLU 2, al contrario di quanto riportato in NCBI, è tradotto in proteina a partire dall’AUG presente sull’esone 2, allo stesso modo in cui viene tradotto CLU 1. Inoltre, è stato osservato che i livelli d’espressione dei trascritti variano notevolmente tra le diverse linee cellulari e nelle cellule epiteliali CLU 2 è espressa sempre a bassi livelli. In queste cellule, l’espressione di CLU 2 è silenziata per via epigenetica e la somministrazione di farmaci capaci di rendere la cromatina più accessibile, quali tricostatina A e 5-aza-2’-deossicitidina, è in grado di incrementarne l’espressione. Nella seconda parte, un’analisi bioinformatica seguita da saggi di attività in vitro in cellule epiteliali prostatiche trattate con farmaci epigenetici, hanno permesso di identificare, per la prima volta in uomo, una seconda regione regolatrice denominata P2, capace di controllare l’espressione di CLU 2. Rispetto a P1, il classico promotore di CLU già ampiamente studiato da altri gruppi di ricerca, P2 è un promotore debole, privo di TATA box, che nelle cellule epiteliali prostatiche è silente in condizioni basali e la cui attività incrementa in seguito alla somministrazione di farmaci epigenetici capaci di alterare le modificazioni post-traduzionali delle code istoniche nell’intorno di P2. Ne consegue un rilassamento della cromatina e un successivo aumento di trascrizione di CLU 2. La presenza di un’isola CpG differentemente metilata nell’intorno di P1 spiegherebbe, almeno in parte, i differenti livelli di espressione di CLU che si osservano tra le diverse linee cellulari. Nella terza parte, l’analisi del pathway di NF-kB in un modello sperimentale di tumore prostatico in cui CLU è stata silenziata o sovraespressa, ha permesso di capire come la forma citosolica di CLU abbia un ruolo inibitorio nei confronti dell’attività del fattore trascrizionale NF-kB. CLU inibisce la fosforilazione e l’attivazione di p65, il membro più rappresentativo della famiglia NF-kB, con conseguente riduzione della trascrizione di alcuni geni da esso regolati e coinvolti nel rimodellamento della matrice extracellulare, quali l’urochinasi attivatrice del plasminogeno, la catepsina B e la metallo proteinasi 9. È stato dimostrato che tale inibizione non è dovuta a un’interazione fisica diretta tra CLU e p65, per cui si suppone che CLU interagisca con uno dei componenti più a monte della via di segnalazione responsabile della fosforilazione ed attivazione di p65.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Fused in sarcoma (FUS) is a ubiquitously expressed RNA-binding protein proposed to function in various RNA metabolic pathways, including transcription regulation, pre-mRNA splicing, RNA transport and microRNA processing. Mutations in the FUS gene were identified in patients with amyotrophic lateral sclerosis (ALS), but the pathomechanisms by which these mutations cause ALS are not known. Here, we show that FUS interacts with the minor spliceosome constituent U11 snRNP, binds preferentially to minor introns and directly regulates their removal. Furthermore, a FUS knockout in neuroblastoma cells strongly disturbs the splicing of minor intron-containing mRNAs, among them mRNAs required for action potential transmission and for functional spinal motor units. Moreover, an ALS-associated FUS mutant that forms cytoplasmic aggregates inhibits splicing of minor introns by trapping U11 and U12 snRNAs in these aggregates. Collectively, our findings suggest a possible pathomechanism for ALS in which mutated FUS inhibits correct splicing of minor introns in mRNAs encoding proteins required for motor neuron survival.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Alternative splicing is widespread in mammalian gene expression, and variant splice patterns are often specific to different stages of development, particular tissues or a disease state. There is a need to systematically collect data on alternatively spliced exons, introns and splice isoforms, and to annotate this data. The Alternative Splicing Database consortium has been addressing this need, and is committed to maintaining and developing a value-added database of alternative splice events, and of experimentally verified regulatory mechanisms that mediate splice variants. In this paper we present two of the products from this project: namely, a database of computationally delineated alternative splice events as seen in alignments of EST/cDNA sequences with genome sequences, and a database of alternatively spliced exons collected from literature. The reported splice events are from nine different organisms and are annotated for various biological features including expression states and cross-species conservation. The data are presented on our ASD web pages (http://www.ebi.ac.uk/asd).

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Do non-coding RNAs that are derived from the introns and exons of protein-coding and non-protein-coding genes represent a fundamental advance in the genetic operating system of higher organisms? Recent evidence from comparative genomics and molecular genetics indicates that this might be the case. If so, there will be profound consequences for our understanding of the genetics of these organisms, and in particular how the trajectories of differentiation and development and the differences among individuals and species are genomically programmed. But how might this hypothesis be tested?

Relevância:

10.00% 10.00%

Publicador:

Resumo:

There are 481 segments longer than 200 base pairs (bp) that are absolutely conserved (100% identity with no insertions or deletions) between orthologous regions of the human, rat, and mouse genomes. Nearly all of these segments are also conserved in the chicken and dog genomes, with an average of 95 and 99% identity, respectively. Many are also significantly conserved in fish. These ultraconserved elements of the human genome are most often located either overlapping exons in genes involved in RNA processing or in introns or nearby genes involved in the regulation of transcription and development. Along with more than 5000 sequences of over 100 bp that are absolutely conserved among the three sequenced mammals, these represent a class of genetic elements whose functions and evolutionary origins are yet to be determined, but which are more highly conserved between these species than are proteins and appear to be essential for the ontogeny of mammals and other vertebrates.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Mammalian cells harbor numerous small non-protein-coding RNAs, including small nucleolar RNAs (snoRNAs), microRNAs (miRNAs), short interfering RNAs (siRNAs) and small double-stranded RNAs, which regulate gene expression at many levels including chromatin architecture, RNA editing, RNA stability, translation, and quite possibly transcription and splicing. These RNAs are processed by multistep pathways from the introns and exons of longer primary transcripts, including protein-coding transcripts. Most show distinctive temporal- and tissue-specific expression patterns in different tissues, including embryonal stem cells and the brain, and some are imprinted. Small RNAs control a wide range of developmental and physiological pathways in animals, including hematopoietic differentiation, adipocyte differentiation and insulin secretion in mammals, and have been shown to be perturbed in cancer and other diseases. The extent of transcription of non-coding sequences and the abundance of small RNAs suggests the existence of an extensive regulatory network on the basis of RNA signaling which may underpin the development and much of the phenotypic variation in mammals and other complex organisms and which may have different genetic signatures from sequences encoding proteins.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This report describes the presence of a unique dual domain carbonic anhydrase (CA) in the giant clam, Tridacna gigas. CA plays an important role in the movement of inorganic carbon (C-i) from the surrounding seawater to the symbiotic algae that are found within the clam's tissue. One of these isoforms is a glycoprotein which is significantly larger (70 kDa) than any previously reported from animals (generally between 28 and 52 kDa). This alpha-family CA contains two complete carbonic anhydrase domains within the one protein, accounting for its large size; dual domain CAs have previously only been reported from two algal species. The protein contains a leader sequence, an N-terminal CA domain and a C-terminal CA domain. The two CA domains have relatively little identity at the amino acid level (29%). The genomic sequence spans in excess of 17 kb and contains at least 12 introns and 13 exons. A number of these introns are in positions that are only found in the membrane attached/secreted CAs. This fact, along with phylogenetic analysis, suggests that this protein represents the second example of a membrane attached invertebrate CA and it contains a dual domain structure unique amongst all animal CAs characterized to date.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Recent large-scale analyses of mainly full-length cDNA libraries generated from a variety of mouse tissues indicated that almost half of all representative cloned sequences did flat contain ail apparent protein-coding sequence, and were putatively derived from non-protein-coding RNA (ncRNA) genes. However, many of these clones were singletons and the majority were unspliced, raising the possibility that they may be derived from genomic DNA or unprocessed pre-rnRNA contamination during library construction, or alternatively represent nonspecific transcriptional noise. Here we Show, using reverse transcriptase-dependent PCR, microarray, and Northern blot analyses, that many of these clones were derived from genuine transcripts Of unknown function whose expression appears to be regulated. The ncRNA transcripts have larger exons and fewer introns than protein-coding transcripts. Analysis of the genomic landscape around these sequences indicates that some cDNA clones were produced not from terminal poly(A) tracts but internal priming sites within longer transcripts, only a minority of which is encompassed by known genes. A significant proportion of these transcripts exhibit tissue-specific expression patterns, as well as dynamic changes in their expression in macrophages following lipopolysaccharide Stimulation. Taken together, the data provide strong support for the conclusion that ncRNAs are an important, regulated component of the mammalian transcriptome.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The term non-coding RNA (ncRNA) is commonly employed for RNA that does not encode a protein, but this does not mean that such RNAs do not contain information nor have function. Although it has been generally assumed that most genetic information is transacted by proteins, recent evidence suggests that the majority of the genomes of mammals and other complex organisms is in fact transcribed into ncRNAs, many of which are alternatively spliced and/or processed into smaller products. These ncRNAs include microRNAs and snoRNAs (many if not most of which remain to be identified), as well as likely other classes of yet-to-be-discovered small regulatory RNAs, and tens of thousands of longer transcripts (including complex patterns of interlacing and overlapping sense and antisense transcripts), most of whose functions are unknown. These RNAs (including those derived from introns) appear to comprise a hidden layer of internal signals that control various levels of gene expression in physiology and development, including chromatin architecture/epigenetic memory, transcription, RNA splicing, editing, translation and turnover. RNA regulatory networks may determine most of our complex characteristics, play a significant role in disease and constitute an unexplored world of genetic variation both within and between species.