943 results for Noncoding Sequences
Abstract:
Genomic sequence comparison across species has enabled the elucidation of important coding and regulatory sequences encoded within DNA. Of particular interest are the noncoding regulatory sequences, which influence gene transcriptional and posttranscriptional processes. A phylogenetic footprinting strategy was employed to identify noncoding conservation patterns of 39 human and bovine orthologous genes. Seventy-three conserved noncoding sequences were identified that shared greater than 70% identity over at least 100 bp. Thirteen of these conserved sequences were also identified in the mouse genome. Evolutionary conservation of noncoding sequences across diverse species may have functional significance, and these conserved sequences may be good candidates for regulatory elements.
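The conservation criterion above (greater than 70% identity over at least 100 bp) can be illustrated with a sliding-window scan over a pairwise alignment. This is only a minimal sketch, not the procedure used in the study: the function name `conserved_windows`, the gapped-string input format, and the rule for merging overlapping windows are all assumptions made for illustration.

```python
def conserved_windows(aln_a, aln_b, window=100, min_identity=0.70):
    """Return merged (start, end) alignment-column spans (0-based, half-open)
    covered by windows whose identity exceeds min_identity.

    Hypothetical helper: aln_a and aln_b are two rows of a pairwise
    alignment, with '-' marking gaps.
    """
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    n = len(aln_a)
    # 1 where the aligned bases match and are not gaps, else 0
    match = [
        1 if a == b and a != "-" else 0
        for a, b in zip(aln_a.upper(), aln_b.upper())
    ]
    spans = []
    if n < window:
        return spans
    count = sum(match[:window])
    for start in range(n - window + 1):
        if start > 0:
            # slide the window one column to the right
            count += match[start + window - 1] - match[start - 1]
        if count / window > min_identity:
            end = start + window
            if spans and start <= spans[-1][1]:
                # overlapping or adjacent qualifying windows are merged
                spans[-1] = (spans[-1][0], end)
            else:
                spans.append((start, end))
    return spans
```

Because qualifying windows are merged, a reported span can be longer than 100 columns, and its overall identity may differ slightly from that of any single window.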
Abstract:
Certain recent models of sex determination in mammals, Drosophila melanogaster, Caenorhabditis elegans, and snakes are examined in the light of the hypothesis that the relevant genetic regulatory mechanisms are similar and interrelated. The proposed key element in each of these instances is a noncoding DNA sequence, which serves as a high-affinity binding site for a repressor-like molecule regulating the activity of a major "sex-determining" gene. On this basis it is argued that, in several eukaryotes, (i) certain DNA sequences that are sex-determining are noncoding, in the sense that they are not the structural genes of a sex-determining protein; (ii) in some species these noncoding sequences are present in one sex and absent in the other, while in others their copy number or accessibility to regulatory molecules is significantly unequal between the two sexes; and (iii) this inequality determines whether the embryo develops into a male or a female.
Abstract:
The removal of noncoding sequences, or introns, from the eukaryotic messenger RNA precursors is catalyzed by a ribonucleoprotein complex known as the spliceosome. In most eukaryotes, two distinct classes of introns exist, each removed by a specific type of spliceosome. The major, U2-type introns account for over 99% of all introns, and are almost ubiquitous. The minor, U12-type introns are found in most but not all eukaryotes, and reside in conserved locations in a specific set of genes. Due to their slow excision rates, the U12-type introns are expected to be involved in the regulation of the genes containing them by inhibiting the maturation of the messenger RNAs. However, little information is currently available on how the activity of the U12-dependent spliceosome itself is regulated. The levels of many known splicing factors are regulated through unproductive alternative splicing events, which lead to inclusion of premature stop codons, targeting the transcripts for destruction by the nonsense-mediated decay pathway. These alternative splice sites are typically found in highly conserved sequence elements, which also contain binding sites for factors regulating the activation of the splice sites. Often, the activation is achieved by binding of products of the gene in question, resulting in negative feedback loops. In this study, I show that U11-48K, a protein factor specific to the minor spliceosome, specifically recognizes the U12-type 5' splice site sequence, and is essential for proper function of the minor spliceosome. Furthermore, the expression of U11-48K is regulated through a feedback mechanism, which functions through conserved sequence elements that activate alternative splicing and nonsense-mediated decay. This mechanism is conserved from plants to animals, highlighting both the importance and early origin of this mechanism in regulating splicing factors.
I also show that the feedback regulation of U11-48K is counteracted by a component of the major spliceosome, the U1 small nuclear ribonucleoprotein particle, as well as members of the hnRNP F/H protein family. These results thus suggest that the feedback mechanism is finely tuned by multiple factors to achieve precise control of the activity of the U12-dependent spliceosome.
Abstract:
Eukaryotic genomes are mostly composed of noncoding DNA whose role is still poorly understood. Studies in several organisms have shown correlations between the length of the intergenic and genic sequences of a gene and the expression of its corresponding mRNA transcript. Some studies have found a positive relationship between intergenic sequence length and expression diversity between tissues, and concluded that genes under greater regulatory control require more regulatory information in their intergenic sequences. Other reports found a negative relationship between expression level and gene length and the interpretation was that there is selection pressure for highly expressed genes to remain small. However, a correlation between gene sequence length and expression diversity, opposite to that observed for intergenic sequences, has also been reported, and to date there is no testable explanation for this observation. To shed light on these varied and sometimes conflicting results, we performed a thorough study of the relationships between sequence length and gene expression using cell-type (tissue) specific microarray data in Arabidopsis thaliana. We measured median gene expression across tissues (expression level), expression variability between tissues (expression pattern uniformity), and expression variability between replicates (expression noise). We found that intergenic (upstream and downstream) and genic (coding and noncoding) sequences have generally opposite relationships with respect to expression, whether it is tissue variability, median, or expression noise. To explain these results we propose a model, in which the lengths of the intergenic and genic sequences have opposite effects on the ability of the transcribed region of the gene to be epigenetically regulated for differential expression. These findings could shed light on the role and influence of noncoding sequences on gene expression.
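The three expression measures named above (expression level, expression pattern uniformity, expression noise) could be computed for a single gene along the following lines. The abstract does not give the exact formulas, so the use of coefficients of variation and the function name `expression_summary` are assumptions for illustration only.

```python
import statistics


def expression_summary(replicates_by_tissue):
    """Summarize one gene's expression.

    replicates_by_tissue: dict mapping a tissue name to a list of
    replicate measurements (at least two replicates per tissue and at
    least two tissues). The formulas below are assumed, not taken from
    the study.
    """
    tissue_means = [statistics.mean(v) for v in replicates_by_tissue.values()]
    # expression level: median of the per-tissue means
    level = statistics.median(tissue_means)
    # variability between tissues: coefficient of variation of tissue means
    between_tissue_cv = statistics.stdev(tissue_means) / statistics.mean(tissue_means)
    # expression noise: average within-tissue coefficient of variation
    within_cvs = [
        statistics.stdev(v) / statistics.mean(v)
        for v in replicates_by_tissue.values()
    ]
    noise = statistics.mean(within_cvs)
    return {"level": level, "between_tissue_cv": between_tissue_cv, "noise": noise}
```

Each of these per-gene values could then be correlated against intergenic or genic sequence length across all genes.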
Abstract:
Background. Visceral leishmaniasis (VL) is caused by Leishmania donovani and Leishmania infantum chagasi. Genome-wide linkage studies from Sudan and Brazil identified a putative susceptibility locus on chromosome 6q27. Methods. Twenty-two single-nucleotide polymorphisms (SNPs) at genes PHF10, C6orf70, DLL1, FAM120B, PSMB1, and TBP were genotyped in 193 VL cases from 85 Sudanese families, and 8 SNPs at genes PHF10, C6orf70, DLL1, PSMB1, and TBP were genotyped in 194 VL cases from 80 Brazilian families. Family-based association, haplotype, and linkage disequilibrium analyses were performed. Multispecies comparative sequence analysis was used to identify conserved noncoding sequences carrying putative regulatory elements. Quantitative reverse-transcription polymerase chain reaction measured expression of candidate genes in splenic aspirates from Indian patients with VL compared with that in the control spleen sample. Results. Positive associations were observed at PHF10, C6orf70, DLL1, PSMB1, and TBP in Sudan, but only at DLL1 in Brazil (combined P = 3 × 10^-4 at DLL1 across Sudan and Brazil). No functional coding region variants were observed in resequencing of 22 Sudanese VL cases. DLL1 expression was significantly (P = 2 × 10^-7) reduced (mean fold change, 3.5 [SEM, 0.7]) in splenic aspirates from patients with VL, whereas other 6q27 genes showed higher levels (1.27 × 10^-6 < P < .01) than did the control spleen sample. A cluster of conserved noncoding sequences with putative regulatory variants was identified in the distal promoter of DLL1. Conclusions. DLL1, which encodes Delta-like 1, the ligand for Notch3, is strongly implicated as the chromosome 6q27 VL susceptibility gene.
Abstract:
Cardiac morphogenesis is a complex process governed by evolutionarily conserved transcription factors and signaling molecules. The Drosophila cardiac tube is linear, made of 52 pairs of cardiomyocytes (CMs), which express specific transcription factor genes that have human homologues implicated in Congenital Heart Diseases (CHDs) (NKX2-5, GATA4 and TBX5). It is composed of a rostral portion named aorta and a caudal one called heart, distinguished by morphological and functional differences controlled by Hox genes, key regulators of axial patterning. Overexpression and inactivation of the Hox gene abdominal-A (abd-A), which is expressed exclusively in the heart, revealed that abd-A controls heart identity. The aim of our work is to isolate the heart-specific cis-regulatory sequences of abd-A direct target genes, the realizator genes granting heart identity. In each segment of the heart, four pairs of CMs express tinman (tin), homologous to NKX2-5, and acquire strong contractile and automatic rhythmic activities. By tyramide-amplified FISH, we found that seven genes, encoding ion channels, pumps or transporters, are specifically expressed in the Tin-CMs of the heart. We initially used tools available online to identify their heart-specific cis-regulatory modules by looking for Conserved Noncoding Sequences containing clusters of binding sites for various cardiac transcription factors, including Hox proteins. Based on these data we generated several reporter gene constructs and transgenic embryos, but none of them showed reporter gene expression in the heart. In order to identify additional abd-A target genes, we performed microarray experiments comparing the transcriptomes of aorta versus heart and identified 144 genes overexpressed in the heart.
In order to find the heart-specific cis-regulatory regions of these target genes we developed a new bioinformatic approach where prediction is based on pattern matching and order statistics. We first retrieved Conserved Noncoding Sequences from the alignment between the D. melanogaster and D. pseudoobscura genomes. We scored for combinations of conserved occurrences of ABD-A, ABD-B, TIN, PNR, dMEF2, MADS box, T-box and E-box sites and we ranked these results based on two independent strategies. On the one hand we ranked the putative cis-regulatory sequences according to the best-scoring ABD-A binding sites; on the other hand we ranked them according to conservation of binding sites. We then integrated the two independently obtained lists and ranked them again to produce a final rank. We generated nGFP reporter construct flies for in vivo validation. We identified three 1-kb-long heart-specific enhancers. By in vivo and in vitro experiments we are determining whether they are direct abd-A targets, demonstrating the role of a Hox gene in the realization of heart identity. The identified abd-A direct target genes may also be targets of the NKX2-5, GATA4 and/or TBX5 homologues tin, pannier and Doc genes, respectively. The identification of sequences coregulated by a Hox protein and the homologues of transcription factors causing CHDs will provide a means to test whether these factors function as Hox cofactors granting cardiac specificity to Hox proteins, increasing our knowledge of the molecular mechanisms underlying CHDs. Finally, it may be investigated whether these Hox targets are involved in CHDs.
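The step in which two independently ranked lists are integrated into a final rank can be sketched with a simple mean-rank aggregation. The abstract does not specify how the lists were combined, so mean rank is a stand-in technique here, and the names `ranks`, `combine_rankings`, and the candidate labels are hypothetical.

```python
def ranks(scores):
    """Map each candidate to a 1-based rank; the highest score gets rank 1.
    Ties are broken alphabetically by candidate name."""
    ordered = sorted(scores, key=lambda name: (-scores[name], name))
    return {name: i + 1 for i, name in enumerate(ordered)}


def combine_rankings(site_scores, conservation_scores):
    """Return the candidates present in both score tables, ordered by the
    mean of their two ranks (assumed aggregation rule, not the authors')."""
    shared = set(site_scores) & set(conservation_scores)
    by_site = ranks({k: site_scores[k] for k in shared})
    by_conservation = ranks({k: conservation_scores[k] for k in shared})
    mean_rank = {k: (by_site[k] + by_conservation[k]) / 2 for k in shared}
    return sorted(shared, key=lambda k: (mean_rank[k], k))
```

A candidate that ranks moderately well on both criteria can end up ahead of one that is best on a single criterion, which is the usual motivation for combining independent rankings.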
Abstract:
Mutations in the FBN1 gene are the major cause of Marfan syndrome (MFS), an autosomal dominant connective tissue disorder, which displays variable manifestations in the cardiovascular, ocular, and skeletal systems. Current molecular genetic testing of FBN1 may miss mutations in the promoter region or in other noncoding sequences as well as partial or complete gene deletions and duplications. In this study, we tested for copy number variations by successively applying multiplex ligation-dependent probe amplification (MLPA) and the Affymetrix Human Mapping 500K Array Set, which contains probes for approximately 500,000 single-nucleotide polymorphisms (SNPs) across the genome. By analyzing genomic DNA of 101 unrelated individuals with MFS or related phenotypes in whom standard genetic testing detected no mutation, we identified FBN1 deletions in two patients with MFS. Our high-resolution approach narrowed down the deletion breakpoints. Subsequent sequencing of the junctional fragments revealed deletion sizes of 26,887 and 302,580 bp, respectively. Surprisingly, both deletions affect the putative regulatory and promoter region of the FBN1 gene, strongly indicating that they abolish transcription of the deleted allele. This expectation of complete loss of function of one allele, i.e. true haploinsufficiency, was confirmed by transcript analyses. Our findings not only emphasize the importance of screening for large genomic rearrangements in comprehensive genetic testing of FBN1 but, importantly, also extend the molecular etiology of MFS by providing hitherto unreported evidence that true haploinsufficiency is sufficient to cause MFS.
Abstract:
Editing of RNA changes the read-out of information from DNA by altering the nucleotide sequence of a transcript. One type of RNA editing found in all metazoans uses double-stranded RNA (dsRNA) as a substrate and results in the deamination of adenosine to give inosine, which is translated as guanosine. Editing thus allows variant proteins to be produced from a single pre-mRNA. A mechanism by which dsRNA substrates form is through pairing of intronic and exonic sequences before the removal of noncoding sequences by splicing. Here we report that the RNA editing enzyme, human dsRNA adenosine deaminase (DRADA1, or ADAR1) contains a domain (Zα) that binds specifically to the left-handed Z-DNA conformation with high affinity (KD = 4 nM). As formation of Z-DNA in vivo occurs 5′ to, or behind, a moving RNA polymerase during transcription, recognition of Z-DNA by DRADA1 provides a plausible mechanism by which DRADA1 can be targeted to a nascent RNA so that editing occurs before splicing. Analysis of sequences related to Zα has allowed identification of motifs common to this class of nucleic acid binding domain.
Abstract:
In this report we show that yeast expressing brome mosaic virus (BMV) replication proteins 1a and 2a and replicating a BMV RNA3 derivative can be extracted to yield a template-dependent BMV RNA-dependent RNA polymerase (RdRp) able to synthesize (-)-strand RNA from BMV (+)-strand RNA templates added in vitro. This virus-specific yeast-derived RdRp mirrored the template selectivity and other characteristics of RdRp from BMV-infected plants. Equivalent extracts from yeast expressing 1a and 2a but lacking RNA3 contained normal amounts of 1a and 2a but had no RdRp activity on BMV RNAs added in vitro. To determine which RNA3 sequences were required in vivo to yield RdRp activity, we tested deletions throughout RNA3, including the 5', 3', and intercistronic noncoding regions, which contain the cis-acting elements required for RNA3 replication in vivo. RdRp activity was obtained only from cells expressing 1a, 2a, and RNA3 derivatives retaining both 3' and intercistronic noncoding sequences. Strong correlation between extracted RdRp activity and BMV (-)-strand RNA accumulation in vivo was found for all RNA3 derivatives tested. Thus, extractable in vitro RdRp activity paralleled formation of a complex capable of viral RNA synthesis in vivo. The results suggest that assembly of active RdRp requires not only viral proteins but also viral RNA, either to directly contribute some nontemplate function or to recruit essential host factors into the RdRp complex, and that sequences at both the 3'-terminal initiation site and distant internal sites of RNA3 templates may participate in RdRp assembly and initiation of (-)-strand synthesis.
Abstract:
There are 481 segments longer than 200 base pairs (bp) that are absolutely conserved (100% identity with no insertions or deletions) between orthologous regions of the human, rat, and mouse genomes. Nearly all of these segments are also conserved in the chicken and dog genomes, with an average of 95 and 99% identity, respectively. Many are also significantly conserved in fish. These ultraconserved elements of the human genome are most often located either overlapping exons in genes involved in RNA processing or in introns or nearby genes involved in the regulation of transcription and development. Along with more than 5000 sequences of over 100 bp that are absolutely conserved among the three sequenced mammals, these represent a class of genetic elements whose functions and evolutionary origins are yet to be determined, but which are more highly conserved between these species than are proteins and appear to be essential for the ontogeny of mammals and other vertebrates.
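The definition of absolute conservation used above (100% identity with no insertions or deletions across several genomes) amounts to finding long runs of identical, gap-free columns in a multiple alignment. The following is a minimal sketch under that reading; the function name `ultraconserved_runs`, the gapped-string input, and the `longer_than` parameter are assumptions, not the authors' pipeline.

```python
def ultraconserved_runs(aligned, longer_than=200):
    """Return (start, end) column spans (0-based, half-open) where every
    aligned sequence carries the same non-gap base and the run is strictly
    longer than `longer_than` columns.

    aligned: list of equal-length alignment rows, with '-' marking gaps.
    """
    length = len(aligned[0])
    if any(len(row) != length for row in aligned):
        raise ValueError("alignment rows must have equal length")
    runs = []
    run_start = None
    # iterate one column past the end so a run reaching the end is closed
    for i in range(length + 1):
        identical = (
            i < length
            and aligned[0][i] != "-"
            and all(row[i] == aligned[0][i] for row in aligned)
        )
        if identical:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start > longer_than:
                runs.append((run_start, i))
            run_start = None
    return runs
```

With three rows standing in for human, rat, and mouse, the default threshold mirrors the "longer than 200 bp" criterion; passing `longer_than=100` corresponds to the larger set of shorter absolutely conserved sequences mentioned in the abstract.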
Abstract:
Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in intron 2, 5 kb downstream of the core BRCA1 promoter. The functionality of these elements was examined using homologous-recombination-based mutagenesis of reporter gene-tagged cosmids incorporating these regions and flanking sequences from the BRCA1 locus. This showed that CNS-1 and CNS-2 have differential transcriptional regulatory activity in epithelial cell lines. Mutation of CNS-1 significantly reduced reporter gene expression to 30% of control levels. Conversely mutation of CNS-2 increased expression to 200% of control levels. Regulation is at the level of transcription and shows promoter specificity. Both elements also specifically bind nuclear proteins in vitro. These studies demonstrate that the combination of comparative genomics and functional analysis is a successful strategy to identify novel regulatory elements and provide the first direct evidence that conserved noncoding sequences in BRCA1 regulate gene expression. (c) 2005 Elsevier Inc. All rights reserved.
Abstract:
Although MYB overexpression in colorectal cancer (CRC) is known to be a prognostic indicator for poor survival, the basis for this overexpression is unclear. Among multiple levels of MYB regulation, the most dynamic is the control of transcriptional elongation by sequences within intron I. The authors have proposed that this regulatory sequence is transcribed into an RNA stem-loop and 19-residue polyuridine tract, and is subject to mutation in CRC. When this region was examined in colorectal and breast carcinoma cell lines and tissues, the authors found frequent mutations only in CRC. It was determined that these mutations allowed increased transcription compared with the wild type sequence. These data suggest that this MYB regulatory region within intron I is subject to mutations in CRC but not breast cancer, perhaps consistent with the mutagenic insult that occurs within the colon and not mammary tissue. In CRC, these mutations may contribute to MYB overexpression, highlighting the importance of noncoding sequences in the regulation of key cancer genes. (c) 2006 Wiley-Liss, Inc.
Abstract:
Despite the presence of over 3 million transposons separated on average by approximately 500 bp, the human and mouse genomes each contain almost 1000 transposon-free regions (TFRs) over 10 kb in length. The majority of human TFRs correlate with orthologous TFRs in the mouse, despite the fact that most transposons are lineage specific. Many human TFRs also overlap with orthologous TFRs in the marsupial opossum, indicating that these regions have remained refractory to transposon insertion for long evolutionary periods. Over 90% of the bases covered by TFRs are noncoding, much of which is not highly conserved. Most TFRs are not associated with unusual nucleotide composition, but are significantly associated with genes encoding developmental regulators, suggesting that they represent extended regions of regulatory information that are largely unable to tolerate insertions, a conclusion difficult to reconcile with current conceptions of gene regulation.
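Locating transposon-free regions of the kind described above reduces to finding long gaps between annotated transposon intervals on a chromosome. The sketch below assumes 0-based, half-open coordinates and a hypothetical function name `transposon_free_regions`; it is an illustration of the idea, not the method used in the study.

```python
def transposon_free_regions(transposons, chrom_length, min_length=10_000):
    """Return (start, end) gaps strictly longer than min_length bases that
    contain no annotated transposon.

    transposons: iterable of (start, end) intervals on one chromosome,
    0-based and half-open; intervals may overlap and need not be sorted.
    """
    regions = []
    cursor = 0  # end of the rightmost transposon seen so far
    for start, end in sorted(transposons):
        if start - cursor > min_length:
            regions.append((cursor, start))
        cursor = max(cursor, end)
    # gap between the last transposon and the chromosome end
    if chrom_length - cursor > min_length:
        regions.append((cursor, chrom_length))
    return regions
```

For example, `transposon_free_regions([(500, 800), (700, 1200), (15000, 15300)], 40000)` reports the two gaps on either side of the third element, since the first gap of 500 bases is too short to qualify.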