993 results for Cis-regulatory Sequences


Relevance:

90.00%

Abstract:

The use of comparative genomics to infer genome function relies on the understanding of how different components of the genome change over evolutionary time. The aim of such comparative analysis is to identify conserved, functionally transcribed sequences such as protein-coding genes and non-coding RNA genes, and other functional sequences such as regulatory regions, as well as other genomic features. Here, we have compared the entire human chromosome 21 with syntenic regions of the mouse genome, and have identified a large number of conserved blocks of unknown function. Although previous studies have made similar observations, it is unknown whether these conserved sequences are genes or not. Here we present an extensive experimental and computational analysis of human chromosome 21 in an effort to assign function to sequences conserved between human chromosome 21 (ref. 8) and the syntenic mouse regions. Our data support the presence of a large number of potentially functional non-genic sequences, probably regulatory and structural. The integration of the properties of the conserved components of human chromosome 21 to the rapidly accumulating functional data for this chromosome will improve considerably our understanding of the role of sequence conservation in mammalian genomes.

Relevance:

90.00%

Abstract:

Structural variation, whether it is caused by copy number variants or present in a balanced form, such as reciprocal translocations and inversions, can have a profound and dramatic effect on the expression of genes mapping within and close to the rearrangement, as well as affecting others genome wide. These effects can be caused by altering the copy number of one or more genes or regulatory elements (dosage effect) or from physical disruption of links between regulatory elements and their associated gene or genes, resulting in perturbation of expression. Similarly, large-scale structural variants can result in genome-wide expression changes by altering the positions that chromosomes occupy within the nucleus, potentially disrupting not only local cis interactions, but also trans interactions that occur throughout the genome. Structural variation is, therefore, a significant factor in the study of gene expression and is discussed here in more detail.

Relevance:

90.00%

Abstract:

Assessing the contribution of promoters and coding sequences to gene evolution is an important step toward discovering the major genetic determinants of human evolution. Many specific examples have revealed the evolutionary importance of cis-regulatory regions. However, the relative contribution of regulatory and coding regions to the evolutionary process and whether systemic factors differentially influence their evolution remain unclear. To address these questions, we carried out an analysis at the genome scale to identify signatures of positive selection in human proximal promoters. Next, we examined whether genes with positively selected promoters (Prom+ genes) show systemic differences with respect to a set of genes with positively selected protein-coding regions (Cod+ genes). We found that the number of genes in each set was not significantly different (8.1% and 8.5%, respectively). Furthermore, a functional analysis showed that, in both cases, positive selection affects almost all biological processes and only a few genes of each group are located in enriched categories, indicating that promoters and coding regions are not evolutionarily specialized with respect to gene function. On the other hand, we show that the topology of the human protein network has a different influence on the molecular evolution of proximal promoters and coding regions. Notably, Prom+ genes have an unexpectedly high centrality when compared with a reference distribution (P = 0.008, for Eigenvalue centrality). Moreover, the frequency of Prom+ genes increases from the periphery to the center of the protein network (P = 0.02, for the logistic regression coefficient). This means that gene centrality does not constrain the evolution of proximal promoters, unlike the case with coding regions, and further indicates that the evolution of proximal promoters is more efficient in the center of the protein network than in the periphery. These results show that proximal promoters have had a systemic contribution to human evolution by increasing the participation of central genes in the evolutionary process.

Relevance:

90.00%

Abstract:

We extended the characterization of the DNA puff BhB10-1 gene of Bradysia hygida by showing that, although its mRNA is detected only at the end of the fourth larval instar, BhB10-1 expression is not restricted to the salivary gland, the tissue in which this gene is amplified. Different amounts of BhB10-1 mRNA were detected in other larval tissues such as gut, Malpighian tubules, fat body, brain and cuticle, suggesting that this gene is expressed differentially in the various tissues analyzed. Analysis of transgenic Drosophila carrying the BhB10-1 transcription unit and flanking sequences revealed that the tested fragment promotes transcription in a constitutive manner. We suggest that either cis-regulatory elements are missing in the transgene or factors that temporally regulate the BhB10-1 gene in B. hygida are not conserved in Drosophila.

Relevance:

90.00%

Abstract:

Transcription factors are specialized proteins that play an important role in various biological processes such as differentiation, the cell cycle and tumorigenesis. They regulate gene transcription by binding to specific DNA sequences (cis-regulatory elements). Identifying these elements is a crucial step in understanding gene regulatory networks. With the advent of high-throughput sequencing technologies, the identification of all functional elements in genomes, including genes and cis-regulatory elements, has advanced considerably. While the number of genes in various species has been estimated, information on the elements that control and orchestrate the regulation of these genes remains poorly defined. ChIP-chip and ChIP-sequencing techniques make it possible to identify all regions of the genome bound by a transcription factor of interest. Several computational approaches have been developed to predict the sites bound by transcription factors. These approaches fall into two main categories: enumerative and probabilistic algorithms. However, several studies have shown that these approaches produce high rates of false negatives and false positives, which makes the results difficult to interpret and, consequently, to validate experimentally. This thesis pursued two objectives. The first was to develop a new approach for discovering transcription factor DNA-binding sites (SAMD-ChIP) suited to ChIP-chip and ChIP-sequencing data. Our approach implements a hybrid algorithm that combines the enumerative and probabilistic strategies in order to exploit the strengths of each. It performed well compared with existing motif discovery tools on simulated data sets and on ChIP-chip and ChIP-sequencing data sets. SAMD-ChIP also has the advantage of exploiting the distribution of transcription factor binding sites around the center of bound regions, restricting prediction to motifs that are enriched in a fixed-length window around the center of these regions. Transcription factors rarely act alone. They often form complexes that interact with DNA to regulate their target genes. These interactions involve transcription factors whose DNA-binding sites are located close to one another, or are mediated by chromatin loops. Our second objective was to exploit the spatial proximity of transcription factor binding sites in ChIP-chip and ChIP-sequencing regions to develop an approach for predicting composite motifs (motifs made up of two sites separated by a spacer of fixed size). We tested this module by predicting the co-localization of the two ERE half-sites that form the ERE site bound by the estrogen receptor ERα. This module has been incorporated into our motif discovery tool, SAMD-ChIP.
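The notion of a composite motif used in this abstract (two sites separated by a spacer of fixed size) can be illustrated with a small sketch. This is not the SAMD-ChIP algorithm, which the abstract does not describe in detail; it is a plain exact-match scan, and the function name, the half-site strings and the 3-bp spacer are assumptions chosen to resemble the commonly cited palindromic ERE consensus.

```python
def find_composite_sites(sequence, left, right, spacer):
    """Return 0-based start positions where `left` is followed by `right`
    after exactly `spacer` intervening bases (exact matching only)."""
    sequence = sequence.upper()
    left = left.upper()
    right = right.upper()
    hits = []
    # Distance from the start of the left half-site to the start of the right one.
    offset = len(left) + spacer
    start = sequence.find(left)
    while start != -1:
        if sequence.startswith(right, start + offset):
            hits.append(start)
        # Continue from the next position so overlapping matches are not missed.
        start = sequence.find(left, start + 1)
    return hits


# Hypothetical example: two half-sites separated by a 3-bp spacer.
example = "ccAGGTCAcagTGACCTgg"
print(find_composite_sites(example, "AGGTCA", "TGACCT", 3))
```

A real motif finder would score degenerate matches (for example with a position weight matrix) rather than require exact strings; the exact-match version is used here only to keep the spacing logic visible.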

Relevance:

90.00%

Abstract:

The incidence and severity of light leaf spot epidemics caused by the ascomycete fungus Pyrenopeziza brassicae on UK oilseed rape crops are increasing. The disease is currently controlled by a combination of host resistance, cultural practices and fungicide applications. We report decreases in sensitivities of modern UK P. brassicae isolates to the azole (imidazole and triazole) class of fungicides. By cloning and sequencing the P. brassicae CYP51 (PbCYP51) gene, encoding the azole target sterol 14α-demethylase, we identified two non-synonymous mutations encoding substitutions G460S and S508T associated with reduced azole sensitivity. We confirmed the impact of the encoded PbCYP51 changes on azole sensitivity and protein activity by heterologous expression in a Saccharomyces cerevisiae mutant YUG37::erg11 carrying a controllable promoter of native CYP51 expression. In addition, we identified insertions in the predicted regulatory regions of PbCYP51 in isolates with reduced azole sensitivity. The presence of these insertions was associated with enhanced transcription of PbCYP51 in response to sub-inhibitory concentrations of the azole fungicide tebuconazole. Genetic analysis of in vitro crosses of sensitive and resistant isolates confirmed the impact of PbCYP51 alterations in coding and regulatory sequences on a reduced sensitivity phenotype, as well as identifying a second major gene at another locus contributing to resistance in some isolates. The least sensitive field isolates carry combinations of upstream insertions and non-synonymous mutations, suggesting PbCYP51 evolution is ongoing and the progressive decline in azole sensitivity of UK P. brassicae populations will continue. The implications for the future control of light leaf spot are discussed.

Relevance:

90.00%

Abstract:

Transcription factors (TFs) are major players in gene regulatory networks and interactions between TFs and their target genes furnish spatiotemporal patterns of gene expression. Establishing the architecture of regulatory networks requires gathering information on TFs, their targets in the genome, and the corresponding binding sites. We have developed GRASSIUS (Grass Regulatory Information Services) as a knowledge-based Web resource that integrates information on TFs and gene promoters across the grasses. In its initial implementation, GRASSIUS consists of two separate, yet linked, databases. GrassTFDB holds information on TFs from maize (Zea mays), sorghum (Sorghum bicolor), sugarcane (Saccharum spp.), and rice (Oryza sativa). TFs are classified into families and phylogenetic relationships begin to uncover orthologous relationships among the participating species. This database also provides a centralized clearinghouse for TF synonyms in the grasses. GrassTFDB is linked to the grass TFome collection, which provides clones in recombination-based vectors corresponding to full-length open reading frames for a growing number of grass TFs. GrassPROMDB contains promoter and cis-regulatory element information for those grass species and genes for which enough data are available. The integration of GrassTFDB and GrassPROMDB will be accomplished through GrassRegNet as a first step in representing the architecture of grass regulatory networks. GRASSIUS can be accessed from www.grassius.org.

Relevance:

90.00%

Abstract:

Sugarcane is important in Brazil because of its role in sugar and biofuel production. Basic research is therefore being carried out to understand its physiology and improve production. This research focuses on the base excision repair pathway, in particular the enzyme MUTM DNA glycosylase (formamidopyrimidine-DNA glycosylase), which recognizes oxidized guanine in DNA. The sugarcane scMUTM genes were analyzed using four BACs (bacterial artificial chromosomes) from a genomic library of the R570 cultivar. The results showed the presence of transposable elements in the region homologous to scMUTM. Sequence comparison revealed the highest similarity to Sorghum bicolor, at both the nucleotide and peptide levels. Furthermore, the promoter regions of MUTM genes in several grasses contained different cis-regulatory elements, most of which were related to oxidative stress, suggesting that the gene is regulated by oxidative stress.

Relevance:

90.00%

Abstract:

Genomic sequence comparison across species has enabled the elucidation of important coding and regulatory sequences encoded within DNA. Of particular interest are the noncoding regulatory sequences, which influence gene transcriptional and posttranscriptional processes. A phylogenetic footprinting strategy was employed to identify noncoding conservation patterns of 39 human and bovine orthologous genes. Seventy-three conserved noncoding sequences were identified that shared greater than 70% identity over at least 100 bp. Thirteen of these conserved sequences were also identified in the mouse genome. Evolutionary conservation of noncoding sequences across diverse species may have functional significance, and these conserved sequences may be good candidates for regulatory elements.
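The conservation criterion quoted in this abstract (greater than 70% identity over at least 100 bp) can be sketched as a sliding-window scan over a pairwise alignment. The abstract does not say which software or exact procedure was used, so the function below is only an illustration: its name, the convention that gaps are written as "-", and the choice to count a gap column as a mismatch are all assumptions.

```python
def conserved_windows(aligned_a, aligned_b, window=100, min_identity=0.70):
    """Return (start, end) column ranges of every `window`-column stretch of a
    pairwise alignment whose identity is strictly greater than `min_identity`.
    Gap columns ("-") are counted as mismatches (an assumption of this sketch)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    a = aligned_a.upper()
    b = aligned_b.upper()
    # 1 where the two sequences carry the same base, 0 otherwise.
    matches = [int(x == y and x != "-") for x, y in zip(a, b)]
    hits = []
    if len(matches) < window:
        return hits
    count = sum(matches[:window])
    for start in range(len(matches) - window + 1):
        if start > 0:
            # Slide the window by one column: add the new column, drop the old one.
            count += matches[start + window - 1] - matches[start - 1]
        if count / window > min_identity:
            hits.append((start, start + window))
    return hits


# Hypothetical example: 71 matching columns out of 100 passes the threshold.
print(conserved_windows("A" * 100, "A" * 71 + "C" * 29))
```

Overlapping windows that pass the threshold would normally be merged into a single conserved block before counting; that step is omitted here for brevity.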

Relevance:

90.00%

Abstract:

Recent studies have identified the genetic underpinnings of a growing number of diseases through targeted exome sequencing. However, this strategy ignores the large component of the genome that does not code for proteins, but is nonetheless biologically functional. To address the possible involvement of regulatory variation in congenital heart diseases (CHDs), we searched for regulatory mutations impacting the activity of TBX5, a dosage-dependent transcription factor with well-defined roles in heart and limb development that has been associated with Holt-Oram syndrome (heart-hand syndrome), a condition that affects 1/100 000 newborns. Using a combination of genomics, bioinformatics and mouse genetic engineering, we scanned approximately 700 kb of the TBX5 locus in search of cis-regulatory elements. We uncovered three enhancers that collectively recapitulate the endogenous expression pattern of TBX5 in the developing heart. We re-sequenced these enhancer elements in a cohort of non-syndromic patients with isolated atrial and/or ventricular septal defects, the predominant cardiac defects of Holt-Oram syndrome, and identified a patient with a homozygous mutation in an enhancer approximately 90 kb downstream of TBX5. Notably, we demonstrate that this single-base-pair mutation abrogates the ability of the enhancer to drive expression within the heart in vivo using both mouse and zebrafish transgenic models. Given the population-wide frequency of this variant, we estimate that 1/100 000 individuals would be homozygous for this variant, highlighting that a significant number of CHDs associated with TBX5 dysfunction might arise from non-coding mutations in TBX5 heart enhancers, effectively decoupling the heart and hand phenotypes of Holt-Oram syndrome.
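The link between a variant's population frequency and the expected proportion of homozygous individuals can be made concrete with a short calculation. The abstract does not state the allele frequency or the model used, so the sketch below assumes Hardy-Weinberg proportions (random mating, homozygote frequency equal to the square of the allele frequency) and simply works backwards from the quoted 1/100 000 figure; the function names are illustrative.

```python
import math


def allele_frequency_from_homozygotes(homozygote_frequency):
    """Allele frequency q implied by a homozygote frequency q**2,
    assuming Hardy-Weinberg proportions (an assumption of this sketch)."""
    if not 0.0 <= homozygote_frequency <= 1.0:
        raise ValueError("frequency must lie between 0 and 1")
    return math.sqrt(homozygote_frequency)


def heterozygote_frequency(q):
    """Expected heterozygote (carrier) frequency 2q(1 - q) under the same assumption."""
    return 2.0 * q * (1.0 - q)


q = allele_frequency_from_homozygotes(1 / 100_000)
print(f"implied allele frequency: {q:.5f}")
print(f"implied carrier frequency: {heterozygote_frequency(q):.5f}")
```

Under these assumptions a homozygote frequency of 1/100 000 corresponds to an allele frequency of roughly 0.3%, which shows how a variant that is individually uncommon can still account for a measurable number of homozygous cases.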

Relevance:

90.00%

Abstract:

Modern sugarcane cultivars are complex hybrids resulting from crosses among several Saccharum species. Traditional breeding methods have been employed extensively in different countries over the past decades to develop varieties with increased sucrose yield and resistance to pests and diseases. Conventional variety improvement, however, may be limited by the narrow pool of suitable genes. Thus, molecular genetics is seen as a promising tool to assist in the process of developing improved varieties. The SUCEST-FUN Project (http://sucest-fun.org) aims to associate function with sugarcane genes using a variety of tools, in particular those that enable the study of the sugarcane transcriptome. An extensive analysis has been conducted to characterise, phenotypically, sugarcane genotypes with regard to their sucrose content, biomass and drought responses. Through the analysis of different cultivars, genes associated with sucrose content, yield, lignin and drought have been identified. Currently, tools are being developed to determine signalling and regulatory networks in grasses, and to sequence the sugarcane genome, as well as to identify sugarcane promoters. This is being implemented through the SUCEST-FUN (http://sucest-fun.org) and GRASSIUS databases (http://grassius.org), the cloning of sugarcane promoters, the identification of cis-regulatory elements (CRE) using Chromatin Immunoprecipitation-sequencing (ChIP-Seq) and the generation of a comprehensive Signal Transduction and Transcription gene catalogue (SUCAST Catalogue).

Relevance:

90.00%

Abstract:

Background: One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR) is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR. Results: To understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs) and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites. Conclusion: Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.

Relevance:

90.00%

Abstract:

Neisseria meningitidis, the leading cause of bacterial meningitis, can adapt to different host niches during human infection. Both transcriptional and post-transcriptional regulatory networks have been identified as playing a crucial role in bacterial stress responses and virulence. We investigated the N. meningitidis transcriptional landscape both by microarray and by RNA sequencing (RNAseq). Microarray analysis of N. meningitidis grown in the presence or absence of glucose allowed us to identify genes regulated by carbon source availability. In particular, we identified a glucose-responsive hexR-like transcriptional regulator in N. meningitidis. Deletion analysis showed that the hexR gene is responsible for a subset of the glucose-responsive regulation, and in vitro assays with the purified protein showed that HexR binds to the promoters of the central metabolic operons of meningococcus, by targeting a DNA region overlapping putative regulatory sequences. Our results indicate that HexR coordinates the central metabolism of meningococcus in response to the availability of glucose, and N. meningitidis strains lacking the hexR gene are also deficient in establishing successful bacteremia in a mouse model of infection. In parallel, RNAseq analysis of N. meningitidis cultured under standard or iron-limiting in vitro growth conditions allowed us to identify novel small non-coding RNAs (sRNAs) potentially involved in N. meningitidis regulatory networks. Manual curation of the RNAseq data generated a list of 51 sRNAs, 8 of which were validated by Northern blotting. Deletion of selected sRNAs caused attenuation of N. meningitidis infection in a murine model, leading to the identification of the first sRNAs influencing meningococcal bacteremia. Furthermore, we describe the identification and initial characterization of a novel sRNA unique to meningococcus, closely associated with genes relevant for the intracellular survival of pathogenic Neisseriae. Taken together, our findings could help unravel the regulation of N. meningitidis adaptation to the host environment and its implications for pathogenesis.

Relevance:

90.00%

Abstract:

Imprinted genes have the distinctive property that each is expressed from only one allele, and they are usually found in imprinting clusters (ICs) in the genome. Differentially methylated imprinting control regions (ICRs) and the protein binding that takes place there play an important role in regulation within such ICs. The essential importance of CTCF binding to ICR1 in 11p15.5 for regulating expression of the imprinted genes H19 and IGF2 is already known. The aim of the present work was to demonstrate binding of Kaiso to the unmethylated ICR1 in human cells with maternal uniparental disomy of 11p15 (upd(11p15)mat) and to determine the precise binding distribution of Kaiso and CTCF in the B repeats of the control region. Cis-regulatory and cross-chromosomal transcriptional effects of ICR1 protein binding were then to be analyzed by qPCR of imprinted genes in cells with maternal and paternal upd(11p15), and after siRNA-based knockdown of both proteins in cells with upd(11p15)mat. This work shows for the first time that Kaiso binds to the unmethylated ICR1. At least one binding site in the distal half of ICR1 can be assumed to be used. For CTCF, by contrast, all analyzed repeats in both halves of ICR1 were found to be used. In maternal and paternal upd(11p15), expression of the 11p15.5 genes IGF2, H19, CDKN1C and KCNQ1OT1 corresponds to the respective disomy status. Of the imprinted genes not located on chromosome 11, MEST and PLAGL1 show stronger expression in cells with upd(11p15)pat, and PEG3 and GRB10 in cells with upd(11p15)mat. CTCF knockdown in cells with upd(11p15)mat leads to increased IGF2 expression. This effect is even stronger with Kaiso knockdown, which additionally shows increased expression of H19. Furthermore, CTCF knockdown produces an increase in MEST expression, and Kaiso knockdown increases expression of PEG3, GRB10 and PLAGL1. Thus, both independent cis-regulatory effects of ICR1 binding by the two proteins on imprinted genes of IC1 and cross-chromosomal effects can be recognized. In particular, the strong increases in H19 expression upon Kaiso knockdown correlate with changes in imprinted genes on other chromosomes. The data therefore support the theory that expression of imprinted genes may be regulated in a coordinated manner in a kind of network, in which certain factors such as H19 and PLAGL1 have a higher-level regulatory function, as has previously been described in the mouse. In addition, the expression analyses of PLAGL1 and MEST, through their broadly concordant values in the paternal upd with hypermethylated ICR1 and in the knockdowns, point to the existence of chromatin interactions between ICR1 and regions on chromosomes 6 and 7, possibly with a corresponding local effect of the proteins at these loci. Protein binding to the maternal ICR1 thus appears both to influence transcription of the imprinted genes IGF2 and H19 in cis and, through H19 expression, to regulate a functional network of imprinted genes as a trans factor, and to be responsible for interactions between different chromosomes that have a transcription-regulating effect.

Relevance:

90.00%

Abstract:

Chondrocyte gene regulation is important for the generation and maintenance of cartilage tissues. Several regulatory factors have been identified that play a role in chondrogenesis, including the positive transacting factors of the SOX family such as SOX9, SOX5, and SOX6, as well as negative transacting factors such as C/EBP and delta EF1. However, a complete understanding of the intricate regulatory network that governs the tissue-specific expression of cartilage genes is not yet available. We have taken a computational approach to identify cis-regulatory, transcription factor (TF) binding motifs in a set of cartilage characteristic genes to better define the transcriptional regulatory networks that regulate chondrogenesis. Our computational methods have identified several TFs, whose binding profiles are available in the TRANSFAC database, as important to chondrogenesis. In addition, a cartilage-specific SOX-binding profile was constructed and used to identify both known, and novel, functional paired SOX-binding motifs in chondrocyte genes. Using DNA pattern-recognition algorithms, we have also identified cis-regulatory elements for unknown TFs. We have validated our computational predictions through mutational analyses in cell transfection experiments. One novel regulatory motif, N1, found at high frequency in the COL2A1 promoter, was found to bind to chondrocyte nuclear proteins. Mutational analyses suggest that this motif binds a repressive factor that regulates basal levels of the COL2A1 promoter.