965 resultados para Genes Regulatory Sequences
Resumo:
The objective of this work was to identify expressed simple sequence repeats (SSR) markers associated to leaf miner resistance in coffee progenies. Identification of SSR markers was accomplished by directed searches on the Brazilian Coffee Expressed Sequence Tags (EST) database. Sequence analysis of 32 selected SSR loci showed that 65% repeats are of tetra-, 21% of tri- and 14% of dinucleotides. Also, expressed SSR are localized frequently in the 5'-UTR of gene transcript. Moreover, most of the genes containing SSR are associated with defense mechanisms. Polymorphisms were analyzed in progenies segregating for resistance to the leaf miner and corresponding to advanced generations of a Coffea arabica x Coffea racemosa hybrid. Frequency of SSR alleles was 2.1 per locus. However, no polymorphism associated with leaf miner resistance was identified. These results suggest that marker-assisted selection in coffee breeding should be performed on the initial cross, in which genetic variability is still significant.
Resumo:
The Caulobacter DNA methyltransferase CcrM is one of five master cell-cycle regulators. CcrM is transiently present near the end of DNA replication when it rapidly methylates the adenine in hemimethylated GANTC sequences. The timing of transcription of two master regulator genes and two cell division genes is controlled by the methylation state of GANTC sites in their promoters. To explore the global extent of this regulatory mechanism, we determined the methylation state of the entire chromosome at every base pair at five time points in the cell cycle using single-molecule, real-time sequencing. The methylation state of 4,515 GANTC sites, preferentially positioned in intergenic regions, changed progressively from full to hemimethylation as the replication forks advanced. However, 27 GANTC sites remained unmethylated throughout the cell cycle, suggesting that these protected sites could participate in epigenetic regulatory functions. An analysis of the time of activation of every cell-cycle regulatory transcription start site, coupled to both the position of a GANTC site in their promoter regions and the time in the cell cycle when the GANTC site transitions from full to hemimethylation, allowed the identification of 59 genes as candidates for epigenetic regulation. In addition, we identified two previously unidentified N(6)-methyladenine motifs and showed that they maintained a constant methylation state throughout the cell cycle. The cognate methyltransferase was identified for one of these motifs as well as for one of two 5-methylcytosine motifs.
Resumo:
Genomic clones containing the Xenopus laevis vitellogenin gene B1 have been isolated from DNA libraries and characterized by heteroduplex mapping in the electron microscope, restriction endonuclease analysis, and in vitro transcription in a HeLa whole-cell extract. Sequences from the 3'-flanking region of the previously isolated A1 vitellogenin gene were found in the 5'-flanking region of this B1 gene. Thus, the two genes are linked, with 15.5 kilobase pairs of DNA between them. Their length is about 22 kilobase pairs (A1 gene) and 16.5 kilobase pairs (B1 gene) and they have the following arrangement: 5'-A1 gene-spacer-B1 gene-3'. The analysis of heteroduplexes formed between the two genes revealed several regions of homology. Both genes are in the same orientation and, therefore, are transcribed from the same DNA strand. The possible events by which the vitellogenin gene family arose in Xenopus laevis are discussed.
Resumo:
Phosphate is a crucial and often limiting nutrient for plant growth. To obtain inorganic phosphate (P(i) ), which is very insoluble, and is heterogeneously distributed in the soil, plants have evolved a complex network of morphological and biochemical processes. These processes are controlled by a regulatory system triggered by P(i) concentration, not only present in the medium (external P(i) ), but also inside plant cells (internal P(i) ). A 'split-root' assay was performed to mimic a heterogeneous environment, after which a transcriptomic analysis identified groups of genes either locally or systemically regulated by P(i) starvation at the transcriptional level. These groups revealed coordinated regulations for various functions associated with P(i) starvation (including P(i) uptake, P(i) recovery, lipid metabolism, and metal uptake), and distinct roles for members in gene families. Genetic tools and physiological analyses revealed that genes that are locally regulated appear to be modulated mostly by root development independently of the internal P(i) content. By contrast, internal P(i) was essential to promote the activation of systemic regulation. Reducing the flow of P(i) had no effect on the systemic response, suggesting that a secondary signal, independent of P(i) , could be involved in the response. Furthermore, our results display a direct role for the transcription factor PHR1, as genes systemically controlled by low P(i) have promoters enriched with P1BS motif (PHR1-binding sequences). These data detail various regulatory systems regarding P(i) starvation responses (systemic versus local, and internal versus external P(i) ), and provide tools to analyze and classify the effects of P(i) starvation on plant physiology.
Resumo:
In vertebrates, different isoforms of fibroblast growth factor 2 (FGF2) exist, which differ by their N-terminal extension. They show different localization and expression levels and exert distinct biological effects. Nevertheless, genetic inactivation of all FGF2 isoforms in the mouse results in only mild phenotypes. Here, we analyzed mouse FGF2, and show that, as in the human, mouse FGF2 contains CTG-initiated high molecular-weight (HMW) isoforms, which contain a nuclear localization signal, and which mediate localization of this isoform to the nucleus. Using green fluorescent protein-FGF2 fusions, we furthermore observed, that C-terminal deletions disable nuclear localization of the short low-molecular-weight (LMW) 18-kDa isoform. This loss of specific localization is accompanied by a loss in heparin binding. We therefore suggest that, first, localization of mouse FGF2 is comparable to that in other vertebrates and, second, FGF2 contains at least two sequences important for nuclear localization, a nuclear localization sequence at the N terminus which is only contained in the HMW isoform, and another sequence at the C terminus, which is only required for localization of the LMW 18-kDa isoform.
Resumo:
MOTIVATION: The analysis of molecular coevolution provides information on the potential functional and structural implication of positions along DNA sequences, and several methods are available to identify coevolving positions using probabilistic or combinatorial approaches. The specific nucleotide or amino acid profile associated with the coevolution process is, however, not estimated, but only known profiles, such as the Watson-Crick constraint, are usually considered a priori in current measures of coevolution. RESULTS: Here, we propose a new probabilistic model, Coev, to identify coevolving positions and their associated profile in DNA sequences while incorporating the underlying phylogenetic relationships. The process of coevolution is modeled by a 16 × 16 instantaneous rate matrix that includes rates of transition as well as a profile of coevolution. We used simulated, empirical and illustrative data to evaluate our model and to compare it with a model of 'independent' evolution using Akaike Information Criterion. We showed that the Coev model is able to discriminate between coevolving and non-coevolving positions and provides better specificity and specificity than other available approaches. We further demonstrate that the identification of the profile of coevolution can shed new light on the process of dependent substitution during lineage evolution.
Resumo:
Advancements in high-throughput technologies to measure increasingly complex biological phenomena at the genomic level are rapidly changing the face of biological research from the single-gene single-protein experimental approach to studying the behavior of a gene in the context of the entire genome (and proteome). This shift in research methodologies has resulted in a new field of network biology that deals with modeling cellular behavior in terms of network structures such as signaling pathways and gene regulatory networks. In these networks, different biological entities such as genes, proteins, and metabolites interact with each other, giving rise to a dynamical system. Even though there exists a mature field of dynamical systems theory to model such network structures, some technical challenges are unique to biology such as the inability to measure precise kinetic information on gene-gene or gene-protein interactions and the need to model increasingly large networks comprising thousands of nodes. These challenges have renewed interest in developing new computational techniques for modeling complex biological systems. This chapter presents a modeling framework based on Boolean algebra and finite-state machines that are reminiscent of the approach used for digital circuit synthesis and simulation in the field of very-large-scale integration (VLSI). The proposed formalism enables a common mathematical framework to develop computational techniques for modeling different aspects of the regulatory networks such as steady-state behavior, stochasticity, and gene perturbation experiments.
Resumo:
Background: To determine whether misalignment structures such as duplications, repeats, and palindromes are associated to insertions/deletions (indels) in gp120, indicating that indels are indeed frameshift mutations generated by DNA misalignment mechanism. Methods: Cloning and sequencing of a fragment of HIV-1 gp120 spanning C2-C4 derived from plasma RNA in 12 patients with early chronic disease and naïve to antiretroviral therapy. Results: Indels in V4 involved always insertion and deletion of duplicated nucleotide segments, and AAT repeats, and were associated to the presence of palindromic sequences. No duplications were detected in V3 and C3. Palindromic sequences occurred with similar frequencies in V3, C3 and V4; the frequency of palindromes in individual genes was found to be significantly higher in structural (gp120, p ≤ 3.00E-7) and significantly lower in regulatory (Tat, p ≤ 9.00E-7) genes, as compared to the average frequency calculated over the full genome. Discussion: Indels in V4 are associated to misalignment structures (i.e. duplications repeat and palindromes) indicating DNA misalignment as the mechanism underlying length variation in V4. The finding that indels in V4 are caused by DNA misalignment has some very important implications: 1) indels in V4 are likely to occur in proviral DNA (and not in RNA), after integration of HIV into the host genome; 2) they are likely to occur as progressive modifications of the early founder virus during chronic infection, as more and more cells get infected; 3) frameshift mutations involving any number of base pairs are likely to occur evenly across gp120; however, only those mutants carrying a functional gp120 (indels as multiples of three base pairs) will be able to perpetuate the virus cycle and to keep spreading through the population.
Mutational screening of splicing factor genes in cases with autosomal dominant retinitis pigmentosa.
Resumo:
PURPOSE: Mutations in genes encoding proteins from the tri-snRNP complex of the spliceosome account for more than 12% of cases of autosomal dominant retinitis pigmentosa (adRP). Although the exact mechanism by which splicing factor defects trigger photoreceptor death is not completely clear, their role in retinitis pigmentosa has been demonstrated by several genetic and functional studies. To test for possible novel associations between splicing factors and adRP, we screened four tri-snRNP splicing factor genes (EFTUD2, PRPF4, NHP2L1, and AAR2) as candidate disease genes. METHODS: We screened up to 303 patients with adRP from Europe and North America who did not carry known RP mutations. Exon-PCR and Sanger methods were used to sequence the NHP2L1 and AAR2 genes, while the sequences of EFTUD2 and PRPF4 were obtained by using long-range PCRs spanning coding and non-coding regions followed by next-generation sequencing. RESULTS: We detected novel missense changes in individual patients in the sequence of the genes PRPF4 and EFTUD2, but the role of these changes in relationship to disease could not be verified. In one other patient we identified a novel nucleotide substitution in the 5' untranslated region (UTR) of NHP2L1, which did not segregate with the disease in the family. CONCLUSIONS: The absence of clearly pathogenic mutations in the candidate genes screened in our cohort suggests that EFTUD2, PRPF4, NHP2L1, and AAR2 are either not involved in adRP or are associated with the disease in rare instances, at least as observed in this study in patients of European and North American origin.
Resumo:
The objective of this work was to estimate the allele polymorphism frequencies of genes in Nellore cattle and associate them with meat quality and carcass traits. Six hundred males were genotyped for the following polymorphisms: DGAT1 (VNTR with 18 nucleotides at the promoter region); ANK1, a new polymorphism, identified and mapped here at the gene regulatory region NW_001494427.3; TCAP (AY428575.1:g.346G>A); and MYOG (NW_001501985:g.511G>C). In the association study, phenotype data of hot carcass weight, ribeye area, backfat thickness, percentage of intramuscular fat, shear force, myofibrillar fragmentation index, meat color (L*, a*, b*), and cooking losses were used. Allele B from the ANK1 gene was associated with greater redness (a*). Alleles 5R, 6R, and 7R from the DGAT1 VNTR gene were associated with increased intramuscular fat, reduced cooking losses and increased ribeye area, respectively. The single nucleotide polymorphism (SNP) of the TCAP gene was not polymorphic, and MYOG alleles were not associated with any of the evaluated characteristics. These results indicate that ANK1 and DGAT1 genes can be used in the selection of Nellore cattle for carcass and meat quality.
Resumo:
The biosynthetic genes pchDCBA and pchEF, which are known to be required for the formation of the siderophore pyochelin and its precursors salicylate and dihydroaeruginoate (Dha), are clustered with the pchR regulatory gene on the chromosome of Pseudomonas aeruginosa. The 4.6-kb region located downstream of the pchEF genes was found to contain three additional, contiguous genes, pchG, pchH, and pchI, probably forming a pchEFGHI operon. The deduced amino acid sequences of PchH and PchI are similar to those of ATP binding cassette transport proteins with an export function. PchG is a homolog of the Yersinia pestis and Y. enterocolitica proteins YbtU and Irp3, which are involved in the biosynthesis of yersiniabactin. A null mutation in pchG abolished pyochelin formation, whereas mutations in pchH and pchI did not affect the amounts of salicylate, Dha, and pyochelin produced. The pyochelin biosynthetic genes were expressed from a vector promoter, uncoupling them from Fur-mediated repression by iron and PchR-dependent induction by pyochelin. In a P. aeruginosa mutant lacking the entire pyochelin biosynthetic gene cluster, the expressed pchDCBA and pchEFG genes were sufficient for salicylate, Dha, and pyochelin production. Pyochelin formation was also obtained in the heterologous host Escherichia coli expressing pchDCBA and pchEFG together with the E. coli entD gene, which provides a phosphopantetheinyl transferase necessary for PchE and PchF activation. The PchG protein was purified and used in combination with PchD and phosphopantetheinylated PchE and PchF in vitro to produce pyochelin from salicylate, L-cysteine, ATP, NADPH, and S-adenosylmethionine. Based on this assay, a reductase function was attributed to PchG. In summary, this study completes the identification of the biosynthetic genes required for pyochelin formation from chorismate in P. aeruginosa.
Resumo:
Many genes are regulated as an innate part of the eukaryotic cell cycle, and a complex transcriptional network helps enable the cyclic behavior of dividing cells. This transcriptional network has been studied in Saccharomyces cerevisiae (budding yeast) and elsewhere. To provide more perspective on these regulatory mechanisms, we have used microarrays to measure gene expression through the cell cycle of Schizosaccharomyces pombe (fission yeast). The 750 genes with the most significant oscillations were identified and analyzed. There were two broad waves of cell cycle transcription, one in early/mid G2 phase, and the other near the G2/M transition. The early/mid G2 wave included many genes involved in ribosome biogenesis, possibly explaining the cell cycle oscillation in protein synthesis in S.pombe. The G2/M wave included at least three distinctly regulated clusters of genes: one large cluster including mitosis, mitotic exit, and cell separation functions, one small cluster dedicated to DNA replication, and another small cluster dedicated to cytokinesis and division. S. pombe cell cycle genes have relatively long, complex promoters containing groups of multiple DNA sequence motifs, often of two, three, or more different kinds. Many of the genes, transcription factors, and regulatory mechanisms are conserved between S. pombe and S. cerevisiae. Finally, we found preliminary evidence for a nearly genome-wide oscillation in gene expression: 2,000 or more genes undergo slight oscillations in expression as a function of the cell cycle, although whether this is adaptive, or incidental to other events in the cell, such as chromatin condensation, we do not know.
Resumo:
Background: The main goal of the present study was to analyse the genetic architecture of mRNA expression in muscle, a tissue with an outmost economic importance for pig breeders. Previous studies have used F2 crosses to detect porcine expression QTL (eQTL), so they contributed with data that mostly represents the between-breed component of eQTL variation. Herewith, we have analysed eQTL segregation in an outbred Duroc population using two groups of animals with divergent fatness profiles. This approach is particularly suitable to analyse the within-breed component of eQTL variation, with a special emphasis on loci involved in lipid metabolism. Methodology/Principal Findings: GeneChip Porcine Genome arrays (Affymetrix) were used to determine the mRNA expression levels of gluteus medius samples from 105 Duroc barrows. A whole-genome eQTL scan was carried out with a panel of 116 microsatellites. Results allowed us to detect 613 genome-wide significant eQTL unevenly distributed across the pig genome. A clear predominance of trans- over cis-eQTL, was observed. Moreover, 11 trans-regulatory hotspots affecting the expression levels of four to 16 genes were identified. A Gene Ontology study showed that regulatory polymorphisms affected the expression of muscle development and lipid metabolism genes. A number of positional concordances between eQTL and lipid trait QTL were also found, whereas limited evidence of a linear relationship between muscle fat deposition and mRNA levels of eQTL regulated genes was obtained. Conclusions/Significance: Our data provide substantial evidence that there is a remarkable amount of within-breed genetic variation affecting muscle mRNA expression. Most of this variation acts in trans and influences biological processes related with muscle development, lipid deposition and energy balance. The identification of the underlying causal mutations and the ascertainment of their effects on phenotypes would allow gaining a fundamental perspective about how complex traits are built at the molecular level.
Resumo:
Protein-coding genes evolve at different rates, and the influence of different parameters, from gene size to expression level, has been extensively studied. While in yeast gene expression level is the major causal factor of gene evolutionary rate, the situation is more complex in animals. Here we investigate these relations further, especially taking in account gene expression in different organs as well as indirect correlations between parameters. We used RNA-seq data from two large datasets, covering 22 mouse tissues and 27 human tissues. Over all tissues, evolutionary rate only correlates weakly with levels and breadth of expression. The strongest explanatory factors of purifying selection are GC content, expression in many developmental stages, and expression in brain tissues. While the main component of evolutionary rate is purifying selection, we also find tissue-specific patterns for sites under neutral evolution and for positive selection. We observe fast evolution of genes expressed in testis, but also in other tissues, notably liver, which are explained by weak purifying selection rather than by positive selection.
Resumo:
The key information processing units within gene regulatory networks are enhancers. Enhancer activity is associated with the production of tissue-specific noncoding RNAs, yet the existence of such transcripts during cardiac development has not been established. Using an integrated genomic approach, we demonstrate that fetal cardiac enhancers generate long noncoding RNAs (lncRNAs) during cardiac differentiation and morphogenesis. Enhancer expression correlates with the emergence of active enhancer chromatin states, the initiation of RNA polymerase II at enhancer loci and expression of target genes. Orthologous human sequences are also transcribed in fetal human hearts and cardiac progenitor cells. Through a systematic bioinformatic analysis, we identified and characterized, for the first time, a catalog of lncRNAs that are expressed during embryonic stem cell differentiation into cardiomyocytes and associated with active cardiac enhancer sequences. RNA-sequencing demonstrates that many of these transcripts are polyadenylated, multi-exonic long noncoding RNAs. Moreover, knockdown of two enhancer-associated lncRNAs resulted in the specific downregulation of their predicted target genes. Interestingly, the reactivation of the fetal gene program, a hallmark of the stress response in the adult heart, is accompanied by increased expression of fetal cardiac enhancer transcripts. Altogether, these findings demonstrate that the activity of cardiac enhancers and expression of their target genes are associated with the production of enhancer-derived lncRNAs.