943 results for Promoter Sequences
Abstract:
PromEC is an updated compilation of Escherichia coli mRNA promoter sequences. It includes documentation on the location of experimentally identified mRNA transcriptional start sites on the E. coli chromosome, as well as the actual sequences in the promoter region. The database was updated as of July 2000 and includes 472 entries. PromEC is accessible at http://bioinfo.md.huji.ac.il/marg/promec
Abstract:
A cDNA and corresponding promoter region for a naturally occurring, feedback-insensitive anthranilate synthase (AS) α-subunit gene, ASA2, have been isolated from an unselected, but 5-methyl-tryptophan-resistant (5MTr), tobacco (Nicotiana tabacum) cell line (AB15–12-1). The ASA2 cDNA contains a putative transit peptide sequence, and Southern hybridization shows that more than one closely related sequence is present in the tobacco genome. The ASA2 cDNA complemented a trpE nonsense mutant Escherichia coli strain, allowing growth on 300 μM 5MT-containing minimal medium without tryptophan, and cell extracts contained feedback-insensitive AS activity. The 5MTr was lost when the E. coli strain was transformed with an ASA2 site-directed mutant (phenylalanine-107-arginine-108 → serine-107-glutamine-108). Identical nucleotide sequences encoding the phenylalanine-107-arginine-108 region have been found in polymerase chain reaction-amplified 326-bp ASA2 genomic fragments of wild-type (5-methyl-tryptophan-sensitive [5MTs]) tobacco and a progenitor species. High-level ASA2 transcriptional expression was detected only in 5MTr-cultured cells, not in 5MTs cells or in plants. Promoter studies indicate that tissue specificity of ASA2 is controlled by the promoter region between −2252 and −607. Since the ASA2 promoter sequences are not substantially different in the 5MTr and 5MTs lines, the increased levels of ASA2 mRNA in the 5MTr lines are most likely due to changes in a regulatory gene affecting ASA2 expression.
Abstract:
Although the control of carbon fixation and nitrogen assimilation has been studied in detail, relatively little is known about the regulation of carbon and nitrogen flow into amino acids. In this paper we report our study of the metabolic regulation of expression of an Arabidopsis aspartate kinase/homoserine dehydrogenase (AK/HSD) gene, which encodes two linked key enzymes in the biosynthetic pathway of aspartate family amino acids. Northern blot analyses, as well as expression of chimeric AK/HSD-β-glucuronidase constructs, have shown that the expression of this gene is regulated by the photosynthesis-related metabolites sucrose and phosphate but not by nitrogenous compounds. In addition, analysis of AK/HSD promoter deletions suggested that a CTTGACTCTA sequence, resembling the binding site for the yeast GCN4 transcription factor, is likely to play a functional role in the expression of this gene. Nevertheless, longer promoter fragments, lacking the GCN4-like element, were still able to confer sugar inducibility, implying that the metabolic regulation of this gene is apparently obtained by multiple and redundant promoter sequences. The present and previous studies suggest that the conversion of aspartate into either the storage amino acid asparagine or aspartate family amino acids is subject to a coordinated, reciprocal metabolic control, and this biochemical branch point is a part of a larger, coordinated regulatory mechanism of nitrogen and carbon storage and utilization.
Abstract:
The Wilms tumor suppressor gene WT1 is implicated in the ontogeny of genito-urinary abnormalities, including Denys-Drash syndrome and Wilms tumor of the kidney. WT1 encodes Kruppel-type zinc finger proteins that can regulate the expression of several growth-related genes, apparently by binding to specific DNA sites located within 5' untranslated leader regions as well as 5' promoter sequences. Both WT1 and a closely related early growth response factor, EGR1, can bind the same DNA sequences from the mouse gene encoding insulin-like growth factor 2 (Igf-2). We report that WT1, but not EGR1, can bind specific Igf-2 exonic RNA sequences, and that the zinc fingers are required for this interaction. WT1 zinc finger 1, which is not represented in EGR1, plays a more significant role in RNA binding than zinc finger 4, which does have a counterpart in EGR1. Furthermore, the normal subnuclear localization of WT1 proteins is shown to be RNase, but not DNase, sensitive. Therefore, WT1 might, like the Kruppel-type zinc finger protein TFIIIA, regulate gene expression by both transcriptional and posttranscriptional mechanisms.
Abstract:
Saccharomyces cerevisiae responds to DNA damage by arresting cell cycle progression (thereby preventing the replication and segregation of damaged chromosomes) and by inducing the expression of numerous genes, some of which are involved in DNA repair, DNA replication, and DNA metabolism. Induction of the S. cerevisiae 3-methyladenine DNA glycosylase repair gene (MAG) by DNA-damaging agents requires one upstream activating sequence (UAS) and two upstream repressing sequences (URS1 and URS2) in the MAG promoter. Sequences similar to the MAG URS elements are present in at least 11 other S. cerevisiae DNA repair and metabolism genes. Replication protein A (Rpa) is known as a single-stranded-DNA-binding protein that is involved in the initiation and elongation steps of DNA replication, nucleotide excision repair, and homologous recombination. We now show that the MAG URS1 and URS2 elements form similar double-stranded, sequence-specific, DNA-protein complexes and that both complexes contain Rpa. Moreover, Rpa appears to bind the MAG URS1-like elements found upstream of 11 other DNA repair and DNA metabolism genes. These results lead us to hypothesize that Rpa may be involved in the regulation of a number of DNA repair and DNA metabolism genes.
Abstract:
The transient expression of the retinoblastoma protein (Rb) regulates the transcription of a variety of growth-control genes, including c-fos, c-myc, and the gene for transforming growth factor beta 1 via discrete promoter sequences termed retinoblastoma control elements (RCE). Previous analyses have shown that Sp1 is one of three RCE-binding proteins identified in nuclear extracts and that Rb functionally interacts with Sp1 in vivo, resulting in the "superactivation" of Sp1-mediated transcription. By immunochemical and biochemical criteria, we report that an Sp1-related transcription factor, Sp3, is a second RCE-binding protein. Furthermore, in transient cotransfection assays, we report that Rb "superactivates" Sp3-mediated RCE-dependent transcription in vivo and that levels of superactivation are dependent on the trans-activator (Sp1 or Sp3) studied. Using expression vectors carrying mutated Rb cDNAs, we have identified two portions of Rb required for superactivation: (i) a portion of the Rb "pocket" (amino acids 614-839) previously determined to be required for physical interactions between Rb and transcription factors such as E2F-1 and (ii) a novel amino-terminal region (amino acids 140-202). Since both of these regions of Rb are targets of mutation in human tumors, our data suggest that superactivation of Sp1/Sp3 may play a role in Rb-mediated growth suppression and/or the induction of differentiation.
Abstract:
Conclusive evidence was provided that gamma 1, the upstream of the two linked simian gamma-globin loci (5'-gamma 1-gamma 2-3'), is a pseudogene in a major group of New World monkeys. Sequence analysis of PCR-amplified genomic fragments of predicted sizes revealed that all extant genera of the platyrrhine family Atelidae [Lagothrix (woolly monkeys), Brachyteles (woolly spider monkeys), Ateles (spider monkeys), and Alouatta (howler monkeys)] share a large deletion that removed most of exon 2, all of intron 2 and exon 3, and much of the 3' flanking sequence of gamma 1. The fact that two functional gamma-globin genes were not present in early ancestors of the Atelidae (and that gamma 1 was the dispensable gene) suggests that for much or even all of their evolution, platyrrhines have had gamma 2 as the primary fetally expressed gamma-globin gene, in contrast to catarrhines (e.g., humans and chimpanzees) that have gamma 1 as the primary fetally expressed gamma-globin gene. Results from promoter sequences further suggest that all three platyrrhine families (Atelidae, Cebidae, and Pitheciidae) have gamma 2 rather than gamma 1 as their primary fetally expressed gamma-globin gene. The implications of this suggestion were explored in terms of how gene redundancy, regulatory mutations, and distance of each gamma-globin gene from the locus control region were possibly involved in the acquisition and maintenance of fetal, rather than embryonic, expression.
Abstract:
Tartrate-resistant acid phosphatase (TRAP) is highly expressed in osteoclasts and in a subset of tissue macrophages and dendritic cells. It is expressed at lower levels in the parenchymal cells of the liver, glomerular mesangial cells of the kidney and pancreatic acinar cells. We have identified novel TRAP mRNAs that differ in their 5'-untranslated region (5'-UTR) sequence, but align with the known murine TRAP mRNA from the first base of Exon 2. The novel 5'-UTRs represent alternative first exons located upstream of the known 5'-UTR. A similar genomic structure exists for the human TRAP gene with partial conservation of the exon and promoter sequences. Expression of the most distal 5'-UTR (Exon 1A) is restricted to adult bone and spleen tissue. Exon 1B is expressed primarily in tissues containing TRAP-positive nonhaematopoietic cells. The known TRAP 5'-UTR (Exon 1) is expressed in tissues characteristic of myeloid cell expression. In addition, the Exon 1C promoter sequence is shown to comprise distinct transcription start regions, with an osteoclast-specific transcription initiation site identified downstream of a TATA-like element. Macrophages are shown to initiate transcription of the Exon 1C transcript from a purine-rich region located upstream of the osteoclast-specific transcription start point. The distinct expression patterns for each of the TRAP 5'-UTRs suggest that TRAP mRNA expression is regulated by the use of four alternative tissue- and cell-restricted promoters. © 2003 Elsevier Science B.V. All rights reserved.
Abstract:
Background: The multitude of motif detection algorithms developed to date have largely focused on the detection of patterns in primary sequence. Since sequence-dependent DNA structure and flexibility may also play a role in protein-DNA interactions, the simultaneous exploration of sequence- and structure-based hypotheses about the composition of binding sites and the ordering of features in a regulatory region should be considered as well. The consideration of structural features requires the development of new detection tools that can deal with data types other than primary sequence. Results: GANN (available at http://bioinformatics.org.au/gann) is a machine learning tool for the detection of conserved features in DNA. The software suite contains programs to extract different regions of genomic DNA from flat files and convert these sequences to indices that reflect sequence and structural composition or the presence of specific protein binding sites. The machine learning component allows the classification of different types of sequences based on subsamples of these indices, and can identify the best combinations of indices and machine learning architecture for sequence discrimination. Another key feature of GANN is the replicated splitting of data into training and test sets, and the implementation of negative controls. In validation experiments, GANN successfully merged important sequence and structural features to yield good predictive models for synthetic and real regulatory regions. Conclusion: GANN is a flexible tool that can search through large sets of sequence and structural feature combinations to identify those that best characterize a set of sequences.
Abstract:
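The replicated train/test splitting and negative controls mentioned in this abstract can be illustrated with a minimal sketch. It is not GANN's code: the function names `replicated_splits` and `shuffled_labels`, the default of five replicates, the 25% test fraction, and the use of label shuffling as the negative control are all assumptions made for illustration.

```python
import random

def replicated_splits(n_items, n_replicates=5, test_fraction=0.25, seed=0):
    """Yield (train_idx, test_idx) pairs, one per independent random shuffle.

    Hypothetical helper: replicate count and test fraction are illustrative.
    """
    rng = random.Random(seed)
    n_test = max(1, int(round(n_items * test_fraction)))
    for _ in range(n_replicates):
        idx = list(range(n_items))
        rng.shuffle(idx)
        # First n_test shuffled items form the test set, the rest the training set.
        yield sorted(idx[n_test:]), sorted(idx[:n_test])

def shuffled_labels(labels, seed=0):
    """Assumed negative control: keep the label counts, randomize the assignment."""
    rng = random.Random(seed)
    out = list(labels)
    rng.shuffle(out)
    return out
```

A model trained on the shuffled labels should perform no better than chance on each test set; if it does, the apparent signal in the real data is suspect.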
Background: Current methods to find significantly under- and over-represented gene ontology (GO) terms in a set of genes consider the genes as equally probable balls in a bag, as may be appropriate for transcripts in micro-array data. However, due to the varying length of genes and intergenic regions, that approach is inappropriate for deciding if any GO terms are correlated with a set of genomic positions. Results: We present an algorithm - GONOME - that can determine which GO terms are significantly associated with a set of genomic positions given a genome annotated with (at least) the starts and ends of genes. We show that certain GO terms may appear to be significantly associated with a set of randomly chosen positions in the human genome if gene lengths are not considered, and that these same terms have been reported as significantly over-represented in a number of recent papers. This apparent over-representation disappears when gene lengths are considered, as GONOME does. For example, we show that, when gene length is taken into account, the term 'development' is not significantly enriched in genes associated with human CpG islands, in contradiction to a previous report. We further demonstrate the efficacy of GONOME by showing that occurrences of the proteasome-associated control element (PACE) upstream activating sequence in the S. cerevisiae genome associate significantly with appropriate GO terms. An extension of this approach yields a whole-genome motif discovery algorithm that allows identification of many other promoter sequences linked to different types of genes, including a large group of previously unknown motifs significantly associated with the terms 'translation' and 'translational elongation'. Conclusion: GONOME is an algorithm that correctly extracts over-represented GO terms from a set of genomic positions. By explicitly considering gene size, GONOME avoids a systematic bias toward GO terms linked to large genes. Inappropriate use of existing algorithms that do not take gene size into account has led to erroneous or suspect conclusions. Reciprocally, GONOME may be used to identify new features in genomes that are significantly associated with particular categories of genes.
Abstract:
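The gene-length bias described in this abstract can be made concrete with a small sketch. This is a simplified illustration and not the GONOME algorithm itself: it assumes non-overlapping genes given as half-open `(start, end, terms)` tuples, ignores intergenic assignment rules, and uses a plain binomial tail as the significance test. The function names are hypothetical.

```python
from math import comb

def term_hit_probability(genes, term, genome_length):
    """Chance that a uniformly random position lands in a gene carrying `term`.

    genes: list of (start, end, set_of_terms), half-open and assumed
    non-overlapping (an assumption of this sketch).
    """
    covered = sum(end - start for start, end, terms in genes if term in terms)
    return covered / genome_length

def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p): the chance of k or more hits among n positions."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))
```

With one 100-bp gene and one 900-bp gene in a 1000-bp genome, a balls-in-a-bag model gives each gene's term a probability of 0.5, whereas the length-aware probabilities are 0.1 and 0.9. Terms attached to long genes are therefore expected to be hit far more often by random positions.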
The fungal pathogen Candida albicans causes serious nosocomial infections in patients, in part due to the formation of drug-resistant biofilms. Protein kinases (PK) and transcription factors (TF) mediate signal transduction and transcription of proteins involved in biofilm development. To discover biofilm-related PKs, a collection of 63 C. albicans PK mutants was screened twice independently with a microtiter plate-based biofilm assay (XTT). Thirty-eight (60%) mutants showed different degrees of biofilm impairment, with the poor biofilm formers additionally possessing filamentation defects. Most of these genes were already known to encode proteins associated with Candida morphology and biofilms, but VPS15, PKH3, PGA43, IME2 and CEX1 were associated with both processes for the first time in this study. Previous studies by Holcombe et al. (2010) had shown that the bacterial pathogen Pseudomonas aeruginosa can impair C. albicans filamentation and biofilm development. To investigate their interaction, the good biofilm former PK mutants of C. albicans were assessed for their response to P. aeruginosa supernatants derived from two strains, wildtype PAO1 and the homoserine lactone (HSL)-free mutant ΔQS, without finding any nonresponsive mutants. This suggested that none of the PKs in this study was implicated in Candida-Pseudomonas signaling. To screen promoter sequences for overrepresented TFs across C. albicans gene sets significantly up- or downregulated in the presence of bacterial supernatants in the Holcombe et al. (2010) study, the TFbsST database was created online. The TFbsST database integrates experimentally verified TFs of Candida to analyse promoter sequences for TF binding sites. In silico studies predicted that Efg1p was overrepresented in C. albicans and C. parapsilosis RBT family genes.
Abstract:
BACKGROUND: Integrons are found in hundreds of environmental bacterial species, but are mainly known as the agents responsible for the capture and spread of antibiotic-resistance determinants between Gram-negative pathogens. The SOS response is a regulatory network under control of the repressor protein LexA targeted at addressing DNA damage, thus promoting genetic variation in times of stress. We recently reported a direct link between the SOS response and the expression of integron integrases in Vibrio cholerae and a plasmid-borne class 1 mobile integron. SOS regulation enhances cassette swapping and capture in stressful conditions, while freezing the integron in steady environments. We conducted a systematic study of available integron integrase promoter sequences to analyze the extent of this relationship across the Bacteria domain. RESULTS: Our results showed that LexA controls the expression of a large fraction of integron integrases by binding to Escherichia coli-like LexA binding sites. In addition, the results provide experimental validation of LexA control of the integrase gene for another Vibrio chromosomal integron and for a multiresistance plasmid harboring two integrons. There was a significant correlation between lack of LexA control and predicted inactivation of integrase genes, even though experimental evidence also indicates that LexA regulation may be lost to enhance expression of integron cassettes. CONCLUSIONS: Ancestral-state reconstruction on an integron integrase phylogeny led us to conclude that the ancestral integron was already regulated by LexA. The data also indicated that SOS regulation has been actively preserved in mobile integrons and large chromosomal integrons, suggesting that unregulated integrase activity is selected against. Nonetheless, additional adaptations have probably arisen to cope with unregulated integrase activity. Identifying them may be fundamental in deciphering the uneven distribution of integrons in the Bacteria domain.
Abstract:
Cytosine- and guanine-rich regions of DNA are capable of forming complex structures named i-motifs and G-quadruplexes, respectively. In the present study, the solution equilibria at nearly physiological conditions of a 34-base-long cytosine-rich sequence and its complementary guanine-rich strand, corresponding to the first intron of the n-myc gene, were studied. Both sequences, not yet studied, contain a 12-base tract capable of forming stable hairpins inside the i-motif and G-quadruplex structures, respectively ...
Abstract:
The pattern of expression of the proα2(I) collagen gene is highly tissue-specific in adult mice and shows its strongest expression in bones, tendons, and skin. Transgenic mice were generated harboring promoter fragments of the mouse proα2(I) collagen gene linked to the Escherichia coli β-galactosidase or firefly luciferase genes to examine the activity of these promoters during development. A region of the mouse proα2(I) collagen promoter between −2000 and +54 exhibited a pattern of β-galactosidase activity during embryonic development that corresponded to the expression pattern of the endogenous proα2(I) collagen gene as determined by in situ hybridization. A similar pattern of activity was also observed with much smaller promoter fragments containing either 500 or 350 bp of upstream sequence relative to the start of transcription. Embryonic regions expressing high levels of β-galactosidase activity included the valves of the developing heart, sclerotomes, meninges, limb buds, connective tissue fascia between muscle fibers, osteoblasts, tendon, periosteum, dermis, and peritoneal membranes. The pattern of β-galactosidase activity was similar to the extracellular immunohistochemical localization of transforming growth factor-β1 (TGF-β1). The −315 to −284 region of the proα2(I) collagen promoter was previously shown to mediate the stimulatory effects of TGF-β1 on the proα2(I) collagen promoter in DNA transfection experiments with cultured fibroblasts. A construct containing this sequence tandemly repeated 5′ to both a very short α2(I) collagen promoter (−40 to +54) and a heterologous minimal promoter showed preferential activity in tail and skin of 4-week-old transgenic mice. The pattern of expression mimics that of the −350 to +54 proα2(I) collagen promoter linked to a luciferase reporter gene in transgenic mice.
Abstract:
The chi63 promoter directs glucose-sensitive, chitin-dependent transcription of a gene involved in the utilization of chitin as carbon source. Analysis of 5′ and 3′ deletions of the promoter region revealed that a 350-bp segment is sufficient for wild-type levels of expression and regulation. The analysis of single base changes throughout the promoter region, introduced by random and site-directed mutagenesis, identified several sequences to be important for activity and regulation. Single base changes at −10, −12, −32, −33, −35, and −37 upstream of the transcription start site resulted in loss of activity from the promoter, suggesting that bases in these positions are important for RNA polymerase interaction. The sequences centered around −10 (TATTCT) and −35 (TTGACC) in this promoter are, in fact, prototypical of eubacterial promoters. Overlapping the RNA polymerase binding site is a perfect 12-bp direct repeat sequence. Some base changes within this direct repeat resulted in constitutive expression, suggesting that this sequence is an operator for negative regulation. Other base changes resulted in loss of glucose repression while retaining the requirement for chitin induction, suggesting that this sequence is also involved in glucose repression. The fact that cis-acting mutations resulted in glucose resistance but not inducer independence rules out the possibility that glucose repression acts exclusively by inducer exclusion. The fact that mutations that affect glucose repression and chitin induction fall within the same direct repeat sequence module suggests that the direct repeat sequence facilitates both chitin induction and glucose repression.
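As a rough illustration of how the −35 (TTGACC) and −10 (TATTCT) hexamers reported in this abstract could be located in a sequence, the sketch below scans for a −35 box followed by a −10 box at a fixed range of spacer lengths. The function name `find_promoter_pairs`, the exact-match requirement, and the 15 to 19 bp spacer range are assumptions of this sketch and do not come from the abstract.

```python
def find_promoter_pairs(seq, box35="TTGACC", box10="TATTCT", spacers=range(15, 20)):
    """Return (pos35, pos10, spacer) for each exact -35/-10 hexamer pair.

    Positions are 0-based offsets into `seq`. The spacer range is an
    illustrative assumption, not a value taken from the chi63 study.
    """
    seq = seq.upper()
    hits = []
    start = seq.find(box35)
    while start != -1:
        for gap in spacers:
            j = start + len(box35) + gap
            # Slicing past the end of the string simply yields a shorter string,
            # so no explicit bounds check is needed here.
            if seq[j:j + len(box10)] == box10:
                hits.append((start, j, gap))
        start = seq.find(box35, start + 1)
    return hits
```

For example, on a made-up sequence consisting of `GG`, the −35 box, seventeen `A` residues, the −10 box, and `GG`, the function reports a single pair with a 17 bp spacer. Real promoters often deviate from exact hexamers, so a practical scan would allow mismatches or use a position weight matrix.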