993 results for conserved noncoding sequence
Abstract:
The globin gene family of Xenopus laevis comprises pairs of closely related genes that are arranged in two clusters, each pair of genes being co-ordinately and stage-specifically expressed. To get information on putative regulatory elements, we compared the DNA sequences and the chromatin conformation 5' to the co-ordinately expressed adult alpha-globin genes. Sequence analysis revealed a relatively conserved region from the cap site up to position -289, and further upstream seven distinct boxes of homology, separated by more diverged sequences or deletions/insertions. The homology boxes comprise 22 to 194 base-pairs showing 78 to 95% homology. Analysis of chromatin conformation showed that DNase I preferentially cuts the upstream region of both genes at similar positions, 5' to the T-A-T-A and the C-C-A-A-T boxes, only in chromatin of adult erythroblasts and erythrocytes, where adult globin genes are expressed, but not in chromatin of adult liver cells or larval erythrocytes, where these genes are silent. This suggests that cell- and stage-specific activation of these genes coincides with specific changes in chromatin conformation within the proximal upstream region. No difference was found in the nucleotide sequence within the DNase I hypersensitive region proximal to the adult alpha 1-globin gene in DNA from embryonic cells, in which this gene is inactive, and adult erythrocytes, expressing this gene.
Abstract:
We have studied the requirements for efficient histone-specific RNA 3' processing in nuclear extract from mammalian tissue culture cells. Processing is strongly impaired by mutations in the pre-mRNA spacer element that reduce the base-pairing potential with U7 RNA. Moreover, by exchanging the hairpin and spacer elements of two differently processed H4 genes, we find that this difference is exclusively due to the spacer element. Finally, processing is inhibited by the addition of competitor RNAs, if these contain a wild-type spacer sequence, but not if their spacer element is mutated. Conversely, the importance of the hairpin for histone RNA 3' processing is highly variable: A hairpin mutant of the H4-12 gene is processed with almost wild-type efficiency in extract from K21 mouse mastocytoma cells but is strongly affected in HeLa cell extract, whereas an identical hairpin mutant of the H4-1 gene is affected in both extracts. The hairpin defect of H4-12-specific RNA in HeLa cells can be overcome by a compensatory mutation that increases the base complementarity to U7 snRNA. Very similar results were also obtained in RNA competition experiments: processing of H4-12-specific RNA can be competed by RNA carrying a wild-type hairpin element in extract from HeLa, but not K21 cells, whereas processing of H4-1-specific RNA can be competed in both extracts. With two additional histone genes we obtained results that were in one case intermediate and in the other similar to those obtained with H4-1. These results suggest that hairpin binding factor(s) can cooperatively support the ability of U7 snRNPs to form an active processing complex, but is(are) not directly involved in the processing mechanism.
Abstract:
Vertebrate limbs grow out from the flanks of embryos, with their main axis extending proximodistally from the trunk. Distinct limb domains, each with specific traits, are generated in a proximal-to-distal sequence during development. Diffusible factors expressed from signalling centres promote the outgrowth of limbs and specify their dorsoventral and anteroposterior axes. However, the molecular mechanism by which limb cells acquire their proximodistal (P-D) identity is unknown. Here we describe the role of the homeobox genes Meis1/2 and Pbx1 in the development of mouse, chicken and Drosophila limbs. We find that Meis1/2 expression is restricted to a proximal domain, coincident with the previously reported domain in which Pbx1 is localized to the nucleus, and resembling the distribution of the Drosophila homologues homothorax (hth) and extradenticle (exd); that Meis1 regulates Pbx1 activity by promoting nuclear import of the Pbx1 protein; and that ectopic expression of Meis1 in chicken and hth in Drosophila disrupts distal limb development and induces distal-to-proximal transformations. We suggest that restriction of Meis1/Hth to proximal regions of the vertebrate and insect limb is essential to specify cell fates and differentiation patterns along the P-D axis of the limb.
Abstract:
DNA sequence variation is currently a major source of data for studying human origins, evolution, and demographic history, and for detecting linkage association of complex diseases. In this dissertation, I investigated DNA variation in worldwide populations from two ∼10 kb autosomal regions on 22q11.2 (noncoding) and 1q24 (introns). A total of 75 variant sites were found among 128 human sequences in the 22q11.2 region, yielding an estimate of 0.088% for nucleotide diversity (π), and a total of 52 variant sites were found among 122 human sequences in the 1q24 region with an estimated π value of 0.057%. The data from these two regions and a 10 kb noncoding region on Xq13.3 all show a strong excess of low-frequency variants in comparison to that expected from an equilibrium population, indicating a relatively recent population expansion. The effective population sizes estimated from the three regions were 11,000, 12,700, and 8,600, respectively, which are close to the commonly used value of 10,000. In each of the two autosomal regions, the age of the most recent common ancestor (MRCA) was estimated to be older than 1 million years among all the sequences and ∼600,000 years among non-African sequences, providing first evidence from autosomal noncoding or intronic regions for a genetic history of humans much more ancient than the emergence of modern humans. The ancient genetic history of humans indicates no severe bottleneck during the evolution of humans in the last half million years; otherwise, much of the ancient genetic history would have been lost during a severe bottleneck. This study strongly suggests that both the “out of Africa” and the multiregional models are too simple for explaining the evolution of modern humans. A compilation of genome-wide data revealed that nucleotide diversity is highest in autosomal regions, intermediate in X-linked regions, and lowest in Y-linked regions. 
The data suggest the existence of background selection or selective sweep on Y-linked loci. In general, the nucleotide diversity in humans is low compared to that in chimpanzee and Drosophila populations.
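The comparison behind the "excess of low-frequency variants" statement can be sketched as follows. This is a minimal illustration, not the dissertation's analysis: the function names are hypothetical, and the region length of 10,000 bp is an assumption based on the "∼10 kb" figure in the abstract. It computes Watterson's per-site estimator from the number of variant sites and sets it beside the reported π; a π well below Watterson's estimate is the usual signature of many rare variants.

```python
from itertools import combinations


def watterson_a(n_seqs):
    """Return a_n = sum_{i=1}^{n-1} 1/i, the correction factor for sample size."""
    return sum(1.0 / i for i in range(1, n_seqs))


def watterson_theta(seg_sites, n_seqs, length):
    """Per-site Watterson estimator: S / (a_n * L)."""
    return seg_sites / (watterson_a(n_seqs) * length)


def nucleotide_diversity(seqs):
    """Mean number of pairwise differences per site (pi) for aligned sequences."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)


# 22q11.2 figures quoted in the abstract: 75 variant sites, 128 sequences.
# ASSUMPTION: the region is taken to be exactly 10,000 bp long.
theta_22q = watterson_theta(seg_sites=75, n_seqs=128, length=10_000)
pi_22q = 0.00088  # 0.088% as reported in the abstract

print(f"theta_W = {theta_22q:.5f}, pi = {pi_22q:.5f}")
```

Under the assumed length, Watterson's estimate comes out near 0.14% per site, noticeably higher than the reported π of 0.088%. The `nucleotide_diversity` helper shows how π itself would be computed from an alignment.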
Abstract:
Historically, morphological features were used as the primary means to classify organisms. However, the age of molecular genetics has allowed us to approach this field from the perspective of the organism's genetic code. Early work used highly conserved sequences, such as ribosomal RNA. The increasing number of complete genomes in the public data repositories provides the opportunity to look not only at a single gene, but at organisms' entire parts list. Here the Sequence Comparison Index (SCI) and the Organism Comparison Index (OCI), algorithms and methods to compare proteins and proteomes, are presented. The complete proteomes of 104 sequenced organisms were compared. Over 280 million full Smith-Waterman alignments were performed on sequence pairs which had a reasonable expectation of being related. From these alignments a whole proteome phylogenetic tree was constructed. This method was also used to compare the small subunit (SSU) rRNA from each organism, and a tree was constructed from these results. The SSU rRNA tree by the SCI/OCI method looks very much like accepted SSU rRNA trees from sources such as the Ribosomal Database Project, thus validating the method. The SCI/OCI proteome tree showed a number of small but significant differences when compared to the SSU rRNA tree and proteome trees constructed by other methods. Horizontal gene transfer does not appear to affect the SCI/OCI trees until the transferred genes make up a large portion of the proteome. As part of this work, the Database of Related Local Alignments (DaRLA) was created and contains over 81 million rows of sequence alignment information. DaRLA, while primarily used to build the whole proteome trees, can also be applied to shared gene content analysis, gene order analysis, and the creation of individual protein trees. Finally, the standard BLAST method for analyzing shared gene content was compared to the SCI method using 4 spirochetes. 
The SCI system performed flawlessly, finding all proteins from one organism against itself and finding all the ribosomal proteins between organisms. The BLAST system missed some proteins from its respective organism and failed to detect small ribosomal proteins between organisms.
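For readers unfamiliar with the Smith-Waterman alignments mentioned above, the core recurrence can be sketched as below. This is only an illustration of local alignment scoring. The scoring values (match +2, mismatch -1, linear gap -1) are placeholder assumptions, not the scheme used in the dissertation, which would typically use a protein substitution matrix. The SCI and OCI formulas are not given in the abstract and are not reproduced here.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-1):
    """Best local alignment score of strings a and b (linear gap penalty).

    ASSUMPTION: simple match/mismatch scoring stands in for a real
    substitution matrix such as those used for protein comparison.
    """
    prev = [0] * (len(b) + 1)  # previous row of the dynamic-programming table
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


# The shared core "ACGT" drives the score; the flanking bases do not align.
print(smith_waterman_score("TTACGTAA", "GGACGTCC"))
```

Keeping only two rows of the table is enough when just the score is needed; recovering the alignment itself would require the full table and a traceback.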
Abstract:
One of the most critical aspects of G protein-coupled receptor (GPCR) regulation is the rapid and acute desensitization of these receptors following agonist stimulation. Phosphorylation of these receptors by GPCR kinases (GRKs) is a major mechanism of desensitization. Considerable evidence from studies of rhodopsin kinase and GRK2 suggests there is an allosteric docking site for the receptor distinct from the GRK catalytic site. While the agonist-activated GPCR appears crucial for GRK activation, the molecular details of this interaction remain unclear. Recent studies suggested an important role for the N- and C-termini and domains in the small lobe of the kinase domain in allosteric activation; however, neither the mechanism of action of that site nor the RH domain contributions have been elucidated. To search for the allosteric site, we first identified evolutionarily conserved sites within the RH and kinase domains, presumably determinants of protein function, using evolutionary trace (ET) methodology and crystal structures of GRK6. Focusing on a conserved cluster centered on helices 3, 9, and 10 in the RH domain, key residues of GRK5 and 6 were targeted for mutagenesis and functional assays. We found that a number of double mutations within helices 3, 9, and 10 and the N-terminus markedly reduced (50–90%) the constitutive phosphorylation of the β2-adrenergic receptor (β2AR) in intact cells and phosphorylation of light-activated rhodopsin (Rho*) in vitro as compared to wild type (WT) GRK5 or 6. Based on these results, we designed peptide mimetics of GRK5 helix 9 both computationally and through chemical modifications with the goal of both confirming the importance of helix 9 and developing a useful inhibitor to disrupt the GPCR-GRK interaction. Several peptides were found to block Rho* phosphorylation by GRK5, including the native helix 9 sequence, a Peptide Builder-designed peptide preserving only the key ET residues, and chemically locked helices. 
Most peptidomimetics showed greater than 80% inhibition of GRK5 activity with an IC50 of ∼30 µM. Alanine scanning of helix 9 further revealed both essential and non-essential residues for inhibition. Importantly, substitution of Arg 169 by an alanine in the native helix 9-based peptide gave almost complete inhibition at 30 µM with an IC50 of ∼10 µM. In summary, we report a previously unrecognized crucial role for the RH domain of GRK5 and 6, and the subsequent identification of a lead peptide inhibitor of protein-protein interaction with potential for specific blockade of GPCR desensitization.
Abstract:
The slow/cardiac alkali myosin light chain (MLC1s/1c) is a member of a multigene family whose protein products are essential for activation of the myosin ATPase. In the adult, the MLC1s/1c isoform is expressed in both cardiac and slow-twitch skeletal muscles, while it is expressed by all skeletal muscles during development. To elucidate the molecular mechanisms that underlie the transcriptional regulation of MLC1s/1c gene expression, the immediate 5′ flanking region of the gene was isolated and shown to be capable of directing reporter gene expression. Analysis of this region revealed a 110 bp muscle-specific enhancer that includes a myocyte-specific enhancer-binding factor 2 (MEF-2) site, E-boxes, which are potential binding sites for the basic-helix-loop-helix proteins such as MyoD, and an MLC box. The focus of the thesis was to identify the role of the MLC box in expression of the MLC1s/1c gene. The MLC box is a member of the family of CArG box-containing cis-acting DNA elements. Mutagenesis showed that the MLC box is necessary, but not sufficient, for the expression of a reporter gene linked to the 5′ flanking region of the MLC1s/1c gene. Linker scanner and site-directed mutagenesis identified a number of potential sites within the 110 bp muscle-specific enhancer that may cooperate with the MLC box. These are the MEF-2 site, the E-box site, and a 10 bp element located upstream of the MEF-2 site that does not have sequence similarity with any known cis-acting element. The MLC box is capable of binding to factors present in muscle nuclear extracts, as well as to human recombinant serum response factor (SRF). Binding of SRF to the MLC box was correlated with the ability of the 5′ flanking region of the MLC1s/1c gene to drive reporter gene expression. 
Results suggest a model in which binding of SRF to the MLC box activates expression of the MLC1s/1c gene while binding of the factors present in the nuclear extracts suppresses the expression of the gene. (Abstract shortened with permission of author.)
Abstract:
p53 mutations are the most commonly observed genetic alterations in human cancers to date. A majority of these point mutations cluster in four evolutionarily conserved domains spanning amino acids 100-300. This region of p53 has been called its central conserved, or conformational, domain. This domain of p53 is also targeted by the SV40 T antigen. Mutation, as well as interaction with SV40 T antigen, results in inactivation of p53. We hypothesized that mutations and SV40 T antigen disrupt p53 function by interfering with the molecular interactions of the central conserved domain. Using a chimeric protein consisting of the central conserved domain of wild-type p53 (amino acids 115-295) and a protein A affinity tail, we isolated several cellular proteins that interact specifically with this domain of p53. These proteins range in size from 30K to 90K Mᵣ. We also employed the p53 fusion protein to demonstrate that the central conserved domain of p53 possesses sequence-specific DNA-binding activity. Interestingly, the cellular proteins binding to the central conserved domain of p53 enhance the sequence-specific DNA-binding activity of full length p53. Partial purification of the individual proteins binding to the conformational domain of p53 by utilizing a sodium chloride step-gradient enabled further characterization of two proteins: (1) a 42K Mᵣ protein that eluted at 0.5 M NaCl, and bound DNA nonspecifically, and (2) a 35K Mᵣ protein eluting into the 1.0 M NaCl fraction, capable of enhancing the sequence-specific DNA-binding activity of p53. In order to determine the physiologic relevance of the molecular interactions of the conformational domain of p53, we examined the biochemical processes underlying the TNF-α mediated growth suppression of the NSCLC cell line H460. 
While growth suppression was accompanied by enhanced sequence-specific p53-DNA binding activity in TNF-α treated H460 nuclei, there was no increase in p53 protein levels. Furthermore, p35 was upregulated in TNF-α treated H460 cells, suggesting that the enhanced p53-DNA binding seen in these cells may be mediated by p35. Our studies define two novel interactions involving the central conserved domain of p53 that appear to be functionally relevant: (1) sequence-specific DNA-binding, and (2) interaction with other cellular proteins.
Abstract:
Methionine aminopeptidase (MetAP) exists in two forms (type I and type II), both of which remove the N-terminal methionine from proteins. It previously has been shown that the type II enzyme is the molecular target of fumagillin and ovalicin, two epoxide-containing natural products that inhibit angiogenesis and suppress tumor growth. By using mass spectrometry, N-terminal sequence analysis, and electronic absorption spectroscopy we show that fumagillin and ovalicin covalently modify a conserved histidine residue in the active site of the MetAP from Escherichia coli, a type I enzyme. Because all of the key active site residues are conserved, it is likely that a similar modification occurs in the type II enzymes. This modification, by occluding the active site, may prevent the action of MetAP on proteins or peptides involved in angiogenesis. In addition, the results suggest that these compounds may be effective pharmacological agents against pathogenic and resistant forms of E. coli and other microorganisms.
Abstract:
Autonomously replicating sequence (ARS) elements, which function as the cis-acting chromosomal replicators in the yeast Saccharomyces cerevisiae, depend upon an essential copy of the 11-bp ARS consensus sequence (ACS) for activity. Analysis of the chromosome III replicator ARS309 unexpectedly revealed that its essential ACS differs from the canonical ACS at two positions. One of the changes observed in ARS309 inactivates other ARS elements. This atypical ACS binds the origin recognition complex efficiently and is required for chromosomal replication origin activity. Comparison of the essential ACS of ARS309 with the essential regions of other ARS elements revealed an expanded 17-bp conserved sequence that efficiently predicts the essential core of ARS elements.
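The idea of an ACS that "differs from the canonical ACS at two positions" can be made concrete with a small mismatch-tolerant scan. This is a sketch under stated assumptions: the 11-bp consensus string `WTTTAYRTTTW` is the form commonly quoted in the yeast replication literature and is not given in the abstract itself, the function names are hypothetical, and the expanded 17-bp sequence reported by the authors is not reproduced because the abstract does not state it.

```python
# IUPAC codes needed for the assumed consensus (W = A/T, Y = C/T, R = A/G).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "AT", "Y": "CT", "R": "AG"}
ACS = "WTTTAYRTTTW"  # ASSUMPTION: canonical 11-bp ACS as usually written


def mismatches(site, consensus=ACS):
    """Count positions where the site is not allowed by the consensus."""
    return sum(base not in IUPAC[code] for base, code in zip(site, consensus))


def revcomp(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def scan_acs(seq, max_mismatches=0):
    """List (offset, strand, site, n_mismatches) for ACS-like 11-mers.

    Offsets on the '-' strand are measured along the reverse complement.
    """
    hits = []
    k = len(ACS)
    for strand, s in (("+", seq), ("-", revcomp(seq))):
        for i in range(len(s) - k + 1):
            site = s[i:i + k]
            mm = mismatches(site)
            if mm <= max_mismatches:
                hits.append((i, strand, site, mm))
    return hits


print(scan_acs("GGATTTATATTTAGG"))
```

Calling `scan_acs(seq, max_mismatches=2)` would also report near-matches of the kind described for ARS309, although such a scan says nothing about whether a site is functional.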
Abstract:
The Drosophila retinal degeneration C (rdgC) gene encodes an unusual protein serine/threonine phosphatase in that it contains at least two EF-hand motifs at its carboxy terminus. By a combination of large-scale sequencing of human retina cDNA clones and searches of expressed sequence tag and genomic DNA databases, we have identified two sequences in mammals [Protein Phosphatase with EF-hands-1 and 2 (PPEF-1 and PPEF-2)] and one in Caenorhabditis elegans (PPEF) that closely resemble rdgC. In the adult, PPEF-2 is expressed specifically in retinal rod photoreceptors and the pineal. In the retina, several isoforms of PPEF-2 are predicted to arise from differential splicing. The isoform that most closely resembles rdgC is localized to rod inner segments. Together with the recently described localization of PPEF-1 transcripts to primary somatosensory neurons and inner ear cells in the developing mouse, these data suggest that the PPEF family of protein serine/threonine phosphatases plays a specific and conserved role in diverse sensory neurons.
Abstract:
The gene encoding 2-methyl-3-hydroxypyridine-5-carboxylic acid oxygenase (MHPCO; EC 1.14.12.4) was cloned by using an oligonucleotide probe corresponding to the N terminus of the enzyme to screen a DNA library of Pseudomonas sp. MA-1. The gene encodes a protein of 379 amino acid residues corresponding to a molecular mass of 41.7 kDa, the same as that previously estimated for MHPCO. MHPCO was expressed in Escherichia coli and found to have the same properties as the native enzyme from Pseudomonas sp. MA-1. This study shows that MHPCO is a homotetrameric protein with one flavin adenine dinucleotide bound per subunit. Sequence comparison of the enzyme with other hydroxylases reveals regions that are conserved among aromatic flavoprotein hydroxylases.
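The quoted subunit mass can be sanity-checked with a back-of-the-envelope calculation. The sketch below assumes the common rule of thumb of roughly 110 Da per amino acid residue, which is an approximation and not a value from the abstract; the function name is hypothetical. The tetramer figure simply multiplies the reported 41.7 kDa by four and ignores the bound flavin cofactors.

```python
AVG_RESIDUE_MASS_DA = 110.0  # ASSUMPTION: rule-of-thumb average residue mass


def approx_mass_kda(n_residues, avg_residue_da=AVG_RESIDUE_MASS_DA):
    """Rough protein mass in kDa from the residue count alone."""
    return n_residues * avg_residue_da / 1000.0


subunit_kda = approx_mass_kda(379)  # residue count from the abstract
tetramer_kda = 4 * 41.7             # homotetramer of the reported subunit mass

print(f"estimated subunit: {subunit_kda:.1f} kDa, tetramer: {tetramer_kda:.1f} kDa")
```

The rough estimate lands very close to the reported 41.7 kDa, which is what one would expect for a protein of ordinary amino acid composition.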
Abstract:
Multiprotein bridging factor 1 (MBF1) is a transcriptional cofactor that bridges between the TATA box-binding protein (TBP) and the Drosophila melanogaster nuclear hormone receptor FTZ-F1 or its silkworm counterpart BmFTZ-F1. A cDNA clone encoding MBF1 was isolated from the silkworm Bombyx mori; its sequence predicts a basic protein consisting of 146 amino acids. Bacterially expressed recombinant MBF1 is functional in interactions with TBP and a positive cofactor MBF2. The recombinant MBF1 also makes a direct contact with FTZ-F1 through the C-terminal region of the FTZ-F1 DNA-binding domain and stimulates the FTZ-F1 binding to its recognition site. The central region of MBF1 (residues 35–113) is essential for the binding of FTZ-F1, MBF2, and TBP. When the recombinant MBF1 was added to a HeLa cell nuclear extract in the presence of MBF2 and FTZ622 bearing the FTZ-F1 DNA-binding domain, it supported selective transcriptional activation of the fushi tarazu gene as natural MBF1 did. Mutations disrupting the binding of FTZ622 to DNA or MBF1, or an MBF2 mutation disrupting the binding to MBF1, all abolished the selective activation of transcription. These results suggest that tethering of the positive cofactor MBF2 to an FTZ-F1-binding site through FTZ-F1 and MBF1 is essential for the binding site-dependent activation of transcription. A homology search in the databases revealed that the deduced amino acid sequence of MBF1 is conserved across species from yeast to human.
Abstract:
The Drosophila melanogaster Suppressor of forked [Su(f)] protein shares homology with the yeast RNA14 protein and the 77-kDa subunit of human cleavage stimulation factor, which are proteins involved in mRNA 3′ end formation. This suggests a role for Su(f) in mRNA 3′ end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. Using temperature-sensitive su(f) mutants, we show that accumulation of the truncated transcript requires wild-type Su(f) protein. This suggests that the Su(f) protein negatively autoregulates its own accumulation by stimulating 3′ end formation of the truncated su(f) RNA. Cloning of su(f) from Drosophila virilis and analysis of its RNA profile suggest that su(f) autoregulation is conserved in this species. Sequence comparison of su(f) between the two species reveals three conserved regions in intron 4 downstream of the truncated RNA poly(A) site. These conserved regions include the GU-rich downstream sequence involved in poly(A) site definition. Using transgenes truncated within intron 4, we show that sequence up to the conserved GU-rich domain is sufficient for production of the truncated RNA and for regulation of this production by su(f). Our results indicate a role of su(f) in the regulation of poly(A) site utilization and an important role of the GU-rich sequence for this regulation to occur.
Abstract:
Neuronal and glial glutamate transporters remove the excitatory neurotransmitter glutamate from the synaptic cleft. The proteins belong to a large family of secondary transporters, which includes bacterial glutamate transporters. The C-terminal half of the glutamate transporters is well conserved and thought to contain the translocation path and the binding sites for substrate and coupling ions. A serine-rich sequence motif in this part of the proteins is located in a putative intracellular loop. Cysteine-scanning mutagenesis was applied to this loop in the glutamate transporter GltT of Bacillus stearothermophilus. The loop was found to be largely intracellular, but three consecutive positions in the conserved serine-rich motif (S269, S270, and E271) are accessible from both sides of the membrane. Single-cysteine mutants in the serine-rich motif were still capable of glutamate transport, but modification with N-ethylmaleimide blocked the transport activity in six mutants (T267C, A268C, S269C, S270C, E271C, and T272C). Two millimolar L-glutamate effectively protected against the modification of the cysteines at positions 269–271 from the periplasmic side of the membrane but was unable to protect against cysteine modification from the cytoplasmic side of the membrane. The results indicate that the conserved serine-rich motif in the glutamate transporter forms a reentrant loop, a structure that is found in several ion channels but is unusual for transporter proteins. The reentrant loop is of crucial importance for the function of the glutamate transporter.