993 resultados para conserved noncoding sequence
Resumo:
Gold(I) salts and selenite, which have diverse therapeutic and biological effects, are noted for their reactivity with thiols. Since the binding of Jun-Jun and Jun-Fos dimers to the AP-1 DNA binding site is regulated in vitro by a redox process involving conserved cysteine residues, we hypothesized that some of the biological actions of gold and selenium are mediated via these residues. In electrophoretic mobility-shift analyses, AP-1 DNA binding was inhibited by gold(I) thiolates and selenite, with 50% inhibition occurring at approximately 5 microM and 1 microM, respectively. Thiomalic acid had no effect in the absence of gold(I), and other metal ions inhibited at higher concentrations, in a rank order correlating with their thiol binding affinities. Cysteine-to-serine mutants demonstrated that these effects of gold(I) and selenite require Cys272 and Cys154 in the DNA-binding domains of Jun and Fos, respectively. Gold(I) thiolates and selenite did not inhibit nonspecific protein binding to the AP-1 site and were at least an order of magnitude less potent as inhibitors of sequence-specific binding to the AP-2, TFIID, or NF1 sites compared with the AP-1 site. In addition, 10 microM gold(I) or 10 microM selenite inhibited expression of an AP-1-dependent reporter gene, but not an AP-2-dependent reporter gene. These data suggest a mechanism regulating transcription factor activity by inorganic ions which may contribute to the known antiarthritic action of gold and cancer chemoprevention by selenium.
Resumo:
A large family of genes encodes proteins with RNA recognition motifs that are presumed to bind RNA and to function in posttranscriptional regulation. Neural-specific members of this family include elav, a gene required for correct differentiation and maintenance of neurons in Drosophila melanogaster, and a related gene, HuD, which is expressed in human neuronal cells. I have identified genes related to elav and HuD in Xenopus laevis, zebrafish, and mouse that define a family of four closely related vertebrate elav-like genes (elrA, elrB, elrC, and elrD) in fish, frogs, and mammals. In addition to protein sequence conservation, a segment of the 3'-untranslated sequence of elrD is also conserved, implying a functional role in elrD expression. In adult frogs, elrC and elrD are exclusively expressed in the brain, whereas elrB is expressed in brain, testis, and ovary. During Xenopus development, elrC and elrD RNAs are detected by late gastrula and late neurula stages, respectively, whereas a nervous system-specific elrB RNA species is expressed by early tadpole stage. Additional elrB transcripts are detected in the ovary and early embryo, demonstrating a maternal supply of mRNA and possibly of protein. These expression patterns suggest a role for different elav-like genes in early development and neuronal differentiation. Surprisingly, elrA is expressed in all adult tissues tested and at all times during development. Thus, the widely expressed elrA is expected to have a related function in all cells.
Resumo:
We have explored the feasibility of using a "double-tagging" assay for assessing which amino acids of a protein are responsible for its binding to another protein. We have chosen the adenovirus E1A-retinoblastoma gene product (pRB) proteins for a model system, and we focused on the high-affinity conserved region 2 of adenovirus E1A (CR2). We used site-specific mutagenesis to generate a mutant E1A gene with a lysine instead of an aspartic acid at position 121 within the CR2 site. We demonstrated that this mutant exhibited little binding to pRB by the double-tagging assay. We also have shown that this lack of binding is not due to any significant decrease in the level of expression of the beta-galactosidase-E1A fusion protein. We then created a "library" of phage expressing beta-galactosidase-E1A fusion proteins with a variety of different mutations within CR2. This library of E1A mutations was used in a double-tagging screening to identify mutant clones that bound to pRB. Three classes of phage were identified: the vast majority of clones were negative and exhibited no binding to pRB. Approximately 1 in 10,000 bound to pRB but not to E1A ("true positives"). A variable number of clones appeared to bind equally well to both pRB and E1A ("false positives"). The DNA sequence of 10 true positive clones yielded the following consensus sequence: DLTCXEX, where X = any amino acid. The recovery of positive clones with only one of several allowed amino acids at each position suggests that most, if not all, of the conserved residues play an important role in binding to pRB. On the other hand, the DNA sequence of the negative clones appeared random. These results are consistent with those obtained from other sources. These data suggest that a double-tagging assay can be employed for determining which amino acids of a protein are important for specifying its interaction with another protein if the complex forms within bacteria. This assay is rapid and up to 1 x 10(6) mutations can be screened at one time.
Resumo:
In this paper, we show the conserved regulation of the homeodomain gene Distal-less-3 (Dlx-3) by analyzing the expression of a promoter from the Xenopus ortholog, Xdll-2, in transgenic mice. A 470-bp frog regulatory sequence confers appropriate expression on a lacZ reporter gene in the ectodermal component of structures derived from epithelial-mesenchymal interactions. Remarkably, this includes structures absent in Xenopus, such as the hair follicle and mammary gland, suggesting that conserved regulatory elements can be used to control the formation of structures peculiar to individual species. In addition, expression of Dlx-3 in developing limbs is highest at the most distal portion. This pattern is duplicated by the Xenopus promoter, indicating that this DNA may include sequences responsive to conserved proximodistal patterning signals in the vertebrate limb.
Resumo:
The influence of a synthetic retroviral peptide, CKS-17, on T helper type 1 (Th1)- or Th2-related cytokines was investigated in human blood mononuclear cells. Cells were stimulated with staphylococcal enterotoxin A, anti-CD3 plus anti-CD28 monoclonal antibodies, or lipopolysaccharide to induce cytokine mRNA. mRNA was detected by a reverse transcription-polymerase chain reaction or Northern blot analysis. CKS-17 down-regulated stimulant-induced mRNA accumulation for interferon gamma (IFN-gamma), interleukin (IL)-2, and p40 heavy and p35 light chains of IL-12, a cytokine that mediates development of Th1 response. CKS-17 up-regulated stimulant-induced mRNA accumulation of IL-10 and did not suppress Th2-related cytokine (IL-4, IL-5, IL-6, or IL-13) mRNA expression. A reverse sequence of CKS-17 peptide, used as a control, showed no such action. Anti-human IL-10 monoclonal antibody blocked ability of CKS-17 to inhibit mRNA accumulation for IFN-gamma but not the CKS-17 suppressive activity of IL-12 p40 heavy chain mRNA. Thus, CKS-17-mediated suppression of IFN-gamma mRNA expression is dependent upon augmentation of IL-10 production by CKS-17. This conserved component of several retroviral envelope proteins, CKS-17, may act as an immunomodulatory epitope responsible for cytokine dysregulation that leads to suppression of cellular immunity.
Resumo:
In vertebrate species, the innate immune system down-regulates protein translation in response to viral infection through the action of the double-stranded RNA (dsRNA)-activated protein kinase (PKR). In some teleost species another protein kinase, Z-DNA-dependent protein kinase (PKZ), plays a similar role but instead of dsRNA binding domains, PKZ has Zα domains. These domains recognize the left-handed conformer of dsDNA and dsRNA known as Z-DNA/Z-RNA. Cyprinid herpesvirus 3 infects common and koi carp, which have PKZ, and encodes the ORF112 protein that itself bears a Zα domain, a putative competitive inhibitor of PKZ. Here we present the crystal structure of ORF112-Zα in complex with an 18-bp CpG DNA repeat, at 1.5 Å. We demonstrate that the bound DNA is in the left-handed conformation and identify key interactions for the specificity of ORF112. Localization of ORF112 protein in stress granules induced in Cyprinid herpesvirus 3-infected fish cells suggests a functional behavior similar to that of Zα domains of the interferon-regulated, nucleic acid surveillance proteins ADAR1 and DAI.
Resumo:
The C2 domain is one of the most frequent and widely distributed calcium-binding motifs. Its structure comprises an eight-stranded beta-sandwich with two structural types as if the result of a circular permutation. Combining sequence, structural and modelling information, we have explored, at different levels of granularity, the functional characteristics of several families of C2 domains. At the coarsest level,the similarity correlates with key structural determinants of the C2 domain fold and, at the finest level, with the domain architecture of the proteins containing them, highlighting the functional diversity between the various subfamilies. The functional diversity appears as different conserved surface patches throughout this common fold. In some cases, these patches are related to substrate-binding sites whereas in others they correspond to interfaces of presumably permanent interaction between other domains within the same polypeptide chain. For those related to substrate-binding sites, the predictions overlap with biochemical data in addition to providing some novel observations. For those acting as protein-protein interfaces' our modelling analysis suggests that slight variations between families are a result of not only complementary adaptations in the interfaces involved but also different domain architecture. In the light of the sequence and structural genomic projects, the work presented here shows that modelling approaches along with careful sub-typing of protein families will be a powerful combination for a broader coverage in proteomics. (C) 2003 Elsevier Ltd. All rights reserved.
Resumo:
With the sequencing and annotation of genomes and transcriptomes of several eukaryotes, the importance of noncoding RNA (ncRNA)-RNA molecules that are not translated to protein products-has become more evident. A subclass of ncRNA transcripts are encoded by highly regulated, multi-exon, transcriptional units, are processed like typical protein-coding mRNAs and are increasingly implicated in regulation of many cellular functions in eukaryotes. This study describes the identification of candidate functional ncRNAs from among the RIKEN mouse full-length cDNA collection, which contains 60,770 sequences, by using a systematic computational filtering approach. We initially searched for previously reported ncRNAs and found nine murine ncRNAs and homologs of several previously described nonmouse ncRNAs. Through our computational approach to filter artifact-free clones that lack protein coding potential, we extracted 4280 transcripts as the largest-candidate set. Many clones in the set had EST hits, potential CpG islands surrounding the transcription start sites, and homologies with the human genome. This implies that many candidates are indeed transcribed in a regulated manner. Our results demonstrate that ncRNAs are a major functional subclass of processed transcripts in mammals.
Resumo:
Background: Protein tertiary structure can be partly characterized via each amino acid's contact number measuring how residues are spatially arranged. The contact number of a residue in a folded protein is a measure of its exposure to the local environment, and is defined as the number of C-beta atoms in other residues within a sphere around the C-beta atom of the residue of interest. Contact number is partly conserved between protein folds and thus is useful for protein fold and structure prediction. In turn, each residue's contact number can be partially predicted from primary amino acid sequence, assisting tertiary fold analysis from sequence data. In this study, we provide a more accurate contact number prediction method from protein primary sequence. Results: We predict contact number from protein sequence using a novel support vector regression algorithm. Using protein local sequences with multiple sequence alignments (PSI-BLAST profiles), we demonstrate a correlation coefficient between predicted and observed contact numbers of 0.70, which outperforms previously achieved accuracies. Including additional information about sequence weight and amino acid composition further improves prediction accuracies significantly with the correlation coefficient reaching 0.73. If residues are classified as being either contacted or non-contacted, the prediction accuracies are all greater than 77%, regardless of the choice of classification thresholds. Conclusion: The successful application of support vector regression to the prediction of protein contact number reported here, together with previous applications of this approach to the prediction of protein accessible surface area and B-factor profile, suggests that a support vector regression approach may be very useful for determining the structure-function relation between primary sequence and higher order consecutive protein structural and functional properties.
Resumo:
The function of the prion protein gene (PRNP) and its normal product PrPC is elusive. We used comparative genomics as a strategy to understand the normal function of PRNP. As the reliability of comparisons increases with the number of species and increased evolutionary distance, we isolated and sequenced a 66.5 kb BAC containing the PRNP gene from a distantly related mammal, the model Australian marsupial Macropus eugenii (tammar wallaby). Marsupials are separated from eutherians such as human and mouse by roughly 180 million years of independent evolution. We found that tammar PRNP, like human PRNP, has two exons. Prion proteins encoded by the tammar wallaby and a distantly related marsupial, Monodelphis domestica (Brazilian opossum) PRNP contain proximal PrP repeats with a distinct, marsupial-specific composition and a variable number. Comparisons of tammar wallaby PRNP with PRNPs from human, mouse, bovine and ovine allowed us to identify non-coding gene regions conserved across the marsupial-eutherian evolutionary distance, which are candidates for regulatory regions. In the PRNP 3' UTR we found a conserved signal for nuclear-specific polyadenylation and the putative cytoplasmic polyadenylation element (CPE), indicating that post-transcriptional control of PRNP mRNA activity is important. Phylogenetic footprinting revealed conserved potential binding sites for the MZF-1 transcription factor in both upstream promoter and intron/intron 1, and for the MEF2, MyTI, Oct-1 and NFAT transcription factors in the intron(s). The presence of a conserved NFAT-binding site and CPE indicates involvement of PrPC in signal transduction and synaptic plasticity. (c) 2004 Elsevier B.V. All rights reserved.
Resumo:
Background: The multitude of motif detection algorithms developed to date have largely focused on the detection of patterns in primary sequence. Since sequence-dependent DNA structure and flexibility may also play a role in protein-DNA interactions, the simultaneous exploration of sequence-and structure-based hypotheses about the composition of binding sites and the ordering of features in a regulatory region should be considered as well. The consideration of structural features requires the development of new detection tools that can deal with data types other than primary sequence. Results: GANN ( available at http://bioinformatics.org.au/gann) is a machine learning tool for the detection of conserved features in DNA. The software suite contains programs to extract different regions of genomic DNA from flat files and convert these sequences to indices that reflect sequence and structural composition or the presence of specific protein binding sites. The machine learning component allows the classification of different types of sequences based on subsamples of these indices, and can identify the best combinations of indices and machine learning architecture for sequence discrimination. Another key feature of GANN is the replicated splitting of data into training and test sets, and the implementation of negative controls. In validation experiments, GANN successfully merged important sequence and structural features to yield good predictive models for synthetic and real regulatory regions. Conclusion: GANN is a flexible tool that can search through large sets of sequence and structural feature combinations to identify those that best characterize a set of sequences.
Resumo:
The tropical abalone. Haliotis asinina. is,in ideal species to investigate the molecular mechanisms that control development. growth, reproduction and shell formation in all cultured haliotids. Here we describe the analysis of 232 expressed sequence tags (EST) obtained front a developmental H. asinina cDNA library intended for future microarray studies. From this data set we identified 183 unique gene Clusters. Of these, 90 clusters showed significant homology with sequences lodged in GenBank, ranging in function from general housekeeping to signal transduction, gene regulation and cell-cell communication. Seventy-one clusters possessed completely novel ORFs greater than 50 codons in length, highlighting the paucity of sequence data from molluscs and other lophotrochozoans. This study of developmental gene expression in H. asinina provides the foundation for further detailed analyses of abalone growth, development and reproduction.
Resumo:
Although MYB overexpression in colorectal cancer (CRC) is known to be a prognostic indicator for poor survival, the basis for this overexpression is unclear. Among multiple levels of MYB regulation, the most dynamic is the control of transcriptional elongation by sequences within intron I. The authors have proposed that this regulatory sequence is transcribed into an RNA stem-loop and 19-residue polyuridine tract, and is subject to mutation in CRC. When this region was examined in colorectal and breast carcinoma cell lines and tissues, the authors found frequent mutations only in CRC. It was determined that these mutations allowed increased transcription compared with the wild type sequence. These data suggest that this MYB regulatory region within intron I is subject to mutations in CRC but not breast cancer, perhaps consistent with the mutagenic insult that occurs within the colon and not mammary tissue. In CRC, these mutations may contribute to MYB overexpression, highlighting the importance of noncoding sequences in the regulation of key cancer genes. (c) 2006 Wiley-Liss, Inc.
Resumo:
The gene content of a mitochondrial (mt) genome, i.e., 37 genes and a large noncoding region (LNR), is usually conserved in Metazoa. The arrangement of these genes and the LNR is generally conserved at low taxonomic levels but varies substantially at high levels. We report here a variation in mt gene content and gene arrangement among chigger mites of the genus Leptotrombidium. We found previously that the mt genome of Leptotrombidium pallidum has an extra gene for large-subunit rRNA (rrnL), a pseudo-gene for small-subunit rRNA (PrrnS), and three extra LNRs, additional to the 37 genes and an LNR typical of Metazoa. Further, the arrangement of mt genes of L. pallidum differs drastically from that of the hypothetical ancestor of the arthropods. To find to what extent the novel gene content and gene arrangement occurred in Leptotrombidium, we sequenced the entire or partial mt genomes of three other species, L. akamushi, L. deliense, and L. fletcheri. These three species share the arrangement of all genes with L. pallidum, except trnQ (for tRNA-glutamine). Unlike L. pallidum, however, these three species do not have extra rrnL or PrrnS and have only one extra LNR. By comparison between Leptotrombidium species and the ancestor of the arthropods, we propose that (1) the type of mt genome present in L. pallidum evolved from the type present in the other three Leptotrombidium species, and (2) three molecular mechanisms were involved in the evolution of mt gene content and gene arrangement in Leptotrombidium species.
Resumo:
Our previous studies using trans-complementation analysis of Kunjin virus (KUN) full-length cDNA clones harboring in-frame deletions in the NS3 gene demonstrated the inability of these defective complemented RNAs to be packaged into virus particles (W. J. Liu, P. L. Sedlak, N. Kondratieva, and A. A. Khromykh, J. Virol. 76:10766-10775). In this study we aimed to establish whether this requirement for NS3 in RNA packaging is determined by the secondary RNA structure of the NS3 gene or by the essential role of the translated NS3 gene product. Multiple silent mutations of three computer-predicted stable RNA structures in the NS3 coding region of KUN replicon RNA aimed at disrupting RNA secondary structure without affecting amino acid sequence did not affect RNA replication and packaging into virus-like particles in the packaging cell line, thus demonstrating that the predicted conserved RNA structures in the NS3 gene do not play a role in RNA replication and/or packaging. In contrast, double frameshift mutations in the NS3 coding region of full-length KUN RNA, producing scrambled NS3 protein but retaining secondary RNA structure, resulted in the loss of ability of these defective RNAs to be packaged into virus particles in complementation experiments in KUN replicon-expressing cells. Furthermore, the more robust complementation-packaging system based on established stable cell lines producing large amounts of complemented replicating NS3-deficient replicon RNAs and infection with KUN virus to provide structural proteins also failed to detect any secreted virus-like particles containing packaged NS3-deficient replicon RNAs. These results have now firmly established the requirement of KUN NS3 protein translated in cis for genome packaging into virus particles.