86 resultados para protein sequence classification
Resumo:
Wurst is a protein threading program with an emphasis on high quality sequence to structure alignments (http://www.zbh.uni-hamburg.de/wurst). Submitted sequences are aligned to each of about 3000 templates with a conventional dynamic programming algorithm, but using a score function with sophisticated structure and sequence terms. The structure terms are a log-odds probability of sequence to structure fragment compatibility, obtained from a Bayesian classification procedure. A simplex optimization was used to optimize the sequence-based terms for the goal of alignment and model quality and to balance the sequence and structural contributions against each other. Both sequence and structural terms operate with sequence profiles.
Resumo:
We sequenced cDNAs coding for chicken cellular nucleic acid binding protein (CNBP). Two slightly different variations of the open reading frame were found, each of which translates into a protein with seven zinc finger domains. The longest transcript contains an in-frame insert of 3 bp. The sequence conservation between chick CNBP cDNAs with human, rat and mouse CNBP cDNAs is extreme, especially in the coding region, where the deduced amino acid sequence identity with human, rat and mouse CNBP is 99%. CNBP-like transcripts were also found in various tissues from insect, shrimp, fish and lizard. Regions with remarkable nucleotide conservation were also found in the 3' untranslated region, indicating important functions for these regions. Quantitative reverse transcription polymerase chain reaction (RT-PCR) indicated that in the chick, CNBP is present in all tissues examined in approximately equal ratios to total RNA. RT-PCR of total RNA isolated from different phyla indicate CNBP-like proteins art widespread throughout the animal kingdom. The extraordinary level of conservation suggests an important physiological role for CNBP. (C) 1997 Elsevier Science Inc.
Resumo:
Segregation of mRNAs in the cytoplasm of polar cells has been demonstrated for proteins involved in Xenopus and Drosophila oogenesis, and for some proteins in somatic cells. It is assumed that vectorial transport of the messages is generally responsible for this localization. The mRNA encoding the basic protein of central nervous system myelin is selectively transported to the distal ends of the processes of oligodendrocytes, where it is anchored to the myelin membrane and translated. This transport is dependent on a 21-nucleotide cis-acting segment of the 3'-untranslated region (RTS). Proteins that bind to this cis-acting segment have now been isolated from extracts of rat brain. A group of six 35-42-kDa proteins bind to a 35-base oligoribonucleotide incorporating the RTS, but not to several oligoribonucleotides with the same composition but randomized sequences, thus establishing specificity for the base sequence in the RTS. The most abundant of these proteins has been identified, by Edman sequencing of tryptic peptides and mass spectroscopy, as heterogeneous nuclear ribonucleoprotein (hnRNP) A2, a 36-kDa member of a family of proteins that are primarily, but not solely, intranuclear. This protein was most abundant in samples from rat brain and testis, with lower amounts in other tissues. It was separated from the other polypeptides by using reverse-phase HPLC and shown to retain preferential association with the RTS. In cultured oligodendrocytes, hnRNP A2 was demonstrated by confocal microscopy to be distributed throughout the nucleus, cell soma, and processes.
Resumo:
Background: The ornamental tobacco Nicotiana alata produces a series of proteinase inhibitors (Pls) that are derived from a 43 kDa precursor protein, NaProPl. NaProPl contains six highly homologous repeats that fold to generate six separate structural domains, each corresponding to one of the native Pls. An unusual feature of NaProPl is that the structural domains lie across adjacent repeats and that the sixth Pl domain is generated from fragments of the first and sixth repeats. Although the homology of the repeats suggests that they may have arisen from gene duplication, the observed folding does not appear to support this. This study of the solution structure of a single NaProPl repeat (aPl1) forms a basis for unravelling the mechanism by which this protein may have evolved, Results: The three-dimensional structure of aPl1 closely resembles the triple-stranded antiparallel beta sheet observed in each of the native Pls. The five-residue sequence Glu-Glu-Lys-Lys-Asn, which forms the linker between the six structural domains in NaProPl, exists as a disordered loop in aPl1. The presence of this loop in aPl1 results in a loss of the characteristically flat and disc-like topography of the native inhibitors. Conclusions: A single repeat from NaProPl is capable of folding into a compact globular domain that displays native-like Pl activity. Consequently, it is possible that a similar single-domain inhibitor represents the ancestral protein from which NaProPl evolved.
Resumo:
Endoparasitoid wasps produce maternal protein secretions, which are transported into the body of insect hosts at oviposition to regulate host physiology for successful development of their offspring. Venturia canescens calyx fluid contains so-called virus-like particles (VLPs) that are essential for immune evasion of the developing parasitoid inside the host. VLPs consist of four major proteins. In this paper, we describe the isolation and molecular cloning of a gene (vlp2) that is a constituent of VLPs and discuss its possible role in VLP structure and function.
Resumo:
Human N-acetyltransferase Type I (NAT1) catalyses the acetylation of many aromatic amine and hydrazine compounds and it has been implicated in the catabolism of folic acid. The enzyme is widely expressed in the body, although there are considerable differences in the level of activity between tissues. A search of the mRNA databases revealed the presence of several NAT1 transcripts in human tissue that appear to be derived from different promoters. Because little is known about NAT1 gene regulation, the present study was undertaken to characterize one of the putative promoter sequences of the NAT1 gene located just upstream of the coding region. We show with reverse-transcriptase PCR that mRNA transcribed from this promoter (Promoter 1) is present in a variety of human cell-lines, but not in quiescent peripheral blood mononuclear cells. Using deletion mutant constructs, we identified a 20 bp sequence located 245 bases upstream of the translation start site which was sufficient for basal NAT1 expression. It comprised an AP-1 (activator protein 1)-binding site, flanked on either side by a TCATT motif. Mutational analysis showed that the AP-1 site and the 3' TCATT sequence were necessary for gene expression, whereas the 5' TCATT appeared to attenuate promoter activity. Electromobility shift assays revealed two specific bands made up by complexes of c-Fos/Fra, c-Jun, YY-1 (Yin and Yang 1) and possibly Oct-1. PMA treatment enhanced expression from the NAT1 promoter via the AP-1-binding site. Furthermore, in peripheral blood mononuclear cells, PMA increased endogenous NAT1 activity and induced mRNA expression from Promoter I, suggesting that it is functional in vivo.
Resumo:
CysView is a web-based application tool that identifies and classifies proteins according to their disulfide connectivity patterns. It accepts a dataset of annotated protein sequences in various formats and returns a graphical representation of cysteine pairing patterns. CysView displays cysteine patterns for those records in the data with disulfide annotations. It allows the viewing of records grouped by connectivity patterns. CysView's utility as an analysis tool was demonstrated by the rapid and correct classification of scorpion toxin entries from GenPept on the basis of their disulfide pairing patterns. It has proved useful for rapid detection of irrelevant and partial records, or those with incomplete annotations. CysView can be used to support distant homology between proteins. CysView is publicly available at http://research.i2r.a-star.edu.sg/CysView/.
Resumo:
We have developed a computational strategy to identify the set of soluble proteins secreted into the extracellular environment of a cell. Within the protein sequences predominantly derived from the RIKEN representative transcript and protein set, we identified 2033 unique soluble proteins that are potentially secreted from the cell. These proteins contain a signal peptide required for entry into the secretory pathway and lack any transmembrane domains or intracellular localization signals. This class of proteins, which we have termed the mouse secretome, included >500 novel proteins and 92 proteins
Resumo:
The C2 domain is one of the most frequent and widely distributed calcium-binding motifs. Its structure comprises an eight-stranded beta-sandwich with two structural types as if the result of a circular permutation. Combining sequence, structural and modelling information, we have explored, at different levels of granularity, the functional characteristics of several families of C2 domains. At the coarsest level,the similarity correlates with key structural determinants of the C2 domain fold and, at the finest level, with the domain architecture of the proteins containing them, highlighting the functional diversity between the various subfamilies. The functional diversity appears as different conserved surface patches throughout this common fold. In some cases, these patches are related to substrate-binding sites whereas in others they correspond to interfaces of presumably permanent interaction between other domains within the same polypeptide chain. For those related to substrate-binding sites, the predictions overlap with biochemical data in addition to providing some novel observations. For those acting as protein-protein interfaces' our modelling analysis suggests that slight variations between families are a result of not only complementary adaptations in the interfaces involved but also different domain architecture. In the light of the sequence and structural genomic projects, the work presented here shows that modelling approaches along with careful sub-typing of protein families will be a powerful combination for a broader coverage in proteomics. (C) 2003 Elsevier Ltd. All rights reserved.
Resumo:
Sequence diversity in the coat protein coding region of Australian strains of Johnsongrass mosaic virus (JGMV) was investigated. Field isolates were sampled during a seven year period from Johnsongrass, sorghum and corn across the northern grain growing region. The 23 isolates were found to have greater than 94% nucleotide and amino acid sequence identity. The Australian isolates and two strains from the U.S.A. had about 90% nucleotide sequence identity and were between 19 and 30% different in the N-terminus of the coat protein. Two amino acid residues were found in the core region of the coat protein in isolates obtained from sorghum having the Krish gene for JGMV resistance that differed from those found in isolates from other hosts which did not have this single dominant resistance gene. These amino acid changes may have been responsible for overcoming the resistance conferred by the Krish gene for JGMV resistance in sorghum. The identification of these variable regions was essential for the development of durable pathogen-derived resistance to JGMV in sorghum.
Resumo:
The function of the prion protein gene (PRNP) and its normal product PrPC is elusive. We used comparative genomics as a strategy to understand the normal function of PRNP. As the reliability of comparisons increases with the number of species and increased evolutionary distance, we isolated and sequenced a 66.5 kb BAC containing the PRNP gene from a distantly related mammal, the model Australian marsupial Macropus eugenii (tammar wallaby). Marsupials are separated from eutherians such as human and mouse by roughly 180 million years of independent evolution. We found that tammar PRNP, like human PRNP, has two exons. Prion proteins encoded by the tammar wallaby and a distantly related marsupial, Monodelphis domestica (Brazilian opossum) PRNP contain proximal PrP repeats with a distinct, marsupial-specific composition and a variable number. Comparisons of tammar wallaby PRNP with PRNPs from human, mouse, bovine and ovine allowed us to identify non-coding gene regions conserved across the marsupial-eutherian evolutionary distance, which are candidates for regulatory regions. In the PRNP 3' UTR we found a conserved signal for nuclear-specific polyadenylation and the putative cytoplasmic polyadenylation element (CPE), indicating that post-transcriptional control of PRNP mRNA activity is important. Phylogenetic footprinting revealed conserved potential binding sites for the MZF-1 transcription factor in both upstream promoter and intron/intron 1, and for the MEF2, MyTI, Oct-1 and NFAT transcription factors in the intron(s). The presence of a conserved NFAT-binding site and CPE indicates involvement of PrPC in signal transduction and synaptic plasticity. (c) 2004 Elsevier B.V. All rights reserved.
Resumo:
This Article Right arrow Full Text Right arrow Full Text (PDF) Right arrow Supplemental material Right arrow Alert me when this article is cited Right arrow Alert me if a correction is posted Services Right arrow Similar articles in this journal Right arrow Similar articles in PubMed Right arrow Alert me to new issues of the journal Right arrow Download to citation manager Right arrow Reprints and Permissions Right arrow Copyright Information Right arrow Books from ASM Press Right arrow MicrobeWorld Citing Articles Right arrow Citing Articles via HighWire Right arrow Citing Articles via Google Scholar Google Scholar Right arrow Articles by Lee, N. Right arrow Articles by McCarthy, J. Right arrow Search for Related Content PubMed Right arrow PubMed Citation Right arrow Articles by Lee, N. Right arrow Articles by McCarthy, J. Right arrow Pubmed/NCBI databases * Substance via MeSH Previous Article | Next Article Journal of Clinical Microbiology, August 2006, p. 2773-2778, Vol. 44, No. 8 0095-1137/06/$08.00+0 doi:10.1128/JCM.02557-05 Copyright © 2006, American Society for Microbiology. All Rights Reserved. Effect of Sequence Variation in Plasmodium falciparum Histidine- Rich Protein 2 on Binding of Specific Monoclonal Antibodies: Implications for Rapid Diagnostic Tests for Malaria{dagger} Nelson Lee,1,2 Joanne Baker,2 Kathy T. Andrews,1 Michelle L. Gatton,1,3 David Bell,4 Qin Cheng,2,3 and James McCarthy1* Australian Centre for International and Tropical Health and Nutrition, Queensland Institute of Medical Research and School of Population Health, University of Queensland, Queensland, Australia,1 Department of Drug Resistance and Diagnostics, Australian Army Malaria Institute, Brisbane, Australia,2 Malaria Drug Resistance and Chemotherapy, Queensland Institute of Medical Research, Queensland, Australia,3 World Health Organization, Regional Office for the Western Pacific, Manila, Philippines4 Received 8 December 2005/ Returned for modification 23 February 2006/ Accepted 26 May 2006 The ability to accurately diagnose malaria infections, particularly in settings where laboratory facilities are not well developed, is of key importance in the control of this disease. Rapid diagnostic tests (RDTs) offer great potential to address this need. Reports of significant variation in the field performance of RDTs based on the detection of Plasmodium falciparum histidine-rich protein 2 (HRP2) (PfHRP2) and of significant sequence polymorphism in PfHRP2 led us to evaluate the binding of four HRP2-specific monoclonal antibodies (MABs) to parasite proteins from geographically distinct P. falciparum isolates, define the epitopes recognized by these MABs, and relate the copy number of the epitopes to MAB reactivity. We observed a significant difference in the reactivity of the same MAB to different isolates and between different MABs tested with single isolates. When the target epitopes of three of the MABs were determined and mapped onto the peptide sequences of the field isolates, significant variability in the frequency of these epitopes was observed. These findings support the role of sequence variation as an explanation for variations in the performance of HRP2-based RDTs and point toward possible approaches to improve their diagnostic sensitivities