988 results for Signal Peptide Prediction
Abstract:
Signal peptides and transmembrane helices both contain a stretch of hydrophobic amino acids. This common feature makes it difficult for signal peptide and transmembrane helix predictors to correctly assign identity to stretches of hydrophobic residues near the N-terminal methionine of a protein sequence. The inability to reliably distinguish between N-terminal transmembrane helix and signal peptide is an error with serious consequences for the prediction of protein secretory status or transmembrane topology. In this study, we report a new method for differentiating protein N-terminal signal peptides and transmembrane helices. Based on the sequence features extracted from hydrophobic regions (amino acid frequency, hydrophobicity, and the start position), we set up discriminant functions and examined them on non-redundant datasets with jackknife tests. This method can incorporate other signal peptide prediction methods and achieve higher prediction accuracy. For Gram-negative bacterial proteins, 95.7% of N-terminal signal peptides and transmembrane helices can be correctly predicted (coefficient 0.90). Given a sensitivity of 90%, transmembrane helices can be identified from signal peptides with a precision of 99% (coefficient 0.92). For eukaryotic proteins, 94.2% of N-terminal signal peptides and transmembrane helices can be correctly predicted with coefficient 0.83. Given a sensitivity of 90%, transmembrane helices can be identified from signal peptides with a precision of 87% (coefficient 0.85). The method can be used to complement current transmembrane protein prediction and signal peptide prediction methods to improve their prediction accuracies. (C) 2003 Elsevier Inc. All rights reserved.
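The sequence features named in the abstract above (amino acid frequency, hydrophobicity, and start position of the hydrophobic region) can be illustrated with a minimal sketch. It is not the authors' discriminant function: the Kyte-Doolittle scale, the 10-residue window, the 60-residue N-terminal cutoff, and the function name `hydrophobic_region_features` are all assumptions chosen for illustration.

```python
# Kyte-Doolittle hydropathy scale (an assumed choice; the abstract does not
# say which hydrophobicity scale the authors used).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydrophobic_region_features(seq, window=10, n_term=60):
    """Locate the most hydrophobic window in the N-terminal region.

    Returns (start, mean_hydropathy, residue_frequencies), where start is
    the 1-based position of the window's first residue.
    """
    region = seq[:n_term].upper()
    if len(region) < window:
        raise ValueError("sequence is shorter than the window")

    best_start, best_score = 0, float("-inf")
    for i in range(len(region) - window + 1):
        # Unknown residue codes contribute 0.0 to the window average.
        score = sum(KD.get(aa, 0.0) for aa in region[i:i + window]) / window
        if score > best_score:  # strict '>' keeps the first best window
            best_start, best_score = i, score

    segment = region[best_start:best_start + window]
    freqs = {aa: segment.count(aa) / window for aa in set(segment)}
    return best_start + 1, best_score, freqs


if __name__ == "__main__":
    start, score, freqs = hydrophobic_region_features("MKK" + "L" * 10 + "D" * 10)
    print(start, round(score, 2), freqs)
```

A feature vector of this kind could then be passed to any discriminant function; the choice of classifier is left open here because the abstract does not specify it.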
Abstract:
Maturation of the arenavirus GP precursor (GPC) involves proteolytic processing by cellular signal peptidase and the proprotein convertase subtilisin kexin isozyme 1 (SKI-1)/site 1 protease (S1P), yielding a tripartite complex composed of a stable signal peptide (SSP), the receptor-binding GP1, and the fusion-active transmembrane GP2. Here we investigated the roles of SKI-1/S1P processing and SSP in the biosynthesis of the recombinant GP ectodomains of lymphocytic choriomeningitis virus (LCMV) and Lassa virus (LASV). When expressed in mammalian cells, the LCMV and LASV GP ectodomains underwent processing by SKI-1/S1P, followed by dissociation of GP1 from GP2. The GP2 ectodomain spontaneously formed trimers, as revealed by chemical cross-linking. The endogenous SSP, known to be crucial for maturation and transport of full-length arenavirus GPC, was dispensable for processing and secretion of the soluble GP ectodomain, suggesting a specific role of SSP in the stable prefusion conformation and transport of full-length GPC.
Abstract:
Persistence in canine distemper virus (CDV) infection is correlated with very limited cell-cell fusion and lack of cytolysis induced by the neurovirulent A75/17-CDV compared to that of the cytolytic Onderstepoort vaccine strain. We have previously shown that this difference was at least in part due to the amino acid sequence of the fusion (F) protein (P. Plattet, J. P. Rivals, B. Zuber, J. M. Brunner, A. Zurbriggen, and R. Wittek, Virology 337:312-326, 2005). Here, we investigated the molecular mechanisms of the neurovirulent CDV F protein underlying limited membrane fusion activity. By exchanging the signal peptide between both F CDV strains or replacing it with an exogenous signal peptide, we demonstrated that this domain controlled intracellular and consequently cell surface protein expression, thus indirectly modulating fusogenicity. In addition, by serially passaging a poorly fusogenic virus and selecting a syncytium-forming variant, we identified the mutation L372W as being responsible for this change of phenotype. Intriguingly, residue L372 potentially is located in the helical bundle domain of the F(1) subunit. We showed that this mutation drastically increased fusion activity of F proteins of both CDV strains in a signal peptide-independent manner. Due to its unique structure even among morbilliviruses, our findings with respect to the signal peptide are likely to be specifically relevant to CDV, whereas the results related to the helical bundle add new insights to our growing understanding of this class of F proteins. We conclude that different mechanisms involving multiple domains of the neurovirulent A75/17-CDV F protein act in concert to limit fusion activity, preventing lysis of infected cells, which ultimately may favor viral persistence.
Abstract:
The stable signal peptide (SSP) of the lymphocytic choriomeningitis virus surface glycoprotein precursor has several unique characteristics. The SSP is unusually long, at 58 amino acids, and contains two hydrophobic domains, and its sequence is highly conserved among both Old and New World arenaviruses. To better understand the functions of the SSP, a panel of point and deletion mutants was created by in vitro mutagenesis to target the highly conserved elements within the SSP. We were also able to confirm critical residues required for separate SSP functions by trans-complementation. Using these approaches, it was possible to resolve functional domains of the SSP. In characterizing our SSP mutants, we discovered that the SSP is involved in several distinct functions within the viral life cycle, beyond translocation of the viral surface glycoprotein precursor into the endoplasmic reticulum lumen. The SSP is required for efficient glycoprotein expression, posttranslational maturation cleavage of GP1 and GP2 by SKI-1/S1P protease, glycoprotein transport to the cell surface plasma membrane, formation of infectious virus particles, and acid pH-dependent glycoprotein-mediated cell fusion.
Abstract:
The presence of the schizont stage of the obligate intracellular parasites Theileria parva or T. annulata in the cytoplasm of an infected leukocyte results in host cell transformation via a mechanism that has not yet been elucidated. Proteins secreted by the schizont or expressed on its surface are of interest, as they can interact with host cell molecules that regulate host cell proliferation and/or survival. The major schizont surface protein is the polymorphic immunodominant molecule, PIM, which contains a large glutamine- and proline-rich domain (QP-rd) that protrudes into the host cell cytoplasm. Analyzing QP-rd generated by in vitro transcription/translation, we found that the signal peptide was efficiently cleaved post-translationally upon addition of T cell lysate or canine pancreatic microsomes, whereas signal peptide cleavage of a control protein only occurred cotranslationally and in the presence of microsomal membranes. The QP-rd of PIM migrated anomalously in SDS-PAGE, and removal of the 19 amino acids corresponding to the predicted signal peptide caused a decrease in apparent molecular mass of 24 kDa. The molecule was analyzed using monoclonal antibodies that recognize a set of previously defined PIM epitopes. Depending on the presence or absence of the signal peptide, two conformational states could be demonstrated that are differentially recognized, with N-terminal epitopes becoming readily accessible upon signal peptide removal, and C-terminal epitopes becoming masked. Similar observations were made when the QP-rd of PIM was expressed in bacteria. Our observations could also be of relevance to other schizont proteins. A recent analysis of the proteomes of T. parva and T. annulata revealed the presence of a large family of potentially secreted proteins, characterized by the presence of large stretches of amino acids that are also particularly rich in QP-residues.
Abstract:
Although there is considerable evidence that PrPSc is the infectious form of the prion protein, it has recently been proposed that a transmembrane variant called CtmPrP is the direct cause of prion-associated neurodegeneration. We report here, using a mutant form of PrP that is synthesized exclusively with the CtmPrP topology, that CtmPrP is retained in the endoplasmic reticulum and is degraded by the proteasome. We also demonstrate that CtmPrP contains an uncleaved, N-terminal signal peptide as well as a C-terminal glycolipid anchor. These results provide insight into general mechanisms that control the topology of membrane proteins during their synthesis in the endoplasmic reticulum, and they also suggest possible cellular pathways by which CtmPrP may cause disease.
Abstract:
The maize floury 2 (fl2) mutation enhances the lysine content of the grain, but the soft texture of the endosperm makes it unsuitable for commercial production. The mutant phenotype is linked with the appearance of a 24-kDa alpha-zein protein and increased synthesis of binding protein, both of which are associated with irregularly shaped protein bodies. We have cloned the gene encoding the 24-kDa protein and show that it is expressed as a 22-kDa alpha-zein with an uncleaved signal peptide. Comparison of the deduced N-terminal amino acid sequence of the 24-kDa alpha-zein protein with other alpha-zeins revealed an alanine to valine substitution at the C-terminal position of the signal peptide, a histidine insertion within the seventh alpha-helical repeat, and an alanine to threonine substitution within the same alpha-helical repeat of the protein. Structural defects associated with this alpha-zein explain many of the phenotypic effects of the fl2 mutation.
Abstract:
Membrane proteins are a large and important class of proteins. They are responsible for several of the key functions in a living cell, e.g. transport of nutrients and ions, cell-cell signaling, and cell-cell adhesion. Despite their importance, it has not been possible to study their structure and organization in much detail because of the difficulty of obtaining 3D structures. In this thesis, theoretical studies of membrane protein sequences and structures have been carried out by analyzing existing experimental data. The data come from several sources, including sequence databases, genome sequencing projects, and 3D structures. Prediction of the membrane-spanning regions by hydrophobicity analysis is a key technique used in several of the studies. A novel method for this is also presented and compared to other methods. The primary questions addressed in the thesis are: What properties are common to all membrane proteins? What is the overall architecture of a membrane protein? What properties govern the integration into the membrane? How many membrane proteins are there and how are they distributed in different organisms? Several of the findings have now been backed up by experiments. An analysis of the large family of G-protein coupled receptors pinpoints differences in length and amino acid composition of loops between proteins with and without a signal peptide, and also differences between extra- and intracellular loops. Known 3D structures of membrane proteins have been studied in terms of hydrophobicity, distribution of secondary structure and amino acid types, position-specific residue variability, and differences between loops and membrane-spanning regions. An analysis of several fully and partially sequenced genomes from eukaryotes, prokaryotes, and archaea has been carried out.
Several differences in the membrane protein content between organisms were found, the most important being the total number of membrane proteins and the distribution of membrane proteins with a given number of transmembrane segments. Of the properties that were found to be similar in all organisms, the most obvious is the bias in the distribution of positive charges between the extra- and intracellular loops. Finally, an analysis of homologues to membrane proteins with known topology uncovered two related, multi-spanning proteins with opposite predicted orientations. The predicted topologies were verified experimentally, providing a first example of "divergent topology evolution".
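The bias in positive charges between extra- and intracellular loops described above can be checked on a single protein once a topology is available. The following is a minimal sketch only: the per-residue topology string (`i` for intracellular loop, `o` for extracellular loop, `M` for membrane) and the function name `charge_bias` are assumptions for illustration, not the encoding used in the thesis.

```python
def charge_bias(seq, topology):
    """Count Lys/Arg residues in intracellular ('i') and extracellular ('o') loops.

    `topology` is a string of the same length as `seq`, with one state
    character per residue. Membrane residues ('M') are ignored.
    Returns (positive_counts, loop_lengths), both keyed by 'i' and 'o'.
    """
    if len(seq) != len(topology):
        raise ValueError("sequence and topology must have the same length")

    positives = {"i": 0, "o": 0}
    lengths = {"i": 0, "o": 0}
    for aa, state in zip(seq.upper(), topology):
        if state in positives:
            lengths[state] += 1
            if aa in ("K", "R"):
                positives[state] += 1
    return positives, lengths


if __name__ == "__main__":
    # Toy two-helix protein with both termini on the intracellular side.
    seq = "MKRK" + "L" * 8 + "SDGS" + "L" * 8 + "RRK"
    topo = "iiii" + "M" * 8 + "oooo" + "M" * 8 + "iii"
    print(charge_bias(seq, topo))
```

Dividing each count by the corresponding loop length gives a charge density per side, which is easier to compare across proteins with very different loop sizes.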
Abstract:
The goal of this thesis is to develop a machine-learning method for predicting the disulfide-bonding states of cysteine residues in proteins, a sub-problem of the larger, still unsolved problem of protein structure prediction. Better prediction of the disulfide-bonding states of cysteine residues constrains the three-dimensional (3D) space of the corresponding protein structure and should thus eventually help in predicting the 3D structure of proteins. The results of this work have direct implications for site-directed mutational studies of proteins, protein engineering, and the problem of protein folding. We used a combination of an Artificial Neural Network (ANN) and a Hidden Markov Model (HMM), the so-called Hidden Neural Network (HNN), as the machine learning technique for our prediction method. Using different global and local features of proteins (specifically profiles, parity of cysteine residues, average cysteine conservation, correlated mutation, sub-cellular localization, and signal peptide) as inputs, and considering eukaryotes and prokaryotes separately, we reached an accuracy of 94% on a per-cysteine basis for both the eukaryotic and prokaryotic datasets, and accuracies of 90% and 93% on a per-protein basis for the eukaryotic and prokaryotic datasets, respectively. These accuracies are the highest reached so far by any existing prediction method; our method thus outperforms all previously developed approaches and is therefore more reliable. The most interesting part of this thesis is the difference in prediction performance between eukaryotes and prokaryotes at the basic level of input coding when ‘profile’ information was given as input to our prediction method.
One reason for this, we found, is the difference in the amino acid composition of the local environment of bonded and free cysteine residues in eukaryotes and prokaryotes. Eukaryotic bonded cysteine examples have a ‘symmetric-cysteine-rich’ environment, whereas prokaryotic bonded examples lack it.
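Two of the simpler inputs mentioned in this abstract, cysteine parity and the cysteine content of each cysteine's local environment, can be computed directly from the sequence. The sketch below is only an illustration of such features, not the HNN itself; the flank size of 5 residues and the function name `cysteine_features` are assumptions.

```python
def cysteine_features(seq, flank=5):
    """Compute simple cysteine-based features from a protein sequence.

    Returns (parity, environments), where parity is 0 for an even number of
    cysteines and 1 for an odd number, and environments is a list of
    (position, neighbouring_cysteines) tuples with 1-based positions.
    Neighbours are counted within `flank` residues on each side.
    """
    seq = seq.upper()
    positions = [i for i, aa in enumerate(seq) if aa == "C"]
    parity = len(positions) % 2

    environments = []
    for pos in positions:
        left = max(0, pos - flank)
        right = min(len(seq), pos + flank + 1)
        # Exclude the central cysteine itself from its own environment.
        neighbours = seq[left:pos] + seq[pos + 1:right]
        environments.append((pos + 1, neighbours.count("C")))
    return parity, environments


if __name__ == "__main__":
    print(cysteine_features("ACDEFCGHIKLMNPQRSTVWC"))
```

An odd cysteine count guarantees that at least one cysteine is not in an intrachain disulfide bond, which is why parity is a plausible global feature.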
Abstract:
The twin arginine translocation (TAT) system ferries folded proteins across the bacterial membrane. Proteins are directed into this system by the TAT signal peptide present at the amino terminus of the precursor protein, which contains the twin arginine residues that give the system its name. There are currently only two computational methods for the prediction of TAT-translocated proteins from sequence. Both methods have limitations that make the creation of a new algorithm for TAT-translocated protein prediction desirable. We have developed TATPred, a new sequence-model method, based on a naive Bayesian network, for the prediction of TAT signal peptides. In this approach, a comprehensive range of models was tested to identify the most reliable and robust predictor. The best model comprised 12 residues: the three residues preceding the twin arginines, the twin arginines themselves, and the seven residues that follow them. We found a prediction sensitivity of 0.979 and a specificity of 0.942.
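The general idea of scoring a 12-residue window around a twin-arginine pair with a naive Bayes model can be sketched as follows. This is not TATPred: the abstract does not give its network structure or training data, so the position-independent model, the add-one smoothing, the toy training windows, and all function names here are assumptions. The window layout (three residues, the RR pair, seven residues) is inferred from the stated total of 12.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def rr_windows(seq, before=3, after=7):
    """Return every window of `before` residues + 'RR' + `after` residues."""
    seq = seq.upper()
    return [
        seq[i - before:i + 2 + after]
        for i in range(before, len(seq) - after - 1)
        if seq[i:i + 2] == "RR"
    ]


def train(windows):
    """Estimate per-position residue log-probabilities with add-one smoothing."""
    total = len(windows) + len(AMINO_ACIDS)
    model = []
    for pos in range(len(windows[0])):
        counts = Counter(w[pos] for w in windows)
        model.append({aa: math.log((counts[aa] + 1) / total) for aa in AMINO_ACIDS})
    return model


def log_likelihood(model, window):
    """Sum per-position log-probabilities (the naive independence assumption)."""
    uniform = math.log(1 / len(AMINO_ACIDS))  # fallback for non-standard codes
    return sum(col.get(aa, uniform) for col, aa in zip(model, window))


def log_odds(pos_model, neg_model, window, prior_pos=0.5):
    """Log posterior odds that `window` belongs to the positive class."""
    prior = math.log(prior_pos) - math.log(1 - prior_pos)
    return prior + log_likelihood(pos_model, window) - log_likelihood(neg_model, window)


if __name__ == "__main__":
    # Toy windows invented for illustration; they are not real training data.
    positives = ["MSTRRAFLKGAA", "LSSRRDFLKLAA"]
    negatives = ["DEKRRDEDEKDE", "EEKRRKDEDDKE"]
    pos_model, neg_model = train(positives), train(negatives)
    for window in rr_windows("MSTRRAFLKGAAAW"):
        print(window, log_odds(pos_model, neg_model, window))
```

A window is called positive when its log odds exceed a chosen threshold; moving that threshold trades sensitivity against specificity, the two measures reported in the abstract.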