135 results for Protein Sequence
Abstract:
Background: Signal transduction events often involve transient, yet specific, interactions between structurally conserved protein domains and polypeptide sequences in target proteins. The identification and validation of these associating domains is crucial to understanding the signal transduction pathways that modulate different cellular or developmental processes. Bioinformatics strategies to extract and integrate information from diverse sources have been shown to facilitate the experimental design needed to understand complex biological events. These methods, primarily based on information from high-throughput experiments, have also led to the identification of new connections, thus providing hypothetical models for cellular events. Such models, in turn, provide a framework for directing experimental efforts for validating the predicted molecular rationale for complex cellular processes. In this context, it is envisaged that the rational design of peptides for protein-peptide binding studies could substantially facilitate the experimental strategies to evaluate a predicted interaction. This rational design procedure involves the integration of protein-protein interaction data, gene ontology, physico-chemical calculations, domain-domain interaction data and information on functional sites or critical residues. Results: Here we describe an integrated approach called "PeptideMine" for the identification of peptides based on specific functional patterns present in the sequence of an interacting protein. This approach, based on sequence searches in the interacting sequence space, has been developed into a webserver, which can be used for the identification and analysis of peptides, peptide homologues or functional patterns from the interacting sequence space of a protein.
To further facilitate experimental validation, the PeptideMine webserver also provides a list of physico-chemical parameters corresponding to the peptide to determine the feasibility of using the peptide for in vitro biochemical or biophysical studies. Conclusions: The strategy described here involves the integration of data and tools to identify potential interacting partners for a protein and design criteria for peptides based on desired biochemical properties. Alongside the search for interacting protein sequences using three different search programs, the server also provides the biochemical characteristics of candidate peptides to prune peptide sequences based on features that are most suited for a given experiment. The PeptideMine server is available at the URL: http://caps.ncbs.res.in/peptidemine
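As an illustration of the kind of physico-chemical screening the PeptideMine abstract describes, the sketch below computes a few simple descriptors for a candidate peptide. It is not the server's implementation: the function name `peptide_properties`, the choice of descriptors, and the crude charge estimate (Lys/Arg counted as +1, Asp/Glu as -1, His and termini ignored) are all assumptions made for this example. The hydropathy values are the standard Kyte-Doolittle scale.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def peptide_properties(seq):
    """Return simple descriptors for a peptide (hypothetical helper).

    The charge is a rough estimate: +1 per Lys/Arg, -1 per Asp/Glu.
    Histidine and the chain termini are deliberately ignored.
    """
    seq = seq.upper()
    unknown = set(seq) - set(KD)
    if not seq or unknown:
        raise ValueError(f"empty or non-standard sequence: {sorted(unknown)}")
    basic = sum(seq.count(r) for r in "KR")
    acidic = sum(seq.count(r) for r in "DE")
    hydrophobic = sum(seq.count(r) for r in "AILMFVW")
    return {
        "length": len(seq),
        "gravy": sum(KD[r] for r in seq) / len(seq),  # mean hydropathy
        "approx_net_charge": basic - acidic,
        "hydrophobic_fraction": hydrophobic / len(seq),
    }


if __name__ == "__main__":
    print(peptide_properties("KKDE"))
```

A strongly negative mean hydropathy combined with a non-zero net charge would, under these assumptions, point to a peptide that is more likely to be soluble in aqueous buffer; any real cutoff would have to come from the experiment at hand.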
Abstract:
The crystal structures of two forms of Mycobacterium leprae single-stranded DNA-binding protein (SSB) have been determined at 2.05 and 2.8 Å resolution. Comparison of these structures with the structures of other eubacterial SSBs indicates considerable variation in their quaternary association, although the DNA-binding domains in all of them exhibit the same OB-fold. This variation has no linear correlation with sequence variation, but could be related to variation in protein stability. Molecular-dynamics simulations have been carried out on tetrameric molecules derived from the two forms and the prototype Escherichia coli SSB and the individual subunits of both proteins. Together, the X-ray studies and molecular-dynamics simulations yield information on the relatively rigid and flexible regions of the molecule and on the effect of oligomerization on flexibility. The simulations provide insight into the changes in subunit structure on oligomerization. They also provide insight into the stability and time evolution of the hydrogen bonds/water bridges that connect the two pairs of monomers in the tetramer.
Abstract:
The 3' terminal 1255 nt sequence of Physalis mottle virus (PhMV) genomic RNA has been determined from a set of overlapping cDNA clones. The open reading frame (ORF) at the 3' terminus corresponds to the amino acid sequence of the coat protein (CP) determined earlier except for the absence of the dipeptide, Lys-Leu, at position 110-111. In addition, the sequence upstream of the CP gene contains the message coding for 178 amino acid residues of the C-terminus of the putative replicase protein (RP). The sequence downstream of the CP gene contains an untranslated region whose terminal 80 nucleotides can be folded into a characteristic tRNA-like structure. A phylogenetic tree constructed after aligning separately the sequence of the CP, the replicase protein (RP) and the tRNA-like structure determined in this study with the corresponding sequences of other tymoviruses shows that PhMV, wrongly named belladonna mottle virus [BDMV(I)], is a separate tymovirus and not another strain of BDMV(E) as originally envisaged. The phylogenetic tree is identical in all three cases, showing that any subset of genomic sequence of sufficient length can be used for establishing evolutionary relationships among tymoviruses.
Abstract:
Monoclonal antibodies raised against chicken egg white riboflavin carrier protein were classified into seven categories each recognizing a distinct epitope. Of these, six were directed against conformation dependent epitopes and one to a sequential epitope. The roles of lysine residues and the post-translationally attached phosphate and oligosaccharide moieties in the antigenicity of riboflavin carrier protein recognized by the monoclonal antibodies were investigated. The binding region of three monoclonal antibodies could be located within the 87–219 amino acid sequence of the protein and one antibody among these recognized a sequence of 182–204 amino acid residues. All the monoclonal antibodies were able to recognize riboflavin carrier proteins present in the sera of pregnant rats, cows and humans indicating that the epitopes to which they are directed are conserved through evolution from chicken to the human.
Abstract:
Monoclonal antibodies raised against human serum retinol-binding protein (hRBP) were used as probes for the study of the antigenic determinants of hRBP and those shared with the same protein from other species. The antibodies could be classified into four distinct groups and reacted with the homologous proteins from rat as well as rabbit sera. Three of these antibodies recognize sequential or continuous epitopes while the remaining antibody is directed against a discontinuous or conformational epitope. By chemical cleavage with cyanogen bromide, the domains recognized by the monoclonal antibodies could be delineated. By a solid-phase synthetic approach, the core sequences recognized by two of these monoclonal antibodies were identified as amino acid sequences 45–51 and 128–131 of the primary amino acid sequence of hRBP.
Abstract:
Sesbania mosaic virus (SeMV) is a single-stranded positive-sense RNA plant virus belonging to the genus Sobemovirus. The movement protein (MP) encoded by SeMV ORF1 showed no significant sequence similarity with MPs of other genera, but showed 32% identity with the MP of Southern bean mosaic virus within the Sobemovirus genus. With a view to understanding the mechanism of cell-to-cell movement in sobemoviruses, the SeMV MP gene was cloned, over-expressed in Escherichia coli and purified. Interaction of the recombinant MP with the native virus (NV) was investigated by ELISA and pull-down assays. It was observed that SeMV MP interacted with NV in a concentration- and pH-dependent manner. Analysis of N- and C-terminal deletion mutants of the MP showed that SeMV MP interacts with the NV through the N-terminal 49 amino acid segment. Yeast two-hybrid assays confirmed the in vitro observations, and suggested that SeMV might belong to the class of viruses that require MP and NV/coat protein for cell-to-cell movement.
Abstract:
Sequence-specific resonance assignment constitutes an important step towards high-resolution structure determination of proteins by NMR and is aided by selective identification and assignment of amino acid types. The traditional approach to selective labeling yields only the chemical shifts of the particular amino acid being selected and does not help in establishing a link between adjacent residues along the polypeptide chain, which is important for sequential assignments. An alternative approach is the method of amino acid selective "unlabeling" or reverse labeling, which involves selective unlabeling of specific amino acid types against a uniformly ¹³C/¹⁵N-labeled background. Based on this method, we present a novel approach for sequential assignments in proteins. The method involves a new NMR experiment named {¹²CO(i)-¹⁵N(i+1)}-filtered HSQC, which aids in linking the ¹H(N)/¹⁵N resonances of the selectively unlabeled residue, i, and its C-terminal neighbor, i + 1, in HN-detected double and triple resonance spectra. This leads to the assignment of a tri-peptide segment from the knowledge of the amino acid types of residues i - 1, i and i + 1, thereby speeding up the sequential assignment process. The method has the advantage of being relatively inexpensive, is applicable to ²H-labeled protein, and can be coupled with cell-free synthesis and/or automated assignment approaches. A detailed survey involving unlabeling of different amino acid types individually or in pairs reveals that the proposed approach is also robust to misincorporation of ¹⁴N at undesired sites. Taken together, this study represents the first application of selective unlabeling for sequence-specific resonance assignments and opens up new avenues to using this methodology in protein structural studies.
Abstract:
Heat shock protein 90 participates in diverse biological processes ranging from protein folding, cell cycle, signal transduction and development to evolution in all eukaryotes. It is also critically involved in regulating growth of protozoa such as Dictyostelium discoideum, Leishmania donovani, Plasmodium falciparum, Trypanosoma cruzi, and Trypanosoma evansi. Selective inhibition of Hsp90 has also been explored as an intervention strategy against important human diseases such as cancer, malaria, or trypanosomiasis. Giardia lamblia, a simple protozoan parasite of humans and animals, is an important cause of diarrheal disease with significant morbidity and some mortality in tropical countries. Here we show that the G. lamblia cytosolic hsp90 (glhsp90) is split in two similar-sized fragments located 777 kb apart on the same scaffold. Intrigued by this unique arrangement, which appears to be specific for the Giardiinae, we have investigated the biosynthesis of GlHsp90. We used genome sequencing to confirm the split nature of the giardial hsp90. However, a specific antibody raised against the peptide detected a product with a mass of about 80 kDa, suggesting a post-transcriptional rescue of the genomic defect. We show evidence for the joining of the two independent Hsp90 transcripts in-trans to one long mature mRNA presumably by RNA splicing. The splicing junction carries hallmarks of classical cis-spliced introns, suggesting that the regular cis-splicing machinery may be sufficient for repair of the open reading frame. A complementary 26-nt sequence in the "intron" regions adjacent to the splice sites may assist in positioning the two pre-mRNAs for processing. This is the first example of post-transcriptional rescue of a split gene by trans-splicing.
Abstract:
EcoP15I DNA methyltransferase (Mtase) recognizes the asymmetric sequence CAGCAG and catalyzes the transfer of a methyl group from S-adenosyl-L-methionine to the second adenine residue. We have investigated the DNA binding properties of EcoP15I DNA Mtase using gel mobility shift assays. EcoP15I DNA Mtase binds approximately threefold more tightly to DNA containing its recognition sequence, CAGCAG, than to non-specific sequences in the absence or presence of cofactors. Interestingly, in the presence of ATP the discrimination between specific and non-specific sequences increases significantly. These results suggest for the first time a role for ATP in DNA recognition by type III restriction-modification enzymes. In addition, we have shown that bromodeoxyuridine-containing oligonucleotides form complexes with EcoP15I DNA Mtase that are crosslinked upon irradiation. More importantly, we have shown that the crosslink site is at the site of DNA binding, since it can be suppressed by an excess of unmodified oligonucleotide. EcoP15I DNA Mtase exhibited Michaelis-Menten kinetics with both unmodified and bromodeoxyuridine-substituted DNA, with a higher specificity constant for the latter. Furthermore, gel mobility shift assays showed that proteolyzed EcoP15I DNA Mtase formed a specific complex with DNA, which had a mobility similar to that of the native protein-DNA complex. Taken together, these results form the basis for a detailed structure-function analysis of EcoP15I DNA Mtase.
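Because the CAGCAG recognition sequence named in the EcoP15I abstract is asymmetric, a site can sit on either strand of a duplex. The following sketch locates every occurrence on both strands and reports the top-strand coordinate of the second adenine, the residue the abstract names as the methylation target. The function names, the 0-based coordinate convention, and the tuple layout are assumptions made for this illustration; only A, C, G and T are handled.

```python
SITE = "CAGCAG"          # recognition sequence from the abstract
TARGET_OFFSET = 4        # index of the second adenine within CAGCAG
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return dna.translate(COMPLEMENT)[::-1]


def find_all(seq, pattern):
    """Return start indices of all (possibly overlapping) matches."""
    hits = []
    start = seq.find(pattern)
    while start != -1:
        hits.append(start)
        start = seq.find(pattern, start + 1)
    return hits


def ecop15i_sites(dna):
    """List (strand, site_start, target_adenine) in 0-based top-strand coordinates.

    For "-" strand sites, target_adenine is the top-strand position
    whose complementary base is the adenine in question.
    """
    dna = dna.upper()
    n = len(dna)
    sites = [("+", pos, pos + TARGET_OFFSET) for pos in find_all(dna, SITE)]
    for pos in find_all(reverse_complement(dna), SITE):
        # Map bottom-strand indices back onto the top strand.
        sites.append(("-", n - pos - len(SITE), n - 1 - (pos + TARGET_OFFSET)))
    return sorted(sites, key=lambda s: s[1])


if __name__ == "__main__":
    print(ecop15i_sites("TTCAGCAGTT"))
```

Searching the reverse complement rather than hard-coding CTGCTG keeps the lookup valid if `SITE` is swapped for another asymmetric motif, provided `TARGET_OFFSET` is updated to match.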
Abstract:
Structure comparison tools can be used to align related protein structures to identify structurally conserved and variable regions and to infer functional and evolutionary relationships. While the conserved regions often superimpose well, the variable regions appear non-superimposable. Differences in homologous protein structures are thought to be due to evolutionary plasticity to accommodate diverged sequences during evolution. One of the kinds of differences between 3-D structures of homologous proteins is rigid body displacement. A glaring example is that of equivalent regions of homologous proteins that adopt an α-helical conformation but, having different spatial orientations, do not superimpose well. In a rigid body superimposition, these regions would appear variable although they may contain local similarity. Also, due to high spatial deviation in the variable region, one-to-one correspondence at the residue level cannot be determined accurately. Another kind of difference is conformational variability, and the most common example is topologically equivalent loops of two homologues that have different conformations. In the current study, we present a refined view of the "structurally variable" regions, which may contain local similarity obscured in global alignment of homologous protein structures. As a structural alphabet is able to describe local structures of proteins precisely through the Protein Blocks approach, conformational similarity has been identified in a substantial number of "variable" regions in a large data set of protein structural alignments; optimal residue-residue equivalences could be achieved on the basis of Protein Blocks, which led to improved local alignments. Also, through an example, we have demonstrated how the additional information on local backbone structures through Protein Blocks can aid in comparative modeling of a loop region. In addition, understanding of sequence-structure relationships can be enhanced through our approach. This has been illustrated through examples where the equivalent regions in homologous protein structures share sequence similarity to varying extents but do not preserve local structure.
Abstract:
The concept of one enzyme-one activity has influenced biochemistry for over half a century. Over 1000 enzymes are now described. Many of them are highly 'specific'. Some of them have been crystallized and their three-dimensional structures determined. They range from 12 to 1000 kDa in molecular weight and possess 124 to several hundreds of amino acids. They occur as single polypeptides or multiple-subunit proteins. The active sites are assembled on these by appropriate tertiary folding of the polypeptide chain, or by interaction of the constituent subunits. The substrate is held by the side-chains of a few amino acids at the active site on the surface, occupying a tiny fraction of the total area. What is the bulk of the protein behind the active site doing? Do all proteins have only one function each? Why should a protein not have more than one active site on its large surface? Will we discover more than one activity for some proteins? These newer possibilities are emerging and are finding experimental support. Some proteins purified to homogeneity using assay methods for different activities are now recognized to have the same molecular weight and a high degree of homology of amino acid sequence. Obviously they are identical. They represent the phenomenon of one protein-many functions.
Abstract:
Sesbania mosaic virus (SMV) is a plant virus that infects Sesbania grandiflora plants in Andhra Pradesh, India. The amino acid sequence of the coat protein of SMV was determined using purified peptides generated by cleavage with trypsin, chymotrypsin, V8 protease and clostripain. The 230 residues so far determined were compared to the corresponding residues of southern bean mosaic virus (SBMV), the type member of sobemoviruses. The overall identity between the sequences is 61.7%. The amino terminal 64 residues, which constitute an independent domain (R-domain) known to interact with RNA, are conserved to a lower extent (52.5%). Comparison of the positively charged residues in this domain suggests that the RNA-protein interactions are considerably weaker in SMV. The residues that constitute the major domain of the coat protein, the surface domain (S-domain, residues 65-260), are better conserved (66.5%). The positively charged residues of this domain that face the nucleic acid are well conserved. The longest conserved stretch of residues (131-142) corresponds to the loop involved in intersubunit interactions between subunits related by the quasi 3-fold symmetry. A unique cation binding site located on the quasi 3-fold axis contributes to the stability of SMV. These differences are reflected in the increased stability of the SMV coat protein and its ability to be reconstituted with RNA at pH 7.5. A major epitope was identified using monoclonal antibodies to SMV in the segment 201-223 which contains an exposed helix in the capsid structure. This region is highly conserved between SMV and SBMV (70%) suggesting that it could represent the site of an important function such as vector recognition.
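Identity figures such as the 61.7% quoted for the SMV and SBMV coat proteins are computed from a pairwise alignment. The sketch below shows one common way to do this; it is an illustration only, not the authors' procedure. The function name `percent_identity` and the choice of denominator (aligned columns in which neither sequence has a gap) are assumptions, and other conventions, such as dividing by the length of the shorter sequence, give different values.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over gap-free columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, gaps as `gap`).
    Columns where either sequence has a gap are skipped entirely.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy alignment, not real SMV/SBMV sequence data.
    print(percent_identity("AC-DE", "ACWDF"))
```

The same function can be applied to a slice of the alignment to obtain a per-domain figure, in the spirit of the separate R-domain and S-domain values reported in the abstract.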
Abstract:
A successful protein-protein docking study culminates in identification of decoys at top ranks with near-native quaternary structures. However, this task remains enigmatic because no generalized scoring functions exist that effectively rank decoys according to their similarity to near-native quaternary structures. Difficulties arise because of the highly irregular nature of the protein surface and the significant variation of the nonbonding and solvation energies based on the chemical composition of the protein-protein interface. In this work, we describe a novel method combining an interface-size filter, a regression model for geometric compatibility (based on two correlated surface and packing parameters), and normalized interaction energy (calculated from correlated nonbonded and solvation energies), to effectively rank decoys from a set of 10,000 decoys. Tests on 30 unbound binary protein-protein complexes show that in 16 cases we can identify at least one decoy in the top three ranks having ≤ 10 Å backbone root mean square deviation from the true binding geometry. Comparisons with other state-of-the-art methods confirm the improved ranking power of our method without the use of any experiment-guided restraints, evolutionary information, statistical propensities, or modified interaction energy equations. Tests on 118 less-difficult bound binary protein-protein complexes with ≤ 35% sequence redundancy at the interface showed that in 77% of cases, at least 1 in 10,000 decoys was identified with ≤ 5 Å backbone root mean square deviation from the true geometry at first rank. The work will promote the use of new concepts where correlations among parameters provide more robust scoring models. It will facilitate studies involving molecular interactions, including modeling of large macromolecular assemblies and protein structure prediction. © 2010 Wiley Periodicals, Inc. J Comput Chem 32: 787-796, 2011.
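The success criterion in the docking abstract (a decoy within a given backbone RMSD of the true geometry among the top-ranked decoys) can be sketched as follows. This covers only the evaluation step, not the authors' scoring model. It assumes the decoy and native backbone atoms are already matched one to one and expressed in the same frame, so no superposition is performed; the function names and default thresholds are hypothetical choices for the example.

```python
import math


def backbone_rmsd(coords_a, coords_b):
    """RMSD between two equally long lists of (x, y, z) tuples.

    Assumes atoms are paired by index and already in a common frame;
    no optimal superposition is attempted.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and equal in length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


def hit_in_top_n(ranked_rmsds, n=3, cutoff=10.0):
    """True if any of the first n ranked decoys has RMSD <= cutoff (in Å)."""
    return any(r <= cutoff for r in ranked_rmsds[:n])


if __name__ == "__main__":
    native = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    decoy = [(0.0, 0.0, 3.0), (1.0, 0.0, 3.0)]
    print(backbone_rmsd(native, decoy))
    print(hit_in_top_n([12.0, 8.5, 20.0]))
```

With `n=1` and `cutoff=5.0`, the same check corresponds to the first-rank criterion quoted for the bound complexes.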
Abstract:
In attempts to convert an elongator tRNA to an initiator tRNA, we previously generated a mutant elongator methionine tRNA carrying an anticodon sequence change from CAU to CUA along with the two features important for activity of Escherichia coli initiator tRNA in initiation. This mutant tRNA (Mi:2 tRNA) was active in initiation in vivo but only when aminoacylated with methionine by overproduction of methionyl-tRNA synthetase. Here we show that the Mi:2 tRNA is normally aminoacylated in vivo with lysine and that the tRNA aminoacylated with lysine is a very poor substrate for formylation compared with the same tRNA aminoacylated with methionine. By introducing further changes at base pairs 4:69 and 5:68 in the acceptor stem of the Mi:2 tRNA to those found in the E. coli initiator tRNA, we show that change of the U4:A69 base pair to G4:C69 and overproduction of lysyl-tRNA synthetase and methionyl-tRNA transformylase results in partial formylation of the mutant tRNA and activity of the formyllysyl-tRNAs in initiation of protein synthesis. Thus, the G4:C69 base pair contributes toward formylation of the tRNA and protein synthesis in E. coli can be initiated with formyllysine. We also discuss the implications of these and other results on recognition of tRNAs by E. coli lysyl-tRNA synthetase and on competition in cells among aminoacyl-tRNA synthetases.
Abstract:
In the absence of interlogs, building docking models is a time-intensive task, involving generation of a large pool of docking decoys followed by refinement and screening to identify near-native docking solutions. This limits researchers interested in building docking methods to benchmarking on only a limited number of protein complexes. We have created a repository called dockYard (http://pallab.serc.iisc.ernet.in/dockYard), which allows modelers interested in protein-protein interaction to access a large volume of information on protein dimers and their interlogs, and also to download decoys for their work if they are interested in building modeling methods. dockYard currently offers four categories of docking decoys derived from: Bound (native dimer co-crystallized), Unbound (individual subunits are crystallized, as well as the target dimer), Variants (match the previous two categories in at least one subunit with 100% sequence identity), and Interlogs (match the previous categories in at least one subunit with ≥ 90% or ≥ 50% sequence identity). The web service offers options for full or selective download based on search parameters. Our portal also serves as a repository for modelers who may want to share their decoy sets with the community.