109 resultados para Amino Acid Sequence
em Indian Institute of Science - Bangalore - Índia
Resumo:
The complete amino acid sequence of winged bean basic agglutinin (WBA I) was obtained by a combination of manual and gas-phase sequencing methods. Peptide fragments for sequence analyses were obtained by enzymatic cleavages using trypsin and Staphylococcus aureus V8 endoproteinase and by chemical cleavages using iodosobenzoic acid, hydroxylamine, and formic acid. COOH-terminal sequence analysis of WBA I and other peptides was performed using carboxypeptidase Y. The primary structure of WBA I was homologous to those of other legume lectins and more so to Erythrina corallodendron. Interestingly, the sequence shows remarkable identities in the regions involved in the association of the two monomers of E. corallodendron lectin. Other conserved regions are the double metal-binding site and residues contributing to the formation of the hydrophobic cavity and the carbohydrate-binding site. Chemical modification studies both in the presence and absence of N-acetylgalactosamine together with sequence analyses of tryptophan-containing tryptic peptides demonstrate that tryptophan 133 is involved in the binding of carbohydrate ligands by the lectin. The location of tryptophan 133 at the active center of WBA I for the first time subserves to explain a role for one of the most conserved residues in legume lectins.
Resumo:
The conformation of amino acid side chains as observed in well-determined structures of globular proteins has earlier been extensively investigated. In contrast, the structural features of the polypeptide backbone that result from the occurrence of specific amino acids along the polypeptide have not been analysed. In this article, we present the statistically significant features in the backbone geometry that appear to be a consequence of the occurrence of rotamers of different amino acid side chains by analysing 102 well-refined structures that form a random collection of proteins. It is found that the persistence of helical segments around each residue is influenced by the residue type. Several residues exert asymmetrical influence between the carboxyl and amino terminal polypeptide segments. The degree to which secondary structures depart from an average geometry also appears to depend on residue type. These departures are correlated to the corresponding Chou and Fasman parameters of amino acid residues. The frequency distribution of the side chain rotamers is influenced by polypeptide secondary structure. In turn, the rotamer conformation of side chain affects the extension of the secondary structure of the backbone. The strongest correlation is found between the occurrence of g+ conformation and helix propagation on the carboxyl side of many residues.
Resumo:
NSP3, an acidic nonstructural protein, encoded by gene 7 has been implicated as the key player in the assembly of the 11 viral plus-strand RNAs into the early replication intermediates during rotavirus morphogenesis. To date, the sequence or NSP3 from only three animal rotaviruses (SA11, SA114F, and bovine UK) has been determined and that from a human strain has not been reported. To determine the genetic diversity among gene 7 alleles from group A rotaviruses, the nucleotide sequence of the NSP3 gene from 13 strains belonging to nine different G serotypes, from both humans and animals, has been determined. Based on the amino acid sequence identity as well as phylogenetic analysis, NSP3 from group A rotaviruses falls into three evolutionarily related groups, i.e., the SA11 group, the Wa group, and the S2 group. The SA 11/SA114F gene appears to have a distant ancestral origin from that of the others and codes for a polypeptide of 315 amino acids (aa) in length. NSP3 from all other group A rotaviruses is only 313 aa in length because of a 2-amino-acid deletion near the carboxy-terminus, While the SA114F gene has the longest 3' untranslated region (UTR) of 132 nucleotides, that from other strains suffered deletions of varying lengths at two positions downstream of the translational termination codon. In spite of the divergence of the nucleotide (nt) sequence in the protein coding region, a stretch of about 80 nt in the 3' UTR is highly conserved in the NSP3 gene from all the strains. This conserved sequence in the 3' UTR might play an important role in the regulation of expression of the NSP3 gene. (C) 1995 Academic Press, Inc.
Resumo:
Using a dataset of 1164 crystal structures of largely non-homologous proteins defined at a resolution of 1.5 angstrom or better, we have investigated the (phi,psi) preferences of 20 residue types by considering the residues which occur in loops. Propensities of residue types to occur in the loops with (phi,psi) values in the aa region of the Ramachandran map has a poor correlation coefficient of 0.48 to the Chou-Fasman propensities of the residue types to occur in the a-helical segments. However the correlation coefficient between propensities of residues in loops to adopt beta conformations and those in beta-sheet is much higher (0.95). These observations suggest that a-helix formation is well influenced by the local amino acid sequence while intrinsic preference of residue types for beta-sheet plays a major role in the formation of beta-sheet. The main chain polar groups of residues in loops, that can affect the (phi,psi) values, can be involved in intra-molecular hydrogen bonding. Therefore we investigated further by considering subset of residues in loops with low (0 to 2) number of intra-molecular hydrogen bonds per residue involving main chain polar atoms. For this subset, the correlation coefficients between propensities for alpha-helix and alpha(R) region and between beta-sheet and beta-region are 0.26 and 0.64 respectively. This reiterates higher intrinsic tendency of beta-region favouring residues to adopt beta-sheet than alpha(R) region favouring residues to adopt alpha-helical structure.
Resumo:
The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as Protein Blocks (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales-up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa.
Resumo:
The notion of optimization is inherent in protein design. A long linear chain of twenty types of amino acid residues are known to fold to a 3-D conformation that minimizes the combined inter-residue energy interactions. There are two distinct protein design problems, viz. predicting the folded structure from a given sequence of amino acid monomers (folding problem) and determining a sequence for a given folded structure (inverse folding problem). These two problems have much similarity to engineering structural analysis and structural optimization problems respectively. In the folding problem, a protein chain with a given sequence folds to a conformation, called a native state, which has a unique global minimum energy value when compared to all other unfolded conformations. This involves a search in the conformation space. This is somewhat akin to the principle of minimum potential energy that determines the deformed static equilibrium configuration of an elastic structure of given topology, shape, and size that is subjected to certain boundary conditions. In the inverse-folding problem, one has to design a sequence with some objectives (having a specific feature of the folded structure, docking with another protein, etc.) and constraints (sequence being fixed in some portion, a particular composition of amino acid types, etc.) while obtaining a sequence that would fold to the desired conformation satisfying the criteria of folding. This requires a search in the sequence space. This is similar to structural optimization in the design-variable space wherein a certain feature of structural response is optimized subject to some constraints while satisfying the governing static or dynamic equilibrium equations. Based on this similarity, in this work we apply the topology optimization methods to protein design, discuss modeling issues and present some initial results.
Resumo:
Sequence specific resonance assignment constitutes an important step towards high-resolution structure determination of proteins by NMR and is aided by selective identification and assignment of amino acid types. The traditional approach to selective labeling yields only the chemical shifts of the particular amino acid being selected and does not help in establishing a link between adjacent residues along the polypeptide chain, which is important for sequential assignments. An alternative approach is the method of amino acid selective `unlabeling' or reverse labeling, which involves selective unlabeling of specific amino acid types against a uniformly C-13/N-15 labeled background. Based on this method, we present a novel approach for sequential assignments in proteins. The method involves a new NMR experiment named, {(CO)-C-12 (i) -N-15 (i+1)}-filtered HSQC, which aids in linking the H-1(N)/N-15 resonances of the selectively unlabeled residue, i, and its C-terminal neighbor, i + 1, in HN-detected double and triple resonance spectra. This leads to the assignment of a tri-peptide segment from the knowledge of the amino acid types of residues: i - 1, i and i + 1, thereby speeding up the sequential assignment process. The method has the advantage of being relatively inexpensive, applicable to H-2 labeled protein and can be coupled with cell-free synthesis and/or automated assignment approaches. A detailed survey involving unlabeling of different amino acid types individually or in pairs reveals that the proposed approach is also robust to misincorporation of N-14 at undesired sites. Taken together, this study represents the first application of selective unlabeling for sequence specific resonance assignments and opens up new avenues to using this methodology in protein structural studies.
Resumo:
L-Lysine d-pantothenate, a 1:1 amino acid-vitamin complex, crystallizes in the monoclinic space group P21 with Image Full-size image (1K) .The structure has been solved by direct methods and refined to an R value of 0.053 for 1868 observed reflections. The zwitterionic positively charged lysine molecules in the structure assume the sterically most favourable conformation with an all-trans side chain trans to the α-carboxylate group. The pantothenate anion has a somewhat folded conformation stabilised by an intramolecular bifurcated hydrogen bond. The unlike molecules aggregate into separate alternating layers. The molecules in the lysine layers form a head-to-tail sequence parallel to the a-axis. The interactions which hold the adjacent layers together include those between the side chain amino group of lysine and the carboxylate group in the pantothenate anion. The geometry of these interactions is such that each carboxylate group is sandwiched between two amino groups in a periodic arrangement of alternating carboxylate and amino groups.
Resumo:
The primary structure of collagen is characterized by the repeating tripeptide sequence (Gly-R2-R3)n. The results of theoretical studies, carried out using contact criteria to compute the stereochemically allowed orientations for various side chains at locations 2 and 3, are reported here. It is found that side chains with only γ-atoms, as in valine, serine and threonine, or with only one δ-methyl group, as in isoleucine, can occur equally well at locations 2 and 3, as is actually the case in collagen. Side chains with two Cδ-atoms, as in leucine and phenyl-alanine, can also be accommodated at both positions. However, if they occur as R3 their freedom of orientation is severely restricted in the presence of a proline residue as R2 in a neighbouring chain. If water molecules bound to the chains of the triple helix are assumed to be present, then location 3 is virtually impossible for leucine and phenylalanine residues. Location 2 is, however, unaffected, and their presence as R2 can help to shield the water molecules from disturbance by the solvent medium. This may be the reason for the preferential occurrence of Leu and Phe residues in location 2 in the collagen triplets, although the polypeptides (Gly-Pro-Leu)n and (Gly-Pro-Phe)n form collagen-like structures.
Resumo:
Plant seeds contain a large number of protease inhibitors of animal, fungal, and bacterial origin. One of the well-studied families of these inhibitors is the Bowman-Birk family(BBI). The BBIs from dicotyledonous seeds are 8K, double-headed proteins. In contrast, the 8K inhibitors from monocotyledonous seeds are single headed. Monocots also have a 16K, double-headed inhibitor. We have determined the primary structure of a Bowman-Birk inhibitor from a dicot, horsegram, by sequential edman analysis of the intact protein and peptides derived from enzymatic and chemical cleavage. The 76-residue-long inhibitor is very similar to that ofMacrotyloma axillare. An analysis of this inhibitor along with 26 other Bowman-Birk inhibitor domains (MW 8K) available in the SWISSPROT databank revealed that the proteins from monocots and dicots belong to related but distinct families. Inhibitors from monocots show larger variation in sequence. Sequence comparison shows that a crucial disulphide which connects the amino and carboxy termini of the active site loop is lost in monocots. The loss of a reactive site in monocots seems to be correlated to this. However, it appears that this disulphide is not absolutely essential for retention of inhibitory function. Our analysis suggests that gene duplication leading to a 16K inhibitor in monocots has occurred, probably after the divergence of monocots and dicots, and also after the loss of second reactive site in monocots.
Resumo:
Understanding the key factors that influence the interaction preferences of amino acids in the folding of proteins have remained a challenge. Here we present a knowledge-based approach for determining the effective interactions between amino acids based on amino acid type, their secondary structure, and the contact based environment that they find themselves in the native state structure as measured by their number of neighbors. We find that the optimal information is approximately encoded in a 60 x 60 matrix describing the 20 types of amino acids in three distinct secondary structures (helix, beta strand, and loop). We carry out a clustering scheme to understand the similarity between these interactions and to elucidate a nonredundant set. We demonstrate that the inferred energy parameters can be used for assessing the fit of a given sequence into a putative native state structure.
Resumo:
In attempts to convert an elongator tRNA to an initiator tRNA, we previously generated a mutant elongator methionine tRNA carrying an anticodon sequence change from CAU to CUA along with the two features important for activity of Escherichia coli initiator tRNA in initiation. This mutant tRNA (Mi:2 tRNA) was active in initiation in vivo but only when aminoacylated with methionine by overproduction of methionyl-tRNA synthetase. Here we show that the Mi:2 tRNA is normally aminoacylated in vivo with lysine and that the tRNA aminoacylated with lysine is a very poor substrate for formylation compared with the same tRNA aminoacylated with methionine. By introducing further changes at base pairs 4:69 and 5:68 in the acceptor stem of the Mi:2 tRNA to those found in the E. coli initiator tRNA, we show that change of the U4:A69 base pair to G4:C69 and overproduction of lysyl-tRNA synthetase and methionyl-tRNA transformylase results in partial formylation of the mutant tRNA and activity of the formyllysyl-tRNAs in initiation of protein synthesis. Thus, the G4:C69 base pair contributes toward formylation of the tRNA and protein synthesis in E. coli can be initiated with formyllysine. We also discuss the implications of these and other results on recognition of tRNAs by E. coli lysyl-tRNA synthetase and on competition in cells among aminoacyl-tRNA synthetases.
Resumo:
The transcription from rrn and a number of other promoters is regulated by initiating ribonucleotides (iNTPs) and guanosine tetra/penta phosphate (p)ppGpp], either by strengthening or by weakening of the RNA polymerase (RNAP)-promoter interactions during initiation. Studies in Escherichia coli revealed the importance of a sequence termed discriminator, located between -10 and the transcription start site of the responsive promoters in this mode of regulation. Instability of the open complex at these promoters is attributed to the lack of stabilizing interactions between the suboptimal discriminator and the 1.2 region of sigma 70 (Sig70) in RNAP holoenzyme. We demonstrate a different pattern of interaction between the promoters and sigma A (SigA) of Mycobacterium tuberculosis to execute similar regulation. Instead of cytosine and methionine, thymine at three nucleotides downstream to -10 element and leucine 232 in SigA are found to be essential for iNTPs and pppGpp mediated response at the rrn and gyr promoters of the organism. The specificity of the interaction is substantiated by mutational replacements, either in the discriminator or in SigA, which abolish the nucleotide mediated regulation in vitro or in vivo. Specific yet distinct bases and the amino acids appear to have co-evolved' to retain the discriminator-sigma 1.2 region regulatory switch operated by iNTPs/pppGpp during the transcription initiation in different bacteria.
Resumo:
Jacalin and artocarpin, the two lectins from jackfruit (Artocarpus integrifolia) seeds, have different physicochemical properties and carbohydrate-binding specificities. However, comparison of the partial amino-acid sequence of artocarpin with the known sequence of jacalin indicates close to 50% sequence identity. Artocarpin crystallizes in two forms, both monoclinic P2(1), with one and two tetramic molecules, respectively, in the asymmetric units of form I (a = 69.9, b = 73.7, c = 60.6 Angstrom and beta = 95.1 degrees) and form II (a = 87.6, b = 72.2, c = 92.6 Angstrom and beta = 101.1 degrees). Both the crystal structures have been solved by the molecular replacement method using the known structure of jacalin as the search model and ope of them partially refined, confirming that the two lectins are indeed homologous.
Resumo:
The DL- and L-arginine complexes of oxalic acid are made up of zwitterionic positively charged amino acid molecules and semi-oxalate ions. The dissimilar molecules aggregate into separate alternating layers in the former. The basic unit in the arginine layer is a centrosymmetric dimer, while the semi-oxalate ions form hydrogen-bonded strings in their layer. In the L-arginine complex each semi-oxalate ion is surrounded by arginine molecules and the complex can be described as an inclusion compound. The oxalic acid complexes of basic amino acids exhibit a variety of ionization states and stoichiometry. They illustrate the effect of aggregation and chirality on ionization state and stoichiometry, and that of molecular properties on aggregation. The semi-oxalate/oxalate ions tend to be planar, but large departures from planarity are possible. The amino acid aggregation in the different oxalic acid complexes do not resemble one another significantly, but the aggregation of a particular amino acid in its oxalic acid complex tends to have similarities with its aggregation in other structures. Also, semi-oxalate ions aggregate into similar strings in four of the six oxalic acid complexes. Thus, the intrinsic aggregation propensities of individual molecules tend to be retained in the complexes.