113 results for N-terminal amino acid sequence


Relevance:

100.00% 100.00%

Publisher:

Abstract:

The complete amino acid sequence of winged bean basic agglutinin (WBA I) was obtained by a combination of manual and gas-phase sequencing methods. Peptide fragments for sequence analysis were obtained by enzymatic cleavage using trypsin and Staphylococcus aureus V8 endoproteinase and by chemical cleavage using iodosobenzoic acid, hydroxylamine, and formic acid. COOH-terminal sequence analysis of WBA I and other peptides was performed using carboxypeptidase Y. The primary structure of WBA I is homologous to those of other legume lectins, and most closely to that of the Erythrina corallodendron lectin. Interestingly, the sequence shows remarkable identity in the regions involved in the association of the two monomers of E. corallodendron lectin. Other conserved regions are the double metal-binding site and the residues contributing to the formation of the hydrophobic cavity and the carbohydrate-binding site. Chemical modification studies in the presence and absence of N-acetylgalactosamine, together with sequence analyses of tryptophan-containing tryptic peptides, demonstrate that tryptophan 133 is involved in the binding of carbohydrate ligands by the lectin. The location of tryptophan 133 at the active center of WBA I explains, for the first time, a role for one of the most conserved residues in legume lectins.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The conformation of amino acid side chains as observed in well-determined structures of globular proteins has earlier been extensively investigated. In contrast, the structural features of the polypeptide backbone that result from the occurrence of specific amino acids along the polypeptide have not been analysed. In this article, we present the statistically significant features in the backbone geometry that appear to be a consequence of the occurrence of rotamers of different amino acid side chains by analysing 102 well-refined structures that form a random collection of proteins. It is found that the persistence of helical segments around each residue is influenced by the residue type. Several residues exert asymmetrical influence between the carboxyl and amino terminal polypeptide segments. The degree to which secondary structures depart from an average geometry also appears to depend on residue type. These departures are correlated to the corresponding Chou and Fasman parameters of amino acid residues. The frequency distribution of the side chain rotamers is influenced by polypeptide secondary structure. In turn, the rotamer conformation of side chain affects the extension of the secondary structure of the backbone. The strongest correlation is found between the occurrence of g+ conformation and helix propagation on the carboxyl side of many residues.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

NSP3, an acidic nonstructural protein encoded by gene 7, has been implicated as the key player in the assembly of the 11 viral plus-strand RNAs into the early replication intermediates during rotavirus morphogenesis. To date, the sequence of NSP3 from only three animal rotaviruses (SA11, SA114F, and bovine UK) has been determined, and that from a human strain has not been reported. To determine the genetic diversity among gene 7 alleles from group A rotaviruses, the nucleotide sequence of the NSP3 gene from 13 strains belonging to nine different G serotypes, from both humans and animals, has been determined. Based on amino acid sequence identity as well as phylogenetic analysis, NSP3 from group A rotaviruses falls into three evolutionarily related groups, i.e., the SA11 group, the Wa group, and the S2 group. The SA11/SA114F gene appears to have a distant ancestral origin from that of the others and codes for a polypeptide of 315 amino acids (aa) in length. NSP3 from all other group A rotaviruses is only 313 aa in length because of a 2-amino-acid deletion near the carboxy-terminus. While the SA114F gene has the longest 3' untranslated region (UTR), of 132 nucleotides, that from other strains has suffered deletions of varying lengths at two positions downstream of the translational termination codon. In spite of the divergence of the nucleotide (nt) sequence in the protein coding region, a stretch of about 80 nt in the 3' UTR is highly conserved in the NSP3 gene from all the strains. This conserved sequence in the 3' UTR might play an important role in the regulation of expression of the NSP3 gene. (C) 1995 Academic Press, Inc.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Using a dataset of 1164 crystal structures of largely non-homologous proteins defined at a resolution of 1.5 Å or better, we have investigated the (phi,psi) preferences of the 20 residue types by considering the residues which occur in loops. The propensities of residue types to occur in loops with (phi,psi) values in the alpha(R) region of the Ramachandran map have a poor correlation coefficient of 0.48 with the Chou-Fasman propensities of the residue types to occur in alpha-helical segments. However, the correlation coefficient between the propensities of residues in loops to adopt beta conformations and those in beta-sheet is much higher (0.95). These observations suggest that alpha-helix formation is strongly influenced by the local amino acid sequence, while the intrinsic preference of residue types for beta-sheet plays a major role in the formation of beta-sheet. The main chain polar groups of residues in loops, which can affect the (phi,psi) values, can be involved in intra-molecular hydrogen bonding. We therefore investigated further by considering the subset of residues in loops with a low number (0 to 2) of intra-molecular hydrogen bonds per residue involving main chain polar atoms. For this subset, the correlation coefficients between propensities for alpha-helix and the alpha(R) region and between beta-sheet and the beta region are 0.26 and 0.64, respectively. This reiterates the higher intrinsic tendency of beta-region-favouring residues to adopt beta-sheet than of alpha(R)-region-favouring residues to adopt alpha-helical structure.
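
The comparison described in this abstract rests on two quantities: a per-residue propensity for a conformational region and a correlation coefficient between two sets of propensities. The sketch below shows one common way to compute both. It is an illustration only: the propensity definition (fraction of a residue type found in the region, divided by the overall fraction of all residues in that region) is an assumption in the Chou-Fasman style, and the function names and input layout are hypothetical rather than taken from the paper.

```python
from collections import Counter
from math import sqrt


def propensities(observations, region):
    """Return {residue_type: propensity} for one conformational region.

    observations: iterable of (residue_type, region_label) pairs.
    Assumed definition: (fraction of this residue type in the region)
    divided by (fraction of all residues in the region).
    """
    obs = list(observations)
    in_region = sum(1 for _, label in obs if label == region)
    if not obs or in_region == 0:
        raise ValueError("region is absent from the observations")
    overall_fraction = in_region / len(obs)
    per_residue = Counter(res for res, _ in obs)
    per_residue_in_region = Counter(res for res, label in obs if label == region)
    return {
        res: (per_residue_in_region[res] / count) / overall_fraction
        for res, count in per_residue.items()
    }


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)
```

In use, one would compute the loop-derived propensities for each of the 20 residue types, line them up against the corresponding secondary-structure propensities in the same residue order, and pass the two lists to `pearson`.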

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino acid sequence of a protein to fit to known protein 3D structures using a structural alphabet known as Protein Blocks (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. The library is used to encode 3D protein structures into 1D PB sequences and to capture sequence-to-structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods, or any explicit incorporation of the hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivities of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales up well and offers promising perspectives for structural annotation at the genomic level. It has been implemented in the form of a web server that is freely available at http://www.bo-protscience.fr/forsa.
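
The idea of scoring a query sequence against a fold encoded as a PB sequence, then judging the score by a Z-score, can be sketched as follows. This is a deliberately simplified illustration and not the published implementation: it assumes one score per (PB, amino acid) pair at each aligned position, equal-length sequences, and a background built from shuffled copies of the query. The names `thread_score` and `z_score`, the matrix layout, and the shuffling scheme are all hypothetical.

```python
import random
from statistics import mean, pstdev


def thread_score(seq, pb_seq, matrices):
    """Sum per-position scores of an amino acid sequence on a PB sequence.

    matrices: {pb_letter: {amino_acid: score}}, a stand-in for the
    per-PB amino acid occurrence matrices (assumed here to hold
    log-odds-like values). Unknown residues score 0.
    """
    if len(seq) != len(pb_seq):
        raise ValueError("sequence and PB sequence must have equal length")
    return sum(matrices[pb].get(aa, 0.0) for aa, pb in zip(seq, pb_seq))


def z_score(seq, pb_seq, matrices, n_shuffles=200, seed=0):
    """Z-score of the raw score against scores of shuffled sequences."""
    rng = random.Random(seed)
    raw = thread_score(seq, pb_seq, matrices)
    background = []
    for _ in range(n_shuffles):
        shuffled = list(seq)
        rng.shuffle(shuffled)
        background.append(thread_score("".join(shuffled), pb_seq, matrices))
    spread = pstdev(background)
    # A flat background carries no information, so report 0 rather than divide by 0.
    return 0.0 if spread == 0 else (raw - mean(background)) / spread
```

A hit would then be accepted when its Z-score exceeds a cutoff calibrated on a benchmark, which is how a cutoff can be tied to a target specificity such as the 95% quoted in the abstract.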

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The mouse and human malarial parasites, Plasmodium berghei and Plasmodium falciparum, respectively, synthesize heme de novo following the standard pathway observed in animals, despite the availability of large amounts of heme derived from red cell hemoglobin, which is stored as hemozoin pigment. The enzymes delta-aminolevulinate dehydrase (ALAD), coproporphyrinogen oxidase, and ferrochelatase are present at strikingly high levels in the P. berghei infected mouse red cell in vivo. The isolated parasite has low levels of ALAD and the data clearly indicate it to be of red cell origin. The purified enzyme preparations from the uninfected red cell and the parasite are identical in kinetic properties, subunit molecular weight, cross-reaction with antibodies to the human enzyme, and N-terminal amino acid sequence. Immunogold electron microscopy of the infected culture indicates that the enzyme is present inside the parasite and, therefore, is not a contaminant. The parasite derives functional ALAD from the host, and the enzyme binds specifically to isolated parasite membrane in vitro, suggestive of the involvement of a receptor in its translocation into the parasite. While ALAD, coproporphyrinogen oxidase, and ferrochelatase from the parasite and the uninfected red cell supernatant have identical subunit molecular weights on SDS-polyacrylamide gel electrophoresis and show immunological cross-reaction with antibodies to the human enzymes, as revealed by Western analysis, the first enzyme of the pathway, namely delta-aminolevulinate synthase (ALAS), in the parasite, unlike that of the red cell host, does not cross-react with antibodies to the human enzyme. However, ALAS enzyme activity in the parasite is higher than that of the infected red cell supernatant.
We therefore conclude that the parasite, while making its own ALAS, imports ALAD and perhaps most of the other enzymes of the pathway from the host to synthesize heme de novo, and this would enable it to segregate this heme from the heme derived from red cell hemoglobin degradation. ALAS of the parasite and the receptor(s) involved in the translocation of the host enzymes into the parasite would be unique drug targets.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Sequence-specific resonance assignment constitutes an important step towards high-resolution structure determination of proteins by NMR and is aided by selective identification and assignment of amino acid types. The traditional approach to selective labeling yields only the chemical shifts of the particular amino acid being selected and does not help in establishing a link between adjacent residues along the polypeptide chain, which is important for sequential assignments. An alternative approach is the method of amino acid selective 'unlabeling' or reverse labeling, which involves selective unlabeling of specific amino acid types against a uniformly ¹³C/¹⁵N labeled background. Based on this method, we present a novel approach for sequential assignments in proteins. The method involves a new NMR experiment named {¹²CO(i)-¹⁵N(i+1)}-filtered HSQC, which aids in linking the ¹H(N)/¹⁵N resonances of the selectively unlabeled residue, i, and its C-terminal neighbor, i + 1, in HN-detected double and triple resonance spectra. This leads to the assignment of a tri-peptide segment from the knowledge of the amino acid types of residues i - 1, i and i + 1, thereby speeding up the sequential assignment process. The method has the advantage of being relatively inexpensive, is applicable to ²H-labeled proteins, and can be coupled with cell-free synthesis and/or automated assignment approaches. A detailed survey involving unlabeling of different amino acid types individually or in pairs reveals that the proposed approach is also robust to misincorporation of ¹⁴N at undesired sites. Taken together, this study represents the first application of selective unlabeling for sequence-specific resonance assignments and opens up new avenues for using this methodology in protein structural studies.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The 3' terminal 1255 nt sequence of Physalis mottle virus (PhMV) genomic RNA has been determined from a set of overlapping cDNA clones. The open reading frame (ORF) at the 3' terminus corresponds to the amino acid sequence of the coat protein (CP) determined earlier, except for the absence of the dipeptide Lys-Leu at positions 110-111. In addition, the sequence upstream of the CP gene contains the message coding for 178 amino acid residues of the C-terminus of the putative replicase protein (RP). The sequence downstream of the CP gene contains an untranslated region whose terminal 80 nucleotides can be folded into a characteristic tRNA-like structure. A phylogenetic tree constructed after separately aligning the sequences of the CP, the RP and the tRNA-like structure determined in this study with the corresponding sequences of other tymoviruses shows that PhMV, wrongly named belladonna mottle virus [BDMV(I)], is a separate tymovirus and not another strain of BDMV(E) as originally envisaged. The phylogenetic tree is identical in all three cases, showing that any subset of genomic sequence of sufficient length can be used for establishing evolutionary relationships among tymoviruses.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The notion of optimization is inherent in protein design. A long linear chain of twenty types of amino acid residues is known to fold to a 3-D conformation that minimizes the combined inter-residue interaction energy. There are two distinct protein design problems, viz. predicting the folded structure from a given sequence of amino acid monomers (the folding problem) and determining a sequence for a given folded structure (the inverse folding problem). These two problems have much in common with engineering structural analysis and structural optimization problems, respectively. In the folding problem, a protein chain with a given sequence folds to a conformation, called the native state, which has a unique global minimum energy value when compared to all other unfolded conformations. This involves a search in the conformation space. It is somewhat akin to the principle of minimum potential energy that determines the deformed static equilibrium configuration of an elastic structure of given topology, shape, and size that is subjected to certain boundary conditions. In the inverse folding problem, one has to design a sequence with some objectives (having a specific feature of the folded structure, docking with another protein, etc.) and constraints (sequence being fixed in some portion, a particular composition of amino acid types, etc.) while obtaining a sequence that would fold to the desired conformation satisfying the criteria of folding. This requires a search in the sequence space. It is similar to structural optimization in the design-variable space, wherein a certain feature of structural response is optimized subject to some constraints while satisfying the governing static or dynamic equilibrium equations. Based on this similarity, in this work we apply topology optimization methods to protein design, discuss modeling issues, and present some initial results.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Enzymes belonging to the M1 family play important cellular roles and the key amino acids (aa) in the catalytic domain are conserved. However, C-terminal domain aa are highly variable and demonstrate distinct differences in organization. To address a functional role for the C-terminal domain, progressive deletions were generated in Tricorn interacting factor F2 from Thermoplasma acidophilum (F2) and Peptidase N from Escherichia coli (PepN). Catalytic activity was partially reduced in PepN lacking 4 C-terminal residues (PepNΔC4), whereas it was greatly reduced in F2 lacking 10 C-terminal residues (F2ΔC10) or PepN lacking 11 C-terminal residues (PepNΔC11). Notably, expression of PepNΔC4, but not PepNΔC11, in E. coli ΔpepN increased its ability to resist nutritional and high temperature stress, demonstrating physiological significance. Purified C-terminal deleted proteins demonstrated greater sensitivity to trypsin and bound more strongly to 8-anilino-1-naphthalene sulphonic acid (ANS), revealing greater numbers of surface-exposed hydrophobic aa. Also, F2 or PepN containing large aa deletions in the C-termini, but not smaller deletions, were present in high amounts in the insoluble fraction of cell extracts, probably due to reduced protein solubility. Modeling studies using the crystal structure of E. coli PepN demonstrated an increase in hydrophobic surface area and a change in accessibility of several aa from buried to exposed upon deletion of C-terminal aa. Together, these studies revealed that non-conserved distal C-terminal aa repress the surface exposure of apolar aa and enhance protein solubility and catalytic activity in two soluble and distinct members of the M1 family.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

L-Lysine D-pantothenate, a 1:1 amino acid-vitamin complex, crystallizes in the monoclinic space group P2₁. The structure has been solved by direct methods and refined to an R value of 0.053 for 1868 observed reflections. The zwitterionic positively charged lysine molecules in the structure assume the sterically most favourable conformation with an all-trans side chain trans to the α-carboxylate group. The pantothenate anion has a somewhat folded conformation stabilised by an intramolecular bifurcated hydrogen bond. The unlike molecules aggregate into separate alternating layers. The molecules in the lysine layers form a head-to-tail sequence parallel to the a-axis. The interactions which hold the adjacent layers together include those between the side chain amino group of lysine and the carboxylate group in the pantothenate anion. The geometry of these interactions is such that each carboxylate group is sandwiched between two amino groups in a periodic arrangement of alternating carboxylate and amino groups.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The primary structure of collagen is characterized by the repeating tripeptide sequence (Gly-R2-R3)n. The results of theoretical studies, carried out using contact criteria to compute the stereochemically allowed orientations for various side chains at locations 2 and 3, are reported here. It is found that side chains with only γ-atoms, as in valine, serine and threonine, or with only one δ-methyl group, as in isoleucine, can occur equally well at locations 2 and 3, as is actually the case in collagen. Side chains with two Cδ-atoms, as in leucine and phenylalanine, can also be accommodated at both positions. However, if they occur as R3, their freedom of orientation is severely restricted in the presence of a proline residue as R2 in a neighbouring chain. If water molecules bound to the chains of the triple helix are assumed to be present, then location 3 is virtually impossible for leucine and phenylalanine residues. Location 2 is, however, unaffected, and their presence as R2 can help to shield the water molecules from disturbance by the solvent medium. This may be the reason for the preferential occurrence of Leu and Phe residues in location 2 in the collagen triplets, although the polypeptides (Gly-Pro-Leu)n and (Gly-Pro-Phe)n form collagen-like structures.
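
The positional preference discussed in this abstract (which residues occupy location 2 versus location 3 of the Gly-R2-R3 repeat) can be tabulated from a sequence with a few lines of code. The sketch below is illustrative only: it assumes a one-letter amino acid string that starts on a glycine and is an exact multiple of three residues long, and the function name is hypothetical.

```python
from collections import Counter


def triplet_position_counts(seq):
    """Count residues at locations 2 and 3 of consecutive Gly-R2-R3 triplets.

    seq: one-letter amino acid string, assumed to start at a Gly and to
    contain only whole triplets. Returns (counts_at_R2, counts_at_R3).
    """
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    at_r2, at_r3 = Counter(), Counter()
    for start in range(0, len(seq), 3):
        gly, r2, r3 = seq[start:start + 3]
        if gly != "G":
            raise ValueError(f"triplet at position {start} does not start with Gly")
        at_r2[r2] += 1
        at_r3[r3] += 1
    return at_r2, at_r3
```

For example, calling the function on `"GPLGPFGLP"` splits it into the triplets GPL, GPF and GLP and reports proline twice and leucine once at location 2.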

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Plant seeds contain a large number of inhibitors of proteases of animal, fungal, and bacterial origin. One of the well-studied families of these inhibitors is the Bowman-Birk family (BBI). The BBIs from dicotyledonous seeds are 8K, double-headed proteins. In contrast, the 8K inhibitors from monocotyledonous seeds are single-headed. Monocots also have a 16K, double-headed inhibitor. We have determined the primary structure of a Bowman-Birk inhibitor from a dicot, horsegram, by sequential Edman analysis of the intact protein and of peptides derived from enzymatic and chemical cleavage. The 76-residue-long inhibitor is very similar to that of Macrotyloma axillare. An analysis of this inhibitor along with 26 other Bowman-Birk inhibitor domains (MW 8K) available in the SWISSPROT databank revealed that the proteins from monocots and dicots belong to related but distinct families. Inhibitors from monocots show larger variation in sequence. Sequence comparison shows that a crucial disulphide which connects the amino and carboxy termini of the active site loop is lost in monocots. The loss of a reactive site in monocots seems to be correlated with this. However, it appears that this disulphide is not absolutely essential for retention of inhibitory function. Our analysis suggests that the gene duplication leading to a 16K inhibitor in monocots has occurred, probably after the divergence of monocots and dicots, and also after the loss of the second reactive site in monocots.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Understanding the key factors that influence the interaction preferences of amino acids in the folding of proteins has remained a challenge. Here we present a knowledge-based approach for determining the effective interactions between amino acids based on amino acid type, their secondary structure, and the contact-based environment in which they find themselves in the native state structure, as measured by their number of neighbors. We find that the optimal information is approximately encoded in a 60 x 60 matrix describing the 20 types of amino acids in three distinct secondary structures (helix, beta strand, and loop). We carry out a clustering scheme to understand the similarity between these interactions and to elucidate a nonredundant set. We demonstrate that the inferred energy parameters can be used for assessing the fit of a given sequence into a putative native state structure.
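
The 60 x 60 size follows from 20 amino acid types times 3 secondary-structure states, giving 60 residue "states" whose pairwise interactions fill the matrix. A minimal sketch of how such a matrix could be indexed and used to score a structure is shown below. The ordering of amino acids and states, the function names, and the contact-list input are assumptions made for illustration; the abstract does not specify them.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"      # 20 types, alphabetical one-letter codes
SS_STATES = ("helix", "strand", "loop")   # 3 secondary-structure states


def state_index(aa, ss):
    """Map (amino acid, secondary structure) to a row/column index in 0..59."""
    return SS_STATES.index(ss) * len(AMINO_ACIDS) + AMINO_ACIDS.index(aa)


def contact_energy(residues, contacts, matrix):
    """Sum pairwise interaction parameters over a list of contacts.

    residues: list of (amino_acid, secondary_structure) per position.
    contacts: list of (i, j) index pairs considered to be in contact.
    matrix:   60 x 60 nested list of interaction parameters.
    """
    return sum(
        matrix[state_index(*residues[i])][state_index(*residues[j])]
        for i, j in contacts
    )
```

Scoring the same contact map with different candidate sequences, and comparing the totals, is one way a matrix like this could be used to assess how well a sequence fits a putative native structure.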

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The 3' terminal 1255 nt sequence of Physalis mottle virus (PhMV) genomic RNA has been determined from a set of overlapping cDNA clones. The open reading frame (ORF) at the 3' terminus corresponds to the amino acid sequence of the coat protein (CP) determined earlier, except for the absence of the dipeptide Lys-Leu at positions 110-111. In addition, the sequence upstream of the CP gene contains the message coding for 178 amino acid residues of the C-terminus of the putative replicase protein (RP). The sequence downstream of the CP gene contains an untranslated region whose terminal 80 nucleotides can be folded into a characteristic tRNA-like structure. A phylogenetic tree constructed after separately aligning the sequences of the CP, the RP and the tRNA-like structure determined in this study with the corresponding sequences of other tymoviruses shows that PhMV, wrongly named belladonna mottle virus [BDMV(I)], is a separate tymovirus and not another strain of BDMV(E) as originally envisaged. The phylogenetic tree is identical in all three cases, showing that any subset of genomic sequence of sufficient length can be used for establishing evolutionary relationships among tymoviruses.