917 resultados para Protein structure prediction
Resumo:
Inteins are protein-splicing elements, most of which contain conserved sequence blocks that define a family of homing endonucleases. Like group I introns that encode such endonucleases, inteins are mobile genetic elements. Recent crystallography and computer modeling studies suggest that inteins consist of two structural domains that correspond to the endonuclease and the protein-splicing elements. To determine whether the bipartite structure of inteins is mirrored by the functional independence of the protein-splicing domain, the entire endonuclease component was deleted from the Mycobacterium tuberculosis recA intein. Guided by computer modeling studies, and taking advantage of genetic systems designed to monitor intein function, the 440-aa Mtu recA intein was reduced to a functional mini-intein of 137 aa. The accuracy of splicing of several mini-inteins was verified. This work not only substantiates structure predictions for intein function but also supports the hypothesis that, like group I introns, mobile inteins arose by an endonuclease gene invading a sequence encoding a small, functional splicing element.
Resumo:
We examine the occurrence of the ≈300 known protein folds in different groups of organisms. To do this, we characterize a large fraction of the currently known protein sequences (≈140,000) in structural terms, by matching them to known structures via sequence comparison (or by secondary-structure class prediction for those without structural homologues). Overall, we find that an appreciable fraction of the known folds are present in each of the major groups of organisms (e.g., bacteria and eukaryotes share 156 of 275 folds), and most of the common folds are associated with many families of nonhomologous sequences (i.e., >10 sequence families for each common fold). However, different groups of organisms have characteristically distinct distributions of folds. So, for instance, some of the most common folds in vertebrates, such as globins or zinc fingers, are rare or absent in bacteria. Many of these differences in fold usage are biologically reasonable, such as the folds of metabolic enzymes being common in bacteria and those associated with extracellular transport and communication being common in animals. They also have important implications for database-based methods for fold recognition, suggesting that an unknown sequence from a plant is more likely to have a certain fold (e.g., a TIM barrel) than an unknown sequence from an animal.
Resumo:
The structural basis of species specificity of transmissible spongiform encephalopathies, such as bovine spongiform encephalopathy or “mad cow disease” and Creutzfeldt–Jakob disease in humans, has been investigated using the refined NMR structure of the C-terminal domain of the mouse prion protein with residues 121–231. A database search for mammalian prion proteins yielded 23 different sequences for the fragment 124–226, which display a high degree of sequence identity and show relevant amino acid substitutions in only 18 of the 103 positions. Except for a unique isolated negative surface charge in the bovine protein, the amino acid differences are clustered in three distinct regions of the three-dimensional structure of the cellular form of the prion protein. Two of these regions represent potential species-dependent surface recognition sites for protein–protein interactions, which have independently been implicated from in vitro and in vivo studies of prion protein transformation. The third region consists of a cluster of interior hydrophobic side chains that may affect prion protein transformation at later stages, after initial conformational changes in the cellular protein.
Resumo:
The three-dimensional structure of the N-terminal domain (residues 18–112) of α2-macroglobulin receptor-associated protein (RAP) has been determined by NMR spectroscopy. The structure consists of three helices composed of residues 23–34, 39–65, and 73–88. The three helices are arranged in an up-down-up antiparallel topology. The C-terminal 20 residues were shown not to be in a well defined conformation. A structural model for the binding of RAP to the family of low-density lipoprotein receptors is proposed. It defines a role in binding for both the unordered C terminus and the structural scaffold of the core structure. Pathogenic epitopes for the rat disease Heymann nephritis, an experimental model of human membranous glomerulonephritis, have been identified in RAP and in the large endocytic receptor gp330/megalin. Here we provide the three-dimensional structure of the pathogenic epitope in RAP. The amino acid residues known to form the epitope are in a helix–loop–helix conformation, and from the structure it is possible to rationalize the published results obtained from studies of fragments of the N-terminal domain.
Resumo:
The yabJ gene in Bacillus subtilis is required for adenine-mediated repression of purine biosynthetic genes in vivo and codes for an acid-soluble, 14-kDa protein. The molecular mechanism of YabJ is unknown. YabJ is a member of a large, widely distributed family of proteins of unknown biochemical function. The 1.7-Å crystal structure of YabJ reveals a trimeric organization with extensive buried hydrophobic surface and an internal water-filled cavity. The most important finding in the structure is a deep, narrow cleft between subunits lined with nine side chains that are invariant among the 25 most similar homologs. This conserved site is proposed to be a binding or catalytic site for a ligand or substrate that is common to YabJ and other members of the YER057c/YjgF/UK114 family of proteins.
Resumo:
Arabinogalactan proteins (AGPs) are proteoglycans of higher plants, which are implicated in growth and development. We recently have shown that two AGPs, NaAGP1 (from Nicotiana alata styles) and PcAGP1 (from Pyrus communis cell suspension culture), are modified by the addition of a glycosylphosphatidylinositol (GPI) anchor. However, paradoxically, both AGPs were buffer soluble rather than membrane associated. We now show that pear suspension cultured cells also contain membrane-bound GPI-anchored AGPs. This GPI anchor has the minimal core oligosaccharide structure, d-Manα(1–2)-d-Manα(1–6)-d-Manα(1–4)-d-GlcN-inositol, which is consistent with those found in animals, protozoa, and yeast, but with a partial β(1–4)-galactosyl substitution of the 6-linked Man residue, and has a phosphoceramide lipid composed primarily of phytosphingosine and tetracosanoic acid. The secreted form of PcAGP1 contains a truncated GPI lacking the phosphoceramide moiety, suggesting that it is released from the membrane by the action of a phospholipase D. The implications of these findings are discussed in relation to the potential mechanisms by which GPI-anchored AGPs may be involved in signal transduction pathways.
Resumo:
A physical theory of protein secondary structure is proposed and tested by performing exceedingly simple Monte Carlo simulations. In essence, secondary structure propensities are predominantly a consequence of two competing local effects, one favoring hydrogen bond formation in helices and turns, the other opposing the attendant reduction in sidechain conformational entropy on helix and turn formation. These sequence specific biases are densely dispersed throughout the unfolded polypeptide chain, where they serve to preorganize the folding process and largely, but imperfectly, anticipate the native secondary structure.
Resumo:
Bacteriophage λ encodes a number of genes involved in the recombinational repair of DNA double-strand breaks. The product of one of these genes, rap, has been purified. Truncated Rap proteins that copurify with the full-length form are derived, at least in part, from a ρ-dependent transcription terminator located within its coding sequence. Full-length and certain truncated Rap polypeptides bind preferentially to branched DNA substrates, including synthetic Holliday junctions and D-loops. In the presence of manganese ions, Rap acts as an endonuclease that cleaves at the branch point of Holliday and D-loop substrates. It shows no obvious sequence preference or symmetry of cleavage on a Holliday junction. The biochemical analysis of Rap gives an insight into how recombinants could be generated by the nicking of a D-loop without the formation of a classical Holliday junction.
Resumo:
Nonribosomal nucleolar protein gar2 is required for 18S rRNA and 40S ribosomal subunit production in Schizosaccharomyces pombe. We have investigated the consequences of the absence of each structural domain of gar2 on cell growth, 18S rRNA production, and nucleolar structure. Deletion of gar2 RNA-binding domains (RBDs) causes stronger inhibition of growth and 18S rRNA accumulation than the absence of the whole protein, suggesting that other factors may be titrated by its remaining N-terminal basic/acidic serine-rich domain. These drastic functional defects correlate with striking nucleolar hypertrophy. Point mutations in the conserved RNP1 motifs of gar2 RBDs supposed to inhibit RNA–protein interactions are sufficient to induce severe nucleolar modifications but only in the presence of the N-terminal domain of the protein. Gar2 and its mutants also distribute differently in glycerol gradients: gar2 lacking its RBDs is found either free or assembled into significantly larger complexes than the wild-type protein. We propose that gar2 helps the assembly on rRNA of factors necessary for 40S subunit synthesis by providing a physical link between them. These factors may be recruited by the N-terminal domain of gar2 and may not be released if interaction of gar2 with rRNA is impaired.
Resumo:
The endosperm of a sorghum mutant cultivar, with high in vitro uncooked and cooked protein digestibilities, was examined by transmission electron microscopy and α-, β-, and γ-kafirins (storage proteins) were localized within its protein bodies. Transmission electron microscopy micrographs revealed that these protein bodies had a unique microstructure related to high protein digestibility. They were irregular in shape and had numerous invaginations, often reaching to the central area of the protein body. Protein bodies from normal cultivars, such as P721N studied here, with much lower uncooked and cooked digestibilities are spherical and contain no invaginations. Immunocytochemistry results showed that the relative location of α- and β-kafirins within the protein bodies of the highly digestible genotype were similar to the normal cultivar, P721N. γ-Kafirin, however, was concentrated in dark-staining regions at the base of the folds instead of at the protein body periphery, as is typical of normal cultivars. The resulting easy accessibility of digestive enzymes to α-kafirin, the major storage protein, in addition to the increased surface area of the protein bodies of the highly digestible cultivar appear to account for its high in vitro protein digestibility.
Resumo:
We have searched for a minimal interaction motif in τ protein that supports the aggregation into Alzheimer-like paired helical filaments. Digestion of the repeat domain with different proteases yields a GluC-induced fragment comprising 43 residues (termed PHF43), which represents the third repeat of τ plus some flanking residues. This fragment self assembles readily into thin filaments without a paired helical appearance, but these filaments are highly competent to nucleate bona fide PHFs from full-length τ. Probing the interactions of PHF43 with overlapping peptides derived from the full τ sequence yields a minimal hexapeptide interaction motif of 306VQIVYK311 at the beginning of the third internal repeat. This motif coincides with the highest predicted β-structure potential in τ. CD and Fourier transform infrared spectroscopy shows that PHF43 acquires pronounced β structure in conditions of self assembly. Point mutations in the hexapeptide region by proline-scanning mutagenesis prevent the aggregation. The data indicate that PHF assembly is initiated by a short fragment containing the minimal interaction motif forming a local β structure embedded in a largely random-coil protein.
Resumo:
Staphylococcus aureus produces a virulence factor, protein A (SpA), that contains five homologous Ig-binding domains. The interactions of SpA with the Fab region of membrane-anchored Igs can stimulate a large fraction of B cells, contributing to lymphocyte clonal selection. To understand the molecular basis for this activity, we have solved the crystal structure of the complex between domain D of SpA and the Fab fragment of a human IgM antibody to 2.7-Å resolution. In the complex, helices II and III of domain D interact with the variable region of the Fab heavy chain (VH) through framework residues, without the involvement of the hypervariable regions implicated in antigen recognition. The contact residues are highly conserved in human VH3 antibodies but not in other families. The contact residues from domain D also are conserved among all SpA Ig-binding domains, suggesting that each could bind in a similar manner. Features of this interaction parallel those reported for staphylococcal enterotoxins that are superantigens for many T cells. The structural homology between Ig VH regions and the T-cell receptor Vβ regions facilitates their comparison, and both types of interactions involve lymphocyte receptor surface remote from the antigen binding site. However, T-cell superantigens reportedly interact through hydrogen bonds with T-cell receptor Vβ backbone atoms in a primary sequence-independent manner, whereas SpA relies on a sequence-restricted conformational binding with residue side chains, suggesting that this common bacterial pathogen has adopted distinct molecular recognition strategies for affecting large sets of B and T lymphocytes.
Resumo:
I attempt to reconcile apparently conflicting factors and mechanisms that have been proposed to determine the rate constant for two-state folding of small proteins, on the basis of general features of the structures of transition states. Φ-Value analysis implies a transition state for folding that resembles an expanded and distorted native structure, which is built around an extended nucleus. The nucleus is composed predominantly of elements of partly or well-formed native secondary structure that are stabilized by local and long-range tertiary interactions. These long-range interactions give rise to connecting loops, frequently containing the native loops that are poorly structured. I derive an equation that relates differences in the contact order of a protein to changes in the length of linking loops, which, in turn, is directly related to the unfavorable free energy of the loops in the transition state. Kinetic data on loop extension mutants of CI2 and α-spectrin SH3 domain fit the equation qualitatively. The rate of folding depends primarily on the interactions that directly stabilize the nucleus, especially those in native-like secondary structure and those resulting from the entropy loss from the connecting loops, which vary with contact order. This partitioning of energy accounts for the success of some algorithms that predict folding rates, because they use these principles either explicitly or implicitly. The extended nucleus model thus unifies the observations of rate depending on both stability and topology.
Resumo:
The NMR structures of the recombinant human prion protein, hPrP(23–230), and two C-terminal fragments, hPrP(90–230) and hPrP(121–230), include a globular domain extending from residues 125–228, for which a detailed structure was obtained, and an N-terminal flexibly disordered “tail.” The globular domain contains three α-helices comprising the residues 144–154, 173–194, and 200–228 and a short anti-parallel β-sheet comprising the residues 128–131 and 161–164. Within the globular domain, three polypeptide segments show increased structural disorder: i.e., a loop of residues 167–171, the residues 187–194 at the end of helix 2, and the residues 219–228 in the C-terminal part of helix 3. The local conformational state of the polypeptide segments 187–193 in helix 2 and 219–226 in helix 3 is measurably influenced by the length of the N-terminal tail, with the helical states being most highly populated in hPrP(23–230). When compared with the previously reported structures of the murine and Syrian hamster prion proteins, the length of helix 3 coincides more closely with that in the Syrian hamster protein whereas the disordered loop 167–171 is shared with murine PrP. These species variations of local structure are in a surface area of the cellular form of PrP that has previously been implicated in intermolecular interactions related both to the species barrier for infectious transmission of prion disease and to immune reactions.
Resumo:
The conformational space annealing (CSA) method for global optimization has been applied to the 10-55 fragment of the B-domain of staphylococcal protein A (protein A) and to a 75-residue protein, apo calbindin D9K (PDB ID code 1CLB), by using the UNRES off-lattice united-residue force field. Although the potential was not calibrated with these two proteins, the native-like structures were found among the low-energy conformations, without the use of threading or secondary-structure predictions. This is because the CSA method can find many distinct families of low-energy conformations. Starting from random conformations, the CSA method found that there are two families of low-energy conformations for each of the two proteins, the native-like fold and its mirror image. The CSA method converged to the same low-energy folds in all cases studied, as opposed to other optimization methods. It appears that the CSA method with the UNRES force field, which is based on the thermodynamic hypothesis, can be used in prediction of protein structures in real time.