972 resultados para protein folding
Resumo:
Microsecond long Molecular Dynamics (MD) trajectories of biomolecular processes are now possible due to advances in computer technology. Soon, trajectories long enough to probe dynamics over many milliseconds will become available. Since these timescales match the physiological timescales over which many small proteins fold, all atom MD simulations of protein folding are now becoming popular. To distill features of such large folding trajectories, we must develop methods that can both compress trajectory data to enable visualization, and that can yield themselves to further analysis, such as the finding of collective coordinates and reduction of the dynamics. Conventionally, clustering has been the most popular MD trajectory analysis technique, followed by principal component analysis (PCA). Simple clustering used in MD trajectory analysis suffers from various serious drawbacks, namely, (i) it is not data driven, (ii) it is unstable to noise and change in cutoff parameters, and (iii) since it does not take into account interrelationships amongst data points, the separation of data into clusters can often be artificial. Usually, partitions generated by clustering techniques are validated visually, but such validation is not possible for MD trajectories of protein folding, as the underlying structural transitions are not well understood. Rigorous cluster validation techniques may be adapted, but it is more crucial to reduce the dimensions in which MD trajectories reside, while still preserving their salient features. PCA has often been used for dimension reduction and while it is computationally inexpensive, being a linear method, it does not achieve good data compression. In this thesis, I propose a different method, a nonmetric multidimensional scaling (nMDS) technique, which achieves superior data compression by virtue of being nonlinear, and also provides a clear insight into the structural processes underlying MD trajectories. I illustrate the capabilities of nMDS by analyzing three complete villin headpiece folding and six norleucine mutant (NLE) folding trajectories simulated by Freddolino and Schulten [1]. Using these trajectories, I make comparisons between nMDS, PCA and clustering to demonstrate the superiority of nMDS. The three villin headpiece trajectories showed great structural heterogeneity. Apart from a few trivial features like early formation of secondary structure, no commonalities between trajectories were found. There were no units of residues or atoms found moving in concert across the trajectories. A flipping transition, corresponding to the flipping of helix 1 relative to the plane formed by helices 2 and 3 was observed towards the end of the folding process in all trajectories, when nearly all native contacts had been formed. However, the transition occurred through a different series of steps in all trajectories, indicating that it may not be a common transition in villin folding. The trajectories showed competition between local structure formation/hydrophobic collapse and global structure formation in all trajectories. Our analysis on the NLE trajectories confirms the notion that a tight hydrophobic core inhibits correct 3-D rearrangement. Only one of the six NLE trajectories folded, and it showed no flipping transition. All the other trajectories get trapped in hydrophobically collapsed states. The NLE residues were found to be buried deeply into the core, compared to the corresponding lysines in the villin headpiece, thereby making the core tighter and harder to undo for 3-D rearrangement. Our results suggest that the NLE may not be a fast folder as experiments suggest. The tightness of the hydrophobic core may be a very important factor in the folding of larger proteins. It is likely that chaperones like GroEL act to undo the tight hydrophobic core of proteins, after most secondary structure elements have been formed, so that global rearrangement is easier. I conclude by presenting facts about chaperone-protein complexes and propose further directions for the study of protein folding.
Resumo:
The survival and descent of cells is universally dependent on maintaining their proteins in a properly folded condition. It is widely accepted that the information for the folding of the nascent polypeptide chain into a native protein is encrypted in the amino acid sequence, and the Nobel Laureate Christian Anfinsen was the first to demonstrate that a protein could spontaneously refold after complete unfolding. However, it became clear that the observed folding rates for many proteins were much slower than rates estimated in vivo. This led to the recognition of required protein-protein interactions that promote proper folding. A unique group of proteins, the molecular chaperones, are responsible for maintaining protein homeostasis during normal growth as well as stress conditions. Chaperonins (CPNs) are ubiquitous and essential chaperones. They form ATP-dependent, hollow complexes that encapsulate polypeptides in two back-to-back stacked multisubunit rings, facilitating protein folding through highly cooperative allosteric articulation. CPNs are usually classified into Group I and Group II. Here, I report the characterization of a novel CPN belonging to a third Group, recently discovered in bacteria. Group III CPNs have close phylogenetic association to the Group II CPNs found in Archaea and Eukarya, and may be a relic of the Last Common Ancestor of the CPN family. The gene encoding the Group III CPN from Carboxydothermus hydrogenoformans and Candidatus Desulforudis audaxviator was cloned in E. coli and overexpressed in order to both characterize the protein and to demonstrate its ability to function as an ATPase chaperone. The opening and closing cycle of the Chy chaperonin was examined via site-directed mutations affecting the ATP binding site at R155. To relate the mutational analysis to the structure of the CPN, the crystal structure of both the AMP-PNP (an ATP analogue) and ADP bound forms were obtained in collaboration with Sun-Shin Cha in Seoul, South Korea. The ADP and ATP binding site substitutions resulted in frozen forms of the structures in open and closed conformations. From this, mutants were designed to validate hypotheses regarding key ATP interacting sites as well as important stabilizing interactions, and to observe the physical properties of the resulting complexes by calorimetry.
Resumo:
Background: Stabilization strategies adopted by proteins under extreme conditions are very complex and involve various kinds of interactions. Recent studies have shown that a large proportion of proteins have their N- and C-terminal elements in close contact and suggested they play a role in protein folding and stability. However, the biological significance of this contact remains elusive. Methodology: In the present study, we investigate the role of N- and C-terminal residue interaction using a family 10 xylanase (BSX) with a TIM-barrel structure that shows stability under high temperature,alkali pH, and protease and SDS treatment. Based on crystal structure,an aromatic cluster was identified that involves Phe4, Trp6 and Tyr343 holding the Nand C-terminus together; this is a unique and important feature of this protein that might be crucial for folding and stabilityunder poly-extreme conditions. Conclusion: A series of mutants was created to disrupt this aromatic cluster formation and study the loss of stability and function under given conditions. While the deletions of Phe4 resulted in loss of stability, removal of Trp6 and Tyr343 affected in vivo folding and activity. Alanine substitution with Phe4, Trp6 and Tyr343 drastically decreased stability under all parameters studied. Importantly,substitution of Phe4 with Trp increased stability in SDS treatment.Mass spectrometry results of limited proteolysis further demonstrated that the Arg344 residue is highly susceptible to trypsin digestion in sensitive mutants such as DF4, W6A and Y343A, suggesting again that disruption of the Phe4-Trp6-Tyr343 (F-W-Y) cluster destabilizes the N-and C-terminal interaction. Our results underscore the importance of N- and C-terminal contact through aromatic interactions in protein folding and stability under extreme conditions, and these results may be useful to improve the stability of other proteins under suboptimal conditions.
Resumo:
Protein folding is a relatively fast process considering the astronomical number of conformations in which a protein could find itself. Within the framework of a lattice model, we show that one can design rapidly folding sequences by assigning the strongest attractive couplings to the contacts present in a target native state, Our protein design can be extended to situations with both attractive and repulsive contacts. Frustration is minimized by ensuring that all the native contacts are again strongly attractive. Strikingly, this ensures the inevitability of folding and accelerates the folding process by an order of magnitude, The evolutionary implications of our findings are discussed.
Resumo:
A fundamental question in protein folding is whether the coil to globule collapse transition occurs during the initial stages of folding (burst phase) or simultaneously with the protein folding transition. Single molecule fluorescence resonance energy transfer (FRET) and small-angle X-ray scattering (SAXS) experiments disagree on whether Protein L collapse transition occurs during the burst phase of folding. We study Protein L folding using a coarse-grained model and molecular dynamics simulations. The collapse transition in Protein L is found to be concomitant with the folding transition. In the burst phase of folding, we find that FRET experiments overestimate radius of gyration, R-g, of the protein due to the application of Gaussian polymer chain end-to-end distribution to extract R-g from the FRET efficiency. FRET experiments estimate approximate to 6 angstrom decrease in R-g when the actual decrease is approximate to 3 angstrom on guanidinium chloride denaturant dilution from 7.5 to 1 M, thereby suggesting pronounced compaction in the protein dimensions in the burst phase. The approximate to 3 angstrom decrease is close to the statistical uncertainties of the R-g data measured from SAXS experiments, which suggest no compaction, leading to a disagreement with the FRET experiments. The transition-state ensemble (TSE) structures in Protein L folding are globular and extensive in agreement with the Psi-analysis experiments. The results support the hypothesis that the TSE of single domain proteins depends on protein topology and is not stabilized by local interactions alone.
Resumo:
Molecular dynamics simulations of the model protein chignolin with explicit solvent were carried out, in order to analyze the influence of the Berendsen thermostat on the evolution and folding of the peptide. The dependence of the peptide behavior on temperature was tested with the commonly employed thermostat scheme consisting of one thermostat for the protein and another for the solvent. The thermostat coupling time of the protein was increased to infinity, when the protein is not in direct contact with the thermal bath, a situation known as minimally invasive thermostat. In agreement with other works, it was observed that only in the last situation the instantaneous temperature of the model protein obeys a canonical distribution. As for the folding studies, it was shown that, in the applications of the commonly utilized thermostat schemes, the systems are trapped in local minima regions from which it has difficulty escaping. With the minimally invasive thermostat the time that the protein needs to fold was reduced by two to three times. These results show that the obstacles to the evolution of the extended peptide to the folded structure can be overcome when the temperature of the peptide is not directly controlled.
Resumo:
Structure and folding of membrane proteins are important issues in molecular and cell biology. In this work new approaches are developed to characterize the structure of folded, unfolded and partially folded membrane proteins. These approaches combine site-directed spin labeling and pulse EPR techniques. The major plant light harvesting complex LHCIIb was used as a model system. Measurements of longitudinal and transversal relaxation times of electron spins and of hyperfine couplings to neighboring nuclei by electron spin echo envelope modulation(ESEEM) provide complementary information about the local environment of a single spin label. By double electron electron resonance (DEER) distances in the nanometer range between two spin labels can be determined. The results are analyzed in terms of relative water accessibilities of different sites in LHCIIb and its geometry. They reveal conformational changes as a function of micelle composition. This arsenal of methods is used to study protein folding during the LHCIIb self assembly and a spatially and temporally resolved folding model is proposed. The approaches developed here are potentially applicable for studying structure and folding of any protein or other self-assembling structure if site-directed spin labeling is feasible and the time scale of folding is accessible to freeze-quench techniques.
Resumo:
A full quantitative understanding of the protein folding problem is now becoming possible with the help of the energy landscape theory and the protein folding funnel concept. Good folding sequences have a landscape that resembles a rough funnel where the energy bias towards the native state is larger than its ruggedness. Such a landscape leads not only to fast folding and stable native conformations but, more importantly, to sequences that are robust to variations in the protein environment and to sequence mutations. In this paper, an off-lattice model of sequences that fold into a β-barrel native structure is used to describe a framework that can quantitatively distinguish good and bad folders. The two sequences analyzed have the same native structure, but one of them is minimally frustrated whereas the other one exhibits a high degree of frustration.
Resumo:
Protein folding is a relatively fast process considering the astronomical number of conformations in which a protein could find itself. Within the framework of a lattice model, we show that one can design rapidly folding sequences by assigning the strongest attractive couplings to the contacts present in a target native state. Our protein design can be extended to situations with both attractive and repulsive contacts. Frustration is minimized by ensuring that all the native contacts are again strongly attractive. Strikingly, this ensures the inevitability of folding and accelerates the folding process by an order of magnitude. The evolutionary implications of our findings are discussed.
Resumo:
The folding and targeting of membrane proteins poses a major challenge to the cell, as they must remain insertion competent while their highly hydrophobic transmembrane (TM) domains are transferred from the ribosome, through the aqueous cytosol and into the lipid bilayer. The biogenesis of a mature membrane protein takes place through the insertion and integration into the lipid bilayer. A number of TM proteins have been shown to gain some degree of secondary structure within the ribosome tunnel and to retain this conformation throughout maturation. Although studies into the folding and targeting of a number of membrane proteins have been carried out to date, there is little information on one of the largest class of eukaryotic membrane proteins; the G-protein-coupled receptors (GPCRs). This project studies the early folding events of the human ortholog of GPR35. To analyse the structure of the 1st TM domain, intermediates were generated and assessed by the biochemical method of pegylation (PEG-MAL). A structurally-similar microbial opsin (Bacterioopsin) was also used to investigate the differences in the early protein folding within eukaryotic and prokaryotic translation systems. Results showed that neither the 1st TM domain of GPR35 nor Bacterioopsin were capable of compacting in the ribosome tunnel before their N-terminus reached the ribosome exit point. The results for this assay remained consistent whether the proteins were translated in a eukaryotic or prokaryotic translation system. To examine the communication mechanism between the ribosome, the nascent chain and the protein targeting pathway, crosslinking experiments were carried out using the homobifunctional lysine cross-linker BS3. Specifically, the data generated here show that the nascent chain of GPR35 reaches the ribosomal protein uL23 in an extended conformation and interacts with the SRP protein as it exits the ribosome tunnel. This confirms the role of SRP in the co-translational targeting of GPR35. Using these methods insights into the early folding of GPCRs has been obtained. Further experiments using site-directed mutagenesis to reduce hydrophobicity in the 1st TM domain of GPR35, highlighted the mechanisms by which GPCRs are targeted to the endoplasmic reticulum. Confirming that hydrophobicity within the signal anchor sequence is essential of SRP-dependent targeting. Following the successful interaction of the nascent GPR35 and SRP, GPR35 is successfully targeted to ER membranes, shown here as dog pancreas microsomes (DPMs). Glycosylation of the GPR35 N-terminus was used to determine nascent chain structure as it is inserted into the ER membrane. These glycosylation experiments confirm that TM1 has obtained its compacted state whilst residing in the translocon. Finally, a site-specific cross-linking approach using the homobifunctional cysteine cross-linker, BMH, was used to study the lateral integration of GPR35 into the ER. Cross-linking of GPR35 TM1 and TM2 could be detected adjacent to a protein of ~45kDa, believed to be Sec61α. The loss of this adduct, as the nascent chain extends, showed the lateral movement of GPR35 TM1 from the translocon was dependent on the subsequent synthesis of TM2.
Resumo:
Communication within and across proteins is crucial for the biological functioning of proteins. Experiments such as mutational studies on proteins provide important information on the amino acids, which are crucial for their function. However, the protein structures are complex and it is unlikely that the entire responsibility of the function rests on only a few amino acids. A large fraction of the protein is expected to participate in its function at some level or other. Thus, it is relevant to consider the protein structures as a completely connected network and then deduce the properties, which are related to the global network features. In this direction, our laboratory has been engaged in representing the protein structure as a network of non-covalent connections and we have investigated a variety of problems in structural biology, such as the identification of functional and folding clusters, determinants of quaternary association and characterization of the network properties of protein structures. We have also addressed a few important issues related to protein dynamics, such as the process of oligomerization in multimers, mechanism on protein folding, and ligand induced communications (allosteric effect). In this review we highlight some of the investigations which we have carried out in the recent past. A review on protein structure graphs was presented earlier, in which the focus was on the graphs and graph spectral properties and their implementation in the study of protein structure graphs/networks (PSN). In this article, we briefly summarize the relevant parts of the methodology and the focus is on the advancement brought out in the understanding of protein structure-function relationships through structure networks. The investigations of structural/biological problems are divided into two parts, in which the first part deals with the analysis of PSNs based on static structures obtained from x-ray crystallography. The second part highlights the changes in the network, associated with biological functions, which are deduced from the network analysis on the structures obtained from molecular dynamics simulations.
Resumo:
Prediction of thermodynamic parameters of protein-protein and antigen-antibody complex formation from high resolution structural parameters has recently received much attention, since an understanding of the contributions of different fundamental processes like hydrophobic interactions, hydrogen bonding, salt bridge formation, solvent reorganization etc. to the overall thermodynamic parameters and their relations with the structural parameters would lead to rational drug design. Using the results of the dissolution of hydrocarbons and other model compounds the changes in heat capacity (DeltaCp), enthalpy (DeltaH) and entropy (DeltaS) have been empirically correlated with the polar and apolar surface areas buried during the process of protein folding/unfolding and protein-ligand complex formation. In this regard, the polar and apolar surfaces removed from the solvent in a protein-ligand complex have been calculated from the experimentally observed values of changes in heat capacity (DeltaCp) and enthalpy (DeltaH) for protein-ligand complexes for which accurate thermodynamic and high resolution structural data are available, and the results have been compared with the x-ray crystallographic observations. Analyses of the available results show poor correlation between the thermodynamic and structural parameters. Probable reasons for this discrepancy are mostly related with the reorganization of water accompanying the reaction which is indeed proven by the analyses of the energetics of the binding of the wheat germ agglutinin to oligosaccharides.
Resumo:
Neurotrophic factors (NTFs) are secreted proteins which promote the survival of neurons, formation and maintenance of neuronal contacts and regulate synaptic plasticity. NTFs are also potential drug candidates for the treatment of neurodegenerative diseases. Parkinson’s disease (PD) is mainly caused by the degeneration of midbrain dopaminergic neurons. Current therapies for PD do not stop the neurodegeneration or repair the affected neurons. Thus, search of novel neurotrophic factors for midbrain dopaminergic neurons, which could also be used as therapeutic proteins, is highly warranted. In the present study, we identified and characterized a novel protein named conserved dopamine neurotrophic factor (CDNF), a homologous protein to mesencephalic astrocyte-derived neurotrophic factor (MANF). Others have shown that MANF supports the survival of embryonic midbrain dopaminergic neurons in vitro, and protects cultured cells against endoplasmic reticulum (ER) stress. CDNF and MANF form a novel evolutionary conserved protein family with characteristic eight conserved cysteine residues in their primary structure. The vertebrates have CDNF and MANF encoding genes, whereas the invertebrates, including Drosophila and Caenorhabditis have a single homologous CDNF/MANF gene. In this study we show that CDNF and MANF are secreted proteins. They are widely expressed in the mammalian brain, including the midbrain and striatum, and in several non-neuronal tissues. We expressed and purified recombinant human CDNF and MANF proteins, and tested the neurotrophic activity of CDNF on midbrain dopaminergic neurons using a 6-hydroxydopamine (6-OHDA) rat model of PD. In this model, a single intrastriatal injection of CDNF protected midbrain dopaminergic neurons and striatal dopaminergic fibers from the 6-OHDA toxicity. Importantly, an intrastriatal injection of CDNF also restored the functional activity of the nigrostriatal dopaminergic system when given after the striatal 6-OHDA lesion. Thus, our study shows that CDNF is a potential novel therapeutic protein for the treatment of PD. In order to elucidate the molecular mechanisms of CDNF and MANF activity, we resolved their crystal structure. CDNF and MANF proteins have two domains; an amino (N)-terminal saposin-like domain and a presumably unfolded carboxy (C)-terminal domain. The saposin-like domain, which is formed by five α-helices and stabilized by three intradomain disulphide bridges, may bind to lipids or membranes. The C-terminal domain contains an internal cysteine bridge in a CXXC motif similar to that of thiol/disulphide oxidoreductases and isomerases, and may thus facilitate protein folding in the ER. Our studies suggest that CDNF and MANF are novel potential therapeutic proteins for the treatment of neurodegenerative diseases. Future studies will reveal the neurotrophic and cytoprotective mechanisms of CDNF and MANF in more detail.
Resumo:
The molecular mechanism of helix nucleation in peptides and proteins is not yet understood and the question of whether sharp turns in the polypeptide backbone serve as nuclei for protein folding has evoked controversy1,2. A recent study of the conformation of a tetrapeptide containing the stereochemically constrained residue alpha-aminoisobutyric acid, both in solution and the solid state, yielded a structure consisting of two consecutive beta-turns, leading to an incipient 310 helical conformation3,4. This led us to speculate that specific tri- and tetra-peptide sequences may indeed provide a helical twist to the amino-terminal segment of helical regions in proteins and provide a nucleation site for further propagation. The transformation from a 310 helical structure to an alpha-helix should be facile and requires only small changes in the phi and psi conformational angles and a rearrangement of the hydrogen bonding pattern5. If such a mechanism is involved then it should be possible to isolate an incipient 310 helical conformation in a tripeptide amide or tetrapeptide sequence, based purely on the driving force derived from short-range interactions. We have synthesised and studied the model peptide pivaloyl-Pro-Pro-Ala-NHMe (compound I) and provide here spectroscopic evidence for a 310 helical conformation in compound I.
Resumo:
Heat shock protein 90 participates in diverse biological processes ranging from protein folding, cell cycle, signal transduction and development to evolution in all eukaryotes. It is also critically involved in regulating growth of protozoa such as Dictyostelium discoideum, Leishmania donovani, Plasmodium falciparum, Trypanosoma cruzi, and Trypanosoma evansi. Selective inhibition of Hsp90 has also been explored as an intervention strategy against important human diseases such as cancer, malaria, or trypanosomiasis. Giardia lamblia, a simple protozoan parasite of humans and animals, is an important cause of diarrheal disease with significant morbidity and some mortality in tropical countries. Here we show that the G. lamblia cytosolic hsp90 ( glhsp90) is split in two similar sized fragments located 777 kb apart on the same scaffold. Intrigued by this unique arrangement, which appears to be specific for the Giardiinae, we have investigated the biosynthesis of GlHsp90. We used genome sequencing to confirm the split nature of the giardial hsp90. However, a specific antibody raised against the peptide detected a product with a mass of about 80 kDa, suggesting a post-transcriptional rescue of the genomic defect. We show evidence for the joining of the two independent Hsp90 transcripts in-trans to one long mature mRNA presumably by RNA splicing. The splicing junction carries hallmarks of classical cis-spliced introns, suggesting that the regular cis-splicing machinery may be sufficient for repair of the open reading frame. A complementary 26-nt sequence in the ``intron'' regions adjacent to the splice sites may assist in positioning the two pre-mRNAs for processing. This is the first example of post-transcriptional rescue of a split gene by trans-splicing.