933 resultados para Protein secondary structure
Resumo:
Receptor activity modifying proteins (RAMPs) are a family of single-pass transmembrane proteins that dimerize with G-protein-coupled receptors. They may alter the ligand recognition properties of the receptors (particularly for the calcitonin receptor-like receptor, CLR). Very little structural information is available about RAMPs. Here, an ab initio model has been generated for the extracellular domain of RAMP1. The disulfide bond arrangement (Cys 27-Cys82, Cys40-Cys72, and Cys 57-Cys104) was determined by site-directed mutagenesis. The secondary structure (a-helices from residues 29-51, 60-80, and 87-100) was established from a consensus of predictive routines. Using these constraints, an assemblage of 25,000 structures was constructed and these were ranked using an all-atom statistical potential. The best 1000 conformations were energy minimized. The lowest scoring model was refined by molecular dynamics simulation. To validate our strategy, the same methods were applied to three proteins of known structure; PDB:1HP8, PDB:1V54 chain H (residues 21-85), and PDB:1T0P. When compared to the crystal structures, the models had root mean-square deviations of 3.8 Å, 4.1 Å, and 4.0 Å, respectively. The model of RAMP1 suggested that Phe93, Tyr 100, and Phe101 form a binding interface for CLR, whereas Trp74 and Phe92 may interact with ligands that bind to the CLR/RAMP1 heterodimer. © 2006 by the Biophysical Society.
Resumo:
G-protein coupled receptors (GPCRs) constitute the largest class of membrane proteins and are a major drug target. A serious obstacle to studying GPCR structure/function characteristics is the requirement to extract the receptors from their native environment in the plasma membrane, coupled with the inherent instability of GPCRs in the detergents required for their solubilization. In the present study, we report the first solubilization and purification of a functional GPCR [human adenosine A
Resumo:
The protein folding problem has been one of the most challenging subjects in biological physics due to its complexity. Energy landscape theory based on statistical mechanics provides a thermodynamic interpretation of the protein folding process. We have been working to answer fundamental questions about protein-protein and protein-water interactions, which are very important for describing the energy landscape surface of proteins correctly. At first, we present a new method for computing protein-protein interaction potentials of solvated proteins directly from SAXS data. An ensemble of proteins was modeled by Metropolis Monte Carlo and Molecular Dynamics simulations, and the global X-ray scattering of the whole model ensemble was computed at each snapshot of the simulation. The interaction potential model was optimized and iterated by a Levenberg-Marquardt algorithm. Secondly, we report that terahertz spectroscopy directly probes hydration dynamics around proteins and determines the size of the dynamical hydration shell. We also present the sequence and pH-dependence of the hydration shell and the effect of the hydrophobicity. On the other hand, kinetic terahertz absorption (KITA) spectroscopy is introduced to study the refolding kinetics of ubiquitin and its mutants. KITA results are compared to small angle X-ray scattering, tryptophan fluorescence, and circular dichroism results. We propose that KITA monitors the rearrangement of hydrogen bonding during secondary structure formation. Finally, we present development of the automated single molecule operating system (ASMOS) for a high throughput single molecule detector, which levitates a single protein molecule in a 10 µm diameter droplet by the laser guidance. I also have performed supporting calculations and simulations with my own program codes.
Resumo:
Microsecond long Molecular Dynamics (MD) trajectories of biomolecular processes are now possible due to advances in computer technology. Soon, trajectories long enough to probe dynamics over many milliseconds will become available. Since these timescales match the physiological timescales over which many small proteins fold, all atom MD simulations of protein folding are now becoming popular. To distill features of such large folding trajectories, we must develop methods that can both compress trajectory data to enable visualization, and that can yield themselves to further analysis, such as the finding of collective coordinates and reduction of the dynamics. Conventionally, clustering has been the most popular MD trajectory analysis technique, followed by principal component analysis (PCA). Simple clustering used in MD trajectory analysis suffers from various serious drawbacks, namely, (i) it is not data driven, (ii) it is unstable to noise and change in cutoff parameters, and (iii) since it does not take into account interrelationships amongst data points, the separation of data into clusters can often be artificial. Usually, partitions generated by clustering techniques are validated visually, but such validation is not possible for MD trajectories of protein folding, as the underlying structural transitions are not well understood. Rigorous cluster validation techniques may be adapted, but it is more crucial to reduce the dimensions in which MD trajectories reside, while still preserving their salient features. PCA has often been used for dimension reduction and while it is computationally inexpensive, being a linear method, it does not achieve good data compression. In this thesis, I propose a different method, a nonmetric multidimensional scaling (nMDS) technique, which achieves superior data compression by virtue of being nonlinear, and also provides a clear insight into the structural processes underlying MD trajectories. I illustrate the capabilities of nMDS by analyzing three complete villin headpiece folding and six norleucine mutant (NLE) folding trajectories simulated by Freddolino and Schulten [1]. Using these trajectories, I make comparisons between nMDS, PCA and clustering to demonstrate the superiority of nMDS. The three villin headpiece trajectories showed great structural heterogeneity. Apart from a few trivial features like early formation of secondary structure, no commonalities between trajectories were found. There were no units of residues or atoms found moving in concert across the trajectories. A flipping transition, corresponding to the flipping of helix 1 relative to the plane formed by helices 2 and 3 was observed towards the end of the folding process in all trajectories, when nearly all native contacts had been formed. However, the transition occurred through a different series of steps in all trajectories, indicating that it may not be a common transition in villin folding. The trajectories showed competition between local structure formation/hydrophobic collapse and global structure formation in all trajectories. Our analysis on the NLE trajectories confirms the notion that a tight hydrophobic core inhibits correct 3-D rearrangement. Only one of the six NLE trajectories folded, and it showed no flipping transition. All the other trajectories get trapped in hydrophobically collapsed states. The NLE residues were found to be buried deeply into the core, compared to the corresponding lysines in the villin headpiece, thereby making the core tighter and harder to undo for 3-D rearrangement. Our results suggest that the NLE may not be a fast folder as experiments suggest. The tightness of the hydrophobic core may be a very important factor in the folding of larger proteins. It is likely that chaperones like GroEL act to undo the tight hydrophobic core of proteins, after most secondary structure elements have been formed, so that global rearrangement is easier. I conclude by presenting facts about chaperone-protein complexes and propose further directions for the study of protein folding.
Resumo:
Phosphorylation is amongst the most crucial and well-studied post-translational modifications. It is involved in multiple cellular processes which makes phosphorylation prediction vital for understanding protein functions. However, wet-lab techniques are labour and time intensive. Thus, computational tools are required for efficiency. This project aims to provide a novel way to predict phosphorylation sites from protein sequences by adding flexibility and Sezerman Grouping amino acid similarity measure to previous methods, as discovering new protein sequences happens at a greater rate than determining protein structures. The predictor – NOPAY - relies on Support Vector Machines (SVMs) for classification. The features include amino acid encoding, amino acid grouping, predicted secondary structure, predicted protein disorder, predicted protein flexibility, solvent accessibility, hydrophobicity and volume. As a result, we have managed to improve phosphorylation prediction accuracy for Homo sapiens by 3% and 6.1% for Mus musculus. Sensitivity at 99% specificity was also increased by 6% for Homo sapiens and for Mus musculus by 5% on independent test sets. In this study, we have managed to increase phosphorylation prediction accuracy for Homo sapiens and Mus musculus. When there is enough data, future versions of the software may also be able to predict other organisms.
Resumo:
The folding and targeting of membrane proteins poses a major challenge to the cell, as they must remain insertion competent while their highly hydrophobic transmembrane (TM) domains are transferred from the ribosome, through the aqueous cytosol and into the lipid bilayer. The biogenesis of a mature membrane protein takes place through the insertion and integration into the lipid bilayer. A number of TM proteins have been shown to gain some degree of secondary structure within the ribosome tunnel and to retain this conformation throughout maturation. Although studies into the folding and targeting of a number of membrane proteins have been carried out to date, there is little information on one of the largest class of eukaryotic membrane proteins; the G-protein-coupled receptors (GPCRs). This project studies the early folding events of the human ortholog of GPR35. To analyse the structure of the 1st TM domain, intermediates were generated and assessed by the biochemical method of pegylation (PEG-MAL). A structurally-similar microbial opsin (Bacterioopsin) was also used to investigate the differences in the early protein folding within eukaryotic and prokaryotic translation systems. Results showed that neither the 1st TM domain of GPR35 nor Bacterioopsin were capable of compacting in the ribosome tunnel before their N-terminus reached the ribosome exit point. The results for this assay remained consistent whether the proteins were translated in a eukaryotic or prokaryotic translation system. To examine the communication mechanism between the ribosome, the nascent chain and the protein targeting pathway, crosslinking experiments were carried out using the homobifunctional lysine cross-linker BS3. Specifically, the data generated here show that the nascent chain of GPR35 reaches the ribosomal protein uL23 in an extended conformation and interacts with the SRP protein as it exits the ribosome tunnel. This confirms the role of SRP in the co-translational targeting of GPR35. Using these methods insights into the early folding of GPCRs has been obtained. Further experiments using site-directed mutagenesis to reduce hydrophobicity in the 1st TM domain of GPR35, highlighted the mechanisms by which GPCRs are targeted to the endoplasmic reticulum. Confirming that hydrophobicity within the signal anchor sequence is essential of SRP-dependent targeting. Following the successful interaction of the nascent GPR35 and SRP, GPR35 is successfully targeted to ER membranes, shown here as dog pancreas microsomes (DPMs). Glycosylation of the GPR35 N-terminus was used to determine nascent chain structure as it is inserted into the ER membrane. These glycosylation experiments confirm that TM1 has obtained its compacted state whilst residing in the translocon. Finally, a site-specific cross-linking approach using the homobifunctional cysteine cross-linker, BMH, was used to study the lateral integration of GPR35 into the ER. Cross-linking of GPR35 TM1 and TM2 could be detected adjacent to a protein of ~45kDa, believed to be Sec61α. The loss of this adduct, as the nascent chain extends, showed the lateral movement of GPR35 TM1 from the translocon was dependent on the subsequent synthesis of TM2.
Resumo:
In this paper was demonstrated that umbelliferone induces changes in structure and pharmacological activities of Bn IV, a lysine 49 secretory phospholipase A(2) (sPLA2) from Both tops neuwiedi. Incubation of Bn IV with umbelliferone virtually abolished platelet aggregation, edema, and myotoxicity induced by native Bn IV. The amino acid sequence of Bn IV showed high sequence similarities with other Lys49 sPLA2s from B. jararacussu (BthTx-I), B. pirajai (PrTx-I), and B. neuwiedi pauloensis (Bn SP6 and Bn SP7). This sPLA2 also has a highly conserved C-terminal amino acid sequence, which has been shown as important for the pharmacological activities of Lys49 sPLA2. Sequencing of Bn IV previously treated with umbelliferone revealed modification of S(1) and S(20). Fluorescent spectral analysis and circular dichroism (CD) studies showed that umbelliferone modified the secondary structure of this protein. Moreover, the pharmacological activity of Bn IV is driven by synergism of the C-terminal region with the a-helix motifs, which are involved in substrate binding of the Asp49 and Lys49 residues of 5PLA2 and have a direct effect on the Ca2+-independent membrane damage of some secretory snake venom PLA2. For Bn IV, these interactions are potentially important for triggering the pharmacological activity of this 5PLA2. (C) 2011 Elsevier Ltd. All rights reserved.
Resumo:
Dissertação de mestrado, Biotecnologia, Faculdade de Ciências e Tecnologia, Universidade do Algarve, 2014
Resumo:
The assembly of outer membranes of the cell wall of Gram-negative bacteria and of various organelles of eukaryotic cells requires the evolutionarily conserved β-barrel-assembly machinery (BAM) complex. This thesis describes the biochemical and biophysical properties of the periplasmic domain of the β-barrel assembly machinery protein A (PD-BamA) of the E. coli BAM complex, its effect on insertion and folding of the Outer membrane protein A (OmpA) into lipid bilayers and the identification of regions of PD-BamA that may be involved in protein-protein interactions. The secondary structure of PD-BamA in mixed lipid bilayers, analyzed by Circular dichroism (CD) spectroscopy, contained less β-sheet at an increased content of phosphatidylglycerol (PG) in the lipid membrane. This result showed membrane binding, albeit only in the presence of negatively charged lipids. Fluorescence spectroscopy demonstrated that PD-BamA only binds to lipid bilayers containing the negatively charged DOPG, confirming the results of CD spectroscopy. PD-BamA did not bind to zwitterionic but overall neutral lipid bilayers. PD-BamA bound to OmpA at a stoichiometry of 1:1. PD-BamA strongly facilitated insertion and folding of OmpA into lipid membranes. Kinetics of PD-BamA mediated folding of OmpA was well described by two parallel folding processes, a fast folding process and a slow folding process, differing by 2-3 orders of magnitude in their rate constants. The folding yields of OmpA depended on the concentration of lipid membranes and also on the lipid head groups. The presence of PD-BamA resulted in increased folding yields of OmpA in negatively charged DOPG, but PD-BamA did not affect the folding kinetics of OmpA into bilayers of zwitterionic but overall neutral lipids. The efficiency of folding and insertion of OmpA into lipid bilayers strongly depended on the ratio PD-BamA/OmpA and was optimal at equimolar concentrations of PD-BamA and OmpA. To examine complexes of unfolded OmpA with PD-BamA in more detail, site-directed spectroscopy was used to explore contact regions in both, PD-BamA and OmpA. Similarly, contact regions were also investigated for another protein complex formed by PD-BamA and the lipoprotein BamD. The obtained data suggest, that the site of interaction on PD-BamA for OmpA might be oriented towards the exterior environment away from the preceding POTRA domains, but that PD-BamA is oriented with its short α-helix α1 of POTRA domain 5 towards the C-terminal end of BamD.
Resumo:
Background The residue-wise contact order (RWCO) describes the sequence separations between the residues of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered as a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable important information to reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. Results We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences. Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. Conclusion The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.
Resumo:
A library containing approximately 40,000 small RNA sequences was constructed for Brassica napus. Analysis of 3025 sequences obtained from this library resulted in the identification of 11 conserved miRNA families, which were validated by secondary structure prediction using surrounding sequences in the Brassica genome. Two 21 nt small RNA sequences reside within the arm of a pre-miRNA like stem-loop structure, making them likely candidates for novel non-conserved miRNAs in B. napus. Most of the conserved miRNAs were expressed at similar levels in a F1 hybrid B. napus line and its four double haploid progeny that showed marked variations in phenotypes, but many were differentially expressed between B. napus and Arabidopsis. The miR169 family was expressed at high levels in young leaves and stems, but was undetectable in roots and mature leaves, suggesting that miR169 expression is developmentally regulated in B. napus. © 2007 Federation of European Biochemical Societies.
Resumo:
In this paper, the complete mitochondrial genome of Acraea issoria (Lepidoptera: Nymphalidae: Heliconiinae: Acraeini) is reported; a circular molecule of 15,245 bp in size. For A. issoria, genes are arranged in the same order and orientation as the complete sequenced mitochondrial genomes of the other lepidopteran species, except for the presence of an extra copy of tRNAIle(AUR)b in the control region. All protein-coding genes of A. issoria mitogenome start with a typical ATN codon and terminate in the common stop codon TAA, except that COI gene uses TTG as its initial codon and terminates in a single T residue. All tRNA genes possess the typical clover leaf secondary structure except for tRNASer(AGN), which has a simple loop with the absence of the DHU stem. The sequence, organization and other features including nucleotide composition and codon usage of this mitochondrial genome were also reported and compared with those of other sequenced lepidopterans mitochondrial genomes. There are some short microsatellite-like repeat regions (e.g., (TA)9, polyA and polyT) scattered in the control region, however, the conspicuous macro-repeats units commonly found in other insect species are absent.
Resumo:
We present a machine learning model that predicts a structural disruption score from a protein s primary structure. SCHEMA was introduced by Frances Arnold and colleagues as a method for determining putative recombination sites of a protein on the basis of the full (PDB) description of its structure. The present method provides an alternative to SCHEMA that is able to determine the same score from sequence data only. Circumventing the need for resolving the full structure enables the exploration of yet unresolved and even hypothetical sequences for protein design efforts. Deriving the SCHEMA score from a primary structure is achieved using a two step approach: first predicting a secondary structure from the sequence and then predicting the SCHEMA score from the predicted secondary structure. The correlation coefficient for the prediction is 0.88 and indicates the feasibility of replacing SCHEMA with little loss of precision.
Resumo:
The chemokine receptor CCR5 contains seven transmembrane-spanning domains. It binds chemokines and acts as co-receptor for macrophage (m)-tropic (or R5) strains of HIV-1. Monoclonal antibodies (mAb) to CCR5, 3A9 and 5C7, were used for biopanning a nonapeptide cysteine (C)-constrained phage-displayed random peptide library to ascertain contact residues and define tertiary structures of possible epitopes on CCR5. Reactivity of antibodies with phagotopes was established by enzyme-linked immunosorbent assay (ELISA). mAb 3A9 identified a phagotope C-HASIYDFGS-C (3A9/1), and 5C7 most frequently identified C-PHWLRDLRV-C (5C7/1). Corresponding peptides were synthesized. Phagotopes and synthetic peptides reacted in ELISA with corresponding antibodies and synthetic peptides inhibited antibody binding to the phagotopes. Reactivity by immunofluorescence of 3A9 with CCR5 was strongly inhibited by the corresponding peptide. Both mAb 3A9 and 5C7 reacted similarly with phagotopes and the corresponding peptide selected by the alternative mAb. The sequences of peptide inserts of phagotopes could be aligned as mimotopes of the sequence of CCR5. For phage 3A9/1, the motif SIYD aligned to residues at the N terminus and FG to residues on the first extracellular loop; for 5C7/1, residues at the N terminus, first extracellular loop, and possibly the third extracellular loop could be aligned and so would contribute to the mimotope. The synthetic peptides corresponding to the isolated phagotopes showed a CD4-dependent reactivity with gp120 of a primary, m-tropic HIV-1 isolate. Thus reactivity of antibodies raised to CCR5 against phage-displayed peptides defined mimotopes that reflect binding sites for these antibodies and reveal a part of the gp120 binding sites on CCR5.
Resumo:
The structural stabilizing property of 2,2,2-trifluoroethanol (TFE) in peptides has been widely demonstrated, More recently, TFE has been shown to enhance secondary structure content in globular proteins, and to influence quaternary interactions in protein multimers. The molecular mechanisms by which TFE exerts its Influence on peptide and protein structures remain poorly understood. The present analysis integrates the known physical properties of TFE with a variety of experimental observations on the interaction of TFE with peptides and proteins and on the properties of fluorocarbons. Two features of TFE, namely the hydrophobicity of the trifluoromethyl group and the hydrogen bonding character (strong donor and poor acceptor), emerge as the most important factors for rationalising the observed effects of TFE. A model is proposed for TFE interaction with peptides which involves an initial replacement of the hydration shell by fluoroalcohol molecules, a process driven by apolar interactions and favourable entropy of dehydration. Subsequent bifurcated hydrogen-bond formation with peptide carbonyl groups, which leave intramolecular interactions unaffected, promotes secondary structure formation.