963 results for Protein secondary structure


Relevance:

100.00% 100.00%

Publisher:

Abstract:

We describe a new method for using neural networks to predict residue contact pairs in a protein. The main inputs to the neural network are a set of 25 measures of correlated mutation between all pairs of residues in two windows of size 5 centered on the residues of interest. While the individual pair-wise correlations are a relatively weak predictor of contact, by training the network on windows of correlation the accuracy of prediction is significantly improved. The neural network is trained on a set of 100 proteins and then tested on a disjoint set of 1033 proteins of known structure. An average predictive accuracy of 21.7% is obtained taking the best L/2 predictions for each protein, where L is the sequence length. Taking the best L/10 predictions gives an average accuracy of 30.7%. The predictor is also tested on a set of 59 proteins from the CASP5 experiment. The accuracy is found to be relatively consistent across different sequence lengths, but to vary widely according to the secondary structure. Predictive accuracy is also found to improve by using multiple sequence alignments containing many sequences to calculate the correlations. (C) 2004 Wiley-Liss, Inc.
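
The input encoding and the top-L/k accuracy measure described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' code: the function names, the zero padding at the sequence ends, and the dictionary layout of the scores are all assumptions made here.

```python
def window_features(corr, i, j, half_width=2, pad=0.0):
    """Collect correlated-mutation values for all residue pairs drawn from
    two windows centred on residues i and j (5 x 5 = 25 values by default).

    corr is a square matrix (list of lists) of pairwise correlation scores.
    Positions that fall off either end of the sequence are filled with
    `pad`; this edge handling is an assumption, not taken from the abstract.
    """
    n = len(corr)
    features = []
    for a in range(-half_width, half_width + 1):
        for b in range(-half_width, half_width + 1):
            r, c = i + a, j + b
            if 0 <= r < n and 0 <= c < n:
                features.append(corr[r][c])
            else:
                features.append(pad)
    return features


def top_fraction_accuracy(scores, contacts, length, divisor):
    """Fraction of true contacts among the best length // divisor predictions.

    scores maps a residue pair (i, j) to a predicted contact score;
    contacts is the set of pairs that are in contact in the known structure.
    """
    k = max(1, length // divisor)
    ranked = sorted(scores, key=scores.get, reverse=True)[:k]
    return sum(1 for pair in ranked if pair in contacts) / len(ranked)
```

With `divisor=2` the second function corresponds to the L/2 figure quoted above, and with `divisor=10` to the L/10 figure.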

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Cystic fibrosis is caused by mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) gene, which encodes a chloride channel present in many cells. We report that, in cardiomyocytes, multiple exon 1 usage and alternative splicing produce four CFTR transcripts with different 5'-untranslated regions: CFTRTRAD-139, CFTR-1C/-1A, CFTR-1C, and CFTR-1B. CFTR transcripts containing the novel upstream exons (exons -1C, -1B, and -1A) represent more than 90% of cardiac expressed CFTR mRNA. Regulation of cardiac CFTR expression, in response to developmental and pathological stimuli, is exclusively due to the modulation of CFTR-1C and CFTR-1C/-1A expression. Upstream open reading frames have been identified in the 5'-untranslated regions of all CFTR transcripts that, in conjunction with adjacent stem-loop structures, modulate the efficiency of translation initiation at the AUG codon of the main CFTR coding region in CFTRTRAD-139 and CFTR-1C/-1A transcripts. Exon(-1A), only present in CFTR-1C/-1A transcripts, encodes an AUG codon that is in-frame with the main CFTR open reading frame, the efficient translation of which produces a novel CFTR protein isoform with a curtailed amino terminus. As the expression of this CFTR transcript parallels the spatial and temporal distribution of the cAMP-activated whole-cell current density in normal and diseased hearts, we suggest that CFTR-1C/-1A provides the molecular basis for the cardiac cAMP-activated chloride channel. Our findings provide further insight into the complex nature of in vivo CFTR expression, to which multiple mRNA transcripts, protein isoforms, and post-transcriptional regulatory mechanisms are now added.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Background: Protein tertiary structure can be partly characterized via each amino acid's contact number measuring how residues are spatially arranged. The contact number of a residue in a folded protein is a measure of its exposure to the local environment, and is defined as the number of C-beta atoms in other residues within a sphere around the C-beta atom of the residue of interest. Contact number is partly conserved between protein folds and thus is useful for protein fold and structure prediction. In turn, each residue's contact number can be partially predicted from primary amino acid sequence, assisting tertiary fold analysis from sequence data. In this study, we provide a more accurate contact number prediction method from protein primary sequence. Results: We predict contact number from protein sequence using a novel support vector regression algorithm. Using protein local sequences with multiple sequence alignments (PSI-BLAST profiles), we demonstrate a correlation coefficient between predicted and observed contact numbers of 0.70, which outperforms previously achieved accuracies. Including additional information about sequence weight and amino acid composition further improves prediction accuracies significantly with the correlation coefficient reaching 0.73. If residues are classified as being either contacted or non-contacted, the prediction accuracies are all greater than 77%, regardless of the choice of classification thresholds. Conclusion: The successful application of support vector regression to the prediction of protein contact number reported here, together with previous applications of this approach to the prediction of protein accessible surface area and B-factor profile, suggests that a support vector regression approach may be very useful for determining the structure-function relation between primary sequence and higher order consecutive protein structural and functional properties.
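
The contact number defined in this abstract (the count of C-beta atoms from other residues inside a sphere around a residue's own C-beta atom) can be computed directly from coordinates. The sketch below is illustrative only: the function name and the default radius of 10 Å are assumptions, since the abstract does not state the sphere radius.

```python
import math


def contact_numbers(cb_coords, radius=10.0):
    """Return the contact number of every residue.

    cb_coords is a list of (x, y, z) C-beta coordinates, one per residue.
    The radius (in the same units as the coordinates) is an assumed value.
    """
    counts = []
    for i, centre in enumerate(cb_coords):
        count = 0
        for j, other in enumerate(cb_coords):
            # Count C-beta atoms of *other* residues inside the sphere.
            if i != j and math.dist(centre, other) <= radius:
                count += 1
        counts.append(count)
    return counts
```

A residue can then be labelled contacted or non-contacted by comparing its count against a chosen classification threshold, as the abstract describes.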

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The caseins (alpha(s1), alpha(s2), beta, and kappa) are phosphoproteins present in bovine milk that have been studied for over a century and whose structures remain obscure. Here we describe the chemical synthesis and structure elucidation of the N-terminal segment (1-44) of bovine κ-casein, the protein which maintains the micellar structure of the caseins. κ-Casein (1-44) was synthesised by highly optimised Boc solid-phase peptide chemistry and characterised by mass spectrometry. Structure elucidation was carried out by circular dichroism (CD) and nuclear magnetic resonance (NMR) spectroscopy. CD analysis demonstrated that the segment was ill-defined in aqueous medium, but in 30% trifluoroethanol it exhibited considerable helical structure. Further, NMR analysis showed the presence of a helical segment containing 26 residues which extends from Pro(8) to Arg(34). This is the first report which demonstrates extensive secondary structure within the casein class of proteins. (c) 2006 Elsevier Inc. All rights reserved.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

In this study, we propose a novel method to predict the solvent accessible surface areas of transmembrane residues. For both transmembrane alpha-helix and beta-barrel residues, the correlation coefficients between the predicted and observed accessible surface areas are around 0.65. On the basis of predicted accessible surface areas, residues exposed to the lipid environment or buried inside a protein can be identified by using certain cutoff thresholds. We have extensively examined our approach based on different definitions of accessible surface areas and a variety of sets of control parameters. Given that experimentally determining the structures of membrane proteins is very difficult and membrane proteins are actually abundant in nature, our approach is useful for theoretically modeling membrane protein tertiary structures, particularly for modeling the assembly of transmembrane domains. This approach can be used to annotate the membrane proteins in proteomes to provide extra structural and functional information.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The green fluorescent protein (avGFP), its variants, and the closely related GFP-like proteins are characterized structurally by a cyclic tri-peptide chromophore located centrally within a conserved beta-can fold. Traditionally, these GFP family members have been isolated from the Cnidaria; recently, distantly related GFP-like proteins from the Bilateria, a sister group of the Cnidaria, have been described, although no representative structure from this phylum has been reported to date. We have determined to 2.1 Å resolution the crystal structure of copGFP, a representative GFP-like protein from a copepod, a member of the Bilateria. The structure of copGFP revealed that, despite sharing only 19% sequence identity with GFP, the tri-peptide chromophore (Gly57-Tyr58-Gly59) of copGFP adopted a cis coplanar conformation within the conserved beta-can fold. However, the immediate environment surrounding the chromophore of copGFP was markedly atypical when compared to other members of the GFP superfamily, with a large network of bulky residues observed to surround the chromophore. Arg87 and Glu222 (GFP numbering 96 and 222), the only two residues conserved between copGFP, GFP and GFP-like proteins, are involved in autocatalytic genesis of the chromophore. Accordingly, the copGFP structure provides an alternative platform for the development of a new suite of fluorescent protein tools. Moreover, the structure suggests that the autocatalytic genesis of the chromophore is remarkably tolerant to a high degree of sequence and structural variation within the beta-can fold of the GFP superfamily. (c) 2006 Elsevier Ltd. All rights reserved.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Plant resistance proteins (R proteins) recognize corresponding pathogen avirulence (Avr) proteins either indirectly through detection of changes in their host protein targets or through direct R-Avr protein interaction. Although indirect recognition imposes selection against Avr effector function, pathogen effector molecules recognized through direct interaction may overcome resistance through sequence diversification rather than loss of function. Here we show that the flax rust fungus AvrL567 genes, whose products are recognized by the L5, L6, and L7 R proteins of flax, are highly diverse, with 12 sequence variants identified from six rust strains. Seven AvrL567 variants derived from Avr alleles induce necrotic responses when expressed in flax plants containing corresponding resistance genes (R genes), whereas five variants from avr alleles do not. Differences in recognition specificity between AvrL567 variants and evidence for diversifying selection acting on these genes suggest they have been involved in a gene-specific arms race with the corresponding flax R genes. Yeast two-hybrid assays indicate that recognition is based on direct R-Avr protein interaction and recapitulate the interaction specificity observed in planta. Biochemical analysis of Escherichia coli-produced AvrL567 proteins shows that variants that escape recognition nevertheless maintain a conserved structure and stability, suggesting that the amino acid sequence differences directly affect the R-Avr protein interaction. We suggest that direct recognition associated with high genetic diversity at corresponding R and Avr gene loci represents an alternative outcome of plant-pathogen coevolution to indirect recognition associated with simple balanced polymorphisms for functional and nonfunctional R and Avr genes.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Receptor activity modifying proteins (RAMPs) are a family of single-pass transmembrane proteins that dimerize with G-protein-coupled receptors. They may alter the ligand recognition properties of the receptors (particularly for the calcitonin receptor-like receptor, CLR). Very little structural information is available about RAMPs. Here, an ab initio model has been generated for the extracellular domain of RAMP1. The disulfide bond arrangement (Cys27-Cys82, Cys40-Cys72, and Cys57-Cys104) was determined by site-directed mutagenesis. The secondary structure (α-helices from residues 29-51, 60-80, and 87-100) was established from a consensus of predictive routines. Using these constraints, an assemblage of 25,000 structures was constructed and these were ranked using an all-atom statistical potential. The best 1000 conformations were energy minimized. The lowest scoring model was refined by molecular dynamics simulation. To validate our strategy, the same methods were applied to three proteins of known structure: PDB:1HP8, PDB:1V54 chain H (residues 21-85), and PDB:1T0P. When compared to the crystal structures, the models had root mean-square deviations of 3.8 Å, 4.1 Å, and 4.0 Å, respectively. The model of RAMP1 suggested that Phe93, Tyr100, and Phe101 form a binding interface for CLR, whereas Trp74 and Phe92 may interact with ligands that bind to the CLR/RAMP1 heterodimer. © 2006 by the Biophysical Society.
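
The root mean-square deviation used for validation in this abstract can be sketched as below. The sketch assumes that the model and the crystal structure have already been superimposed and that atoms are listed in matching order; the abstract does not say which atoms were compared or how the structures were aligned, and the function name is hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean-square deviation between two equally long lists of
    (x, y, z) coordinates that are already superimposed (an assumption)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        # Squared distance between the two corresponding atoms.
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))
```

With coordinates in ångströms, the result is in ångströms, the unit of the values quoted above.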

Relevance:

100.00% 100.00%

Publisher:

Abstract:

G-protein coupled receptors (GPCRs) constitute the largest class of membrane proteins and are a major drug target. A serious obstacle to studying GPCR structure/function characteristics is the requirement to extract the receptors from their native environment in the plasma membrane, coupled with the inherent instability of GPCRs in the detergents required for their solubilization. In the present study, we report the first solubilization and purification of a functional GPCR [human adenosine A2A receptor (A2AR)], in the total absence of detergent at any stage, by exploiting spontaneous encapsulation by styrene maleic acid (SMA) co-polymer direct from the membrane into a nanoscale SMA lipid particle (SMALP). Furthermore, the A2AR-SMALP, generated from yeast (Pichia pastoris) or mammalian cells, exhibited increased thermostability (∼5°C) compared with detergent [DDM (n-dodecyl-β-D-maltopyranoside)]-solubilized A2AR controls. The A2AR-SMALP was also stable when stored for prolonged periods at 4°C and was resistant to multiple freeze-thaw cycles, in marked contrast with the detergent-solubilized receptor. These properties establish the potential for using GPCR-SMALP in receptor-based drug discovery assays. Moreover, in contrast with nanodiscs stabilized by scaffold proteins, the non-proteinaceous nature of the SMA polymer allowed unobscured biophysical characterization of the embedded receptor. Consequently, CD spectroscopy was used to relate changes in secondary structure to loss of ligand binding ([3H]ZM241385) capability. SMALP-solubilization of GPCRs, retaining the annular lipid environment, will enable a wide range of therapeutic targets to be prepared in native-like state to aid drug discovery and understanding of GPCR molecular mechanisms.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The protein folding problem has been one of the most challenging subjects in biological physics due to its complexity. Energy landscape theory based on statistical mechanics provides a thermodynamic interpretation of the protein folding process. We have been working to answer fundamental questions about protein-protein and protein-water interactions, which are very important for describing the energy landscape surface of proteins correctly. First, we present a new method for computing protein-protein interaction potentials of solvated proteins directly from small angle X-ray scattering (SAXS) data. An ensemble of proteins was modeled by Metropolis Monte Carlo and Molecular Dynamics simulations, and the global X-ray scattering of the whole model ensemble was computed at each snapshot of the simulation. The interaction potential model was optimized and iterated by a Levenberg-Marquardt algorithm. Secondly, we report that terahertz spectroscopy directly probes hydration dynamics around proteins and determines the size of the dynamical hydration shell. We also present the sequence and pH dependence of the hydration shell and the effect of hydrophobicity. In addition, kinetic terahertz absorption (KITA) spectroscopy is introduced to study the refolding kinetics of ubiquitin and its mutants. KITA results are compared to SAXS, tryptophan fluorescence, and circular dichroism results. We propose that KITA monitors the rearrangement of hydrogen bonding during secondary structure formation. Finally, we present the development of the automated single molecule operating system (ASMOS) for a high-throughput single molecule detector, which levitates a single protein molecule in a 10 µm diameter droplet by laser guidance. I have also performed supporting calculations and simulations with my own program codes.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Microsecond-long Molecular Dynamics (MD) trajectories of biomolecular processes are now possible due to advances in computer technology. Soon, trajectories long enough to probe dynamics over many milliseconds will become available. Since these timescales match the physiological timescales over which many small proteins fold, all-atom MD simulations of protein folding are now becoming popular. To distill features of such large folding trajectories, we must develop methods that can both compress trajectory data to enable visualization and lend themselves to further analysis, such as the finding of collective coordinates and reduction of the dynamics. Conventionally, clustering has been the most popular MD trajectory analysis technique, followed by principal component analysis (PCA). Simple clustering used in MD trajectory analysis suffers from various serious drawbacks, namely, (i) it is not data driven, (ii) it is unstable to noise and change in cutoff parameters, and (iii) since it does not take into account interrelationships amongst data points, the separation of data into clusters can often be artificial. Usually, partitions generated by clustering techniques are validated visually, but such validation is not possible for MD trajectories of protein folding, as the underlying structural transitions are not well understood. Rigorous cluster validation techniques may be adapted, but it is more crucial to reduce the dimensions in which MD trajectories reside, while still preserving their salient features. PCA has often been used for dimension reduction; while it is computationally inexpensive, it is a linear method and does not achieve good data compression. In this thesis, I propose a different method, a nonmetric multidimensional scaling (nMDS) technique, which achieves superior data compression by virtue of being nonlinear, and also provides a clear insight into the structural processes underlying MD trajectories.
I illustrate the capabilities of nMDS by analyzing three complete villin headpiece folding and six norleucine mutant (NLE) folding trajectories simulated by Freddolino and Schulten [1]. Using these trajectories, I make comparisons between nMDS, PCA and clustering to demonstrate the superiority of nMDS. The three villin headpiece trajectories showed great structural heterogeneity. Apart from a few trivial features like early formation of secondary structure, no commonalities between trajectories were found. There were no units of residues or atoms found moving in concert across the trajectories. A flipping transition, corresponding to the flipping of helix 1 relative to the plane formed by helices 2 and 3 was observed towards the end of the folding process in all trajectories, when nearly all native contacts had been formed. However, the transition occurred through a different series of steps in all trajectories, indicating that it may not be a common transition in villin folding. The trajectories showed competition between local structure formation/hydrophobic collapse and global structure formation in all trajectories. Our analysis on the NLE trajectories confirms the notion that a tight hydrophobic core inhibits correct 3-D rearrangement. Only one of the six NLE trajectories folded, and it showed no flipping transition. All the other trajectories get trapped in hydrophobically collapsed states. The NLE residues were found to be buried deeply into the core, compared to the corresponding lysines in the villin headpiece, thereby making the core tighter and harder to undo for 3-D rearrangement. Our results suggest that the NLE may not be a fast folder as experiments suggest. The tightness of the hydrophobic core may be a very important factor in the folding of larger proteins. It is likely that chaperones like GroEL act to undo the tight hydrophobic core of proteins, after most secondary structure elements have been formed, so that global rearrangement is easier. 
I conclude by presenting facts about chaperone-protein complexes and propose further directions for the study of protein folding.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Phosphorylation is amongst the most crucial and well-studied post-translational modifications. It is involved in multiple cellular processes, which makes phosphorylation prediction vital for understanding protein functions. However, wet-lab techniques are labour- and time-intensive, so computational tools are required for efficiency. This project aims to provide a novel way to predict phosphorylation sites from protein sequences by adding flexibility and the Sezerman Grouping amino acid similarity measure to previous methods, as discovering new protein sequences happens at a greater rate than determining protein structures. The predictor, NOPAY, relies on Support Vector Machines (SVMs) for classification. The features include amino acid encoding, amino acid grouping, predicted secondary structure, predicted protein disorder, predicted protein flexibility, solvent accessibility, hydrophobicity and volume. As a result, we have managed to improve phosphorylation prediction accuracy by 3% for Homo sapiens and by 6.1% for Mus musculus. Sensitivity at 99% specificity was also increased by 6% for Homo sapiens and by 5% for Mus musculus on independent test sets. When there is enough data, future versions of the software may also be able to predict phosphorylation sites in other organisms.
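
The "sensitivity at 99% specificity" figure reported in this abstract can be computed from classifier scores as sketched below. This is a generic illustration, not NOPAY code: the function name, the convention that a site is predicted positive when its score is at or above the threshold, and the 0/1 label encoding are assumptions.

```python
def sensitivity_at_specificity(scores, labels, min_specificity=0.99):
    """Best sensitivity reachable by any score threshold whose specificity
    is at least min_specificity.

    scores: classifier outputs, higher meaning more likely phosphorylated.
    labels: 1 for true phosphorylation sites, 0 for non-sites.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    best = 0.0
    for threshold in set(scores):
        # Negatives scored below the threshold are correctly rejected.
        specificity = sum(s < threshold for s in negatives) / len(negatives)
        if specificity >= min_specificity:
            sensitivity = sum(s >= threshold for s in positives) / len(positives)
            best = max(best, sensitivity)
    return best
```

Fixing the specificity first and then reading off the sensitivity makes two predictors comparable at the same false-positive rate.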

Relevance:

100.00% 100.00%

Publisher:

Abstract:

The folding and targeting of membrane proteins pose a major challenge to the cell, as they must remain insertion competent while their highly hydrophobic transmembrane (TM) domains are transferred from the ribosome, through the aqueous cytosol and into the lipid bilayer. The biogenesis of a mature membrane protein takes place through the insertion and integration into the lipid bilayer. A number of TM proteins have been shown to gain some degree of secondary structure within the ribosome tunnel and to retain this conformation throughout maturation. Although studies into the folding and targeting of a number of membrane proteins have been carried out to date, there is little information on one of the largest classes of eukaryotic membrane proteins: the G-protein-coupled receptors (GPCRs). This project studies the early folding events of the human ortholog of GPR35. To analyse the structure of the 1st TM domain, intermediates were generated and assessed by the biochemical method of pegylation (PEG-MAL). A structurally similar microbial opsin (Bacterioopsin) was also used to investigate the differences in the early protein folding within eukaryotic and prokaryotic translation systems. Results showed that neither the 1st TM domain of GPR35 nor Bacterioopsin was capable of compacting in the ribosome tunnel before its N-terminus reached the ribosome exit point. The results for this assay remained consistent whether the proteins were translated in a eukaryotic or prokaryotic translation system. To examine the communication mechanism between the ribosome, the nascent chain and the protein targeting pathway, crosslinking experiments were carried out using the homobifunctional lysine cross-linker BS3. Specifically, the data generated here show that the nascent chain of GPR35 reaches the ribosomal protein uL23 in an extended conformation and interacts with the SRP protein as it exits the ribosome tunnel. 
This confirms the role of SRP in the co-translational targeting of GPR35. Using these methods, insights into the early folding of GPCRs have been obtained. Further experiments using site-directed mutagenesis to reduce hydrophobicity in the 1st TM domain of GPR35 highlighted the mechanisms by which GPCRs are targeted to the endoplasmic reticulum, confirming that hydrophobicity within the signal anchor sequence is essential for SRP-dependent targeting. Following the successful interaction of the nascent GPR35 and SRP, GPR35 is successfully targeted to ER membranes, shown here as dog pancreas microsomes (DPMs). Glycosylation of the GPR35 N-terminus was used to determine nascent chain structure as it is inserted into the ER membrane. These glycosylation experiments confirm that TM1 has obtained its compacted state whilst residing in the translocon. Finally, a site-specific cross-linking approach using the homobifunctional cysteine cross-linker, BMH, was used to study the lateral integration of GPR35 into the ER. Cross-linking of GPR35 TM1 and TM2 could be detected adjacent to a protein of ~45 kDa, believed to be Sec61α. The loss of this adduct, as the nascent chain extends, showed that the lateral movement of GPR35 TM1 from the translocon was dependent on the subsequent synthesis of TM2.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

This paper demonstrates that umbelliferone induces changes in the structure and pharmacological activities of Bn IV, a lysine 49 secretory phospholipase A(2) (sPLA2) from Bothrops neuwiedi. Incubation of Bn IV with umbelliferone virtually abolished platelet aggregation, edema, and myotoxicity induced by native Bn IV. The amino acid sequence of Bn IV showed high sequence similarity with other Lys49 sPLA2s from B. jararacussu (BthTx-I), B. pirajai (PrTx-I), and B. neuwiedi pauloensis (Bn SP6 and Bn SP7). This sPLA2 also has a highly conserved C-terminal amino acid sequence, which has been shown to be important for the pharmacological activities of Lys49 sPLA2. Sequencing of Bn IV previously treated with umbelliferone revealed modification of S(1) and S(20). Fluorescent spectral analysis and circular dichroism (CD) studies showed that umbelliferone modified the secondary structure of this protein. Moreover, the pharmacological activity of Bn IV is driven by synergism of the C-terminal region with the α-helix motifs, which are involved in substrate binding of the Asp49 and Lys49 residues of sPLA2 and have a direct effect on the Ca2+-independent membrane damage of some secretory snake venom PLA2. For Bn IV, these interactions are potentially important for triggering the pharmacological activity of this sPLA2. (C) 2011 Elsevier Ltd. All rights reserved.

Relevance:

100.00% 100.00%

Publisher:

Abstract:

Master's dissertation, Biotechnology, Faculdade de Ciências e Tecnologia, Universidade do Algarve, 2014