869 resultados para universal in silico predictor of protein protein interaction
Resumo:
Allergy is an overreaction by the immune system to a previously encountered, ordinarily harmless substance - typically proteins - resulting in skin rash, swelling of mucous membranes, sneezing or wheezing, or other abnormal conditions. The use of modified proteins is increasingly widespread: their presence in food, commercial products, such as washing powder, and medical therapeutics and diagnostics, makes predicting and identifying potential allergens a crucial societal issue. The prediction of allergens has been explored widely using bioinformatics, with many tools being developed in the last decade; many of these are freely available online. Here, we report a set of novel models for allergen prediction utilizing amino acid E-descriptors, auto- and cross-covariance transformation, and several machine learning methods for classification, including logistic regression (LR), decision tree (DT), naïve Bayes (NB), random forest (RF), multilayer perceptron (MLP) and k nearest neighbours (kNN). The best performing method was kNN with 85.3% accuracy at 5-fold cross-validation. The resulting model has been implemented in a revised version of the AllerTOP server (http://www.ddg-pharmfac.net/AllerTOP). © Springer-Verlag 2014.
Resumo:
It has been generally acknowledged that the module structure of protein interaction networks plays a crucial role with respect to the functional understanding of these networks. In this paper, we study evolutionary aspects of the module structure of protein interaction networks, which forms a mesoscopic level of description with respect to the architectural principles of networks. The purpose of this paper is to investigate limitations of well known gene duplication models by showing that these models are lacking crucial structural features present in protein interaction networks on a mesoscopic scale. This observation reveals our incomplete understanding of the structural evolution of protein networks on the module level. © 2012 Emmert-Streib.
Resumo:
Protein–ligand binding site prediction methods aim to predict, from amino acid sequence, protein–ligand interactions, putative ligands, and ligand binding site residues using either sequence information, structural information, or a combination of both. In silico characterization of protein–ligand interactions has become extremely important to help determine a protein’s functionality, as in vivo-based functional elucidation is unable to keep pace with the current growth of sequence databases. Additionally, in vitro biochemical functional elucidation is time-consuming, costly, and may not be feasible for large-scale analysis, such as drug discovery. Thus, in silico prediction of protein–ligand interactions must be utilized to aid in functional elucidation. Here, we briefly discuss protein function prediction, prediction of protein–ligand interactions, the Critical Assessment of Techniques for Protein Structure Prediction (CASP) and the Continuous Automated EvaluatiOn (CAMEO) competitions, along with their role in shaping the field. We also discuss, in detail, our cutting-edge web-server method, FunFOLD for the structurally informed prediction of protein–ligand interactions. Furthermore, we provide a step-by-step guide on using the FunFOLD web server and FunFOLD3 downloadable application, along with some real world examples, where the FunFOLD methods have been used to aid functional elucidation.
Resumo:
Introduction: Ovarian adenocarcinoma is frequently detected at the late stage, when therapy efficacy is limited and death occurs in up to 50% of the cases. A potential novel treatment for this disease is a monoclonal antibody that recognizes phosphate transporter sodium-dependent phosphate transporter protein 2b (NaPi2b). Materials and Methods: To better understand the expression of this protein in different histologic types of ovarian carcinomas, we immunostained 50 tumor samples with anti-NaPi2b monoclonal antibody MX35 and, in parallel, we assessed the expression of the gene encoding NaPi2b (SCL34A2) by in silico analysis of microarray data. Results: Both approaches detected higher expression of NaPi2b (SCL34A2) in ovarian carcinoma than in normal tissue. Moreover, a comprehensive analysis indicates that SCL34A2 is the only gene of the several phosphate transporters genes whose expression differentiates normal from carcinoma samples, suggesting it might exert a major role in ovarian carcinomas. Immunohistochemical and mRNA expression data have also shown that 2 histologic subtypes of ovarian carcinoma express particularly high levels of NaPi2b: serous and clear cell adenocarcinomas. Serous adenocarcinomas are the most frequent, contrasting with clear cell carcinomas, rare, and with worse prognosis. Conclusion: This identification of subgroups of patients expressing NaPi2b may be important in selecting cohorts who most likely should be included in future clinical trials, as a recently generated humanized version of MX35 has been developed.
Resumo:
Proteomics approaches have made important contributions to the characterisation of platelet regulatory mechanisms. A common problem encountered with this method, however, is the masking of low-abundance (e.g. signalling) proteins in complex mixtures by highly abundant proteins. In this study, subcellular fractionation of washed human platelets either inactivated or stimulated with the glycoprotein (GP) VI collagen receptor agonist, collagen-related peptide, reduced the complexity of the platelet proteome. The majority of proteins identified by tandem mass spectrometry are involved in signalling. The effect of GPVI stimulation on levels of specific proteins in subcellular compartments was compared and analysed using in silico quantification, and protein associations were predicted using STRING (the search tool for recurring instances of neighbouring genes/proteins). Interestingly, we observed that some proteins that were previously unidentified in platelets including teneurin-1 and Van Gogh-like protein 1, translocated to the membrane upon GPVI stimulation. Newly identified proteins may be involved in GPVI signalling nodes of importance for haemostasis and thrombosis.
Resumo:
The identification of targets whose interaction is likely to result in the successful treatment of a disease is of growing interest for natural product scientists. In the current study we performed an exemplary application of a virtual parallel screening approach to identify potential targets for 16 secondary metabolites isolated and identified from the aerial parts of the medicinal plant RUTA GRAVEOLENS L. Low energy conformers of the isolated constituents were simultaneously screened against a set of 2208 pharmacophore models generated in-house for the IN SILICO prediction of putative biological targets, i. e., target fishing. Based on the predicted ligand-target interactions, we focused on three biological targets, namely acetylcholinesterase (AChE), the human rhinovirus (HRV) coat protein and the cannabinoid receptor type-2 (CB (2)). For a critical evaluation of the applied parallel screening approach, virtual hits and non-hits were assayed on the respective targets. For AChE the highest scoring virtual hit, arborinine, showed the best inhibitory IN VITRO activity on AChE (IC (50) 34.7 muM). Determination of the anti-HRV-2 effect revealed 6,7,8-trimethoxycoumarin and arborinine to be the most active antiviral constituents with IC (50) values of 11.98 muM and 3.19 muM, respectively. Of these, arborinine was predicted virtually. Of all the molecules subjected to parallel screening, one virtual CB (2) ligand was obtained, i. e., rutamarin. Interestingly, in experimental studies only this compound showed a selective activity to the CB (2) receptor ( Ki of 7.4 muM) by using a radioligand displacement assay. The applied parallel screening paradigm with constituents of R. GRAVEOLENS on three different proteins has shown promise as an IN SILICO tool for rational target fishing and pharmacological profiling of extracts and single chemical entities in natural product research.
Resumo:
Identification of protein interaction interfaces is very important for understanding the molecular mechanisms underlying biological phenomena. Here, we present a novel method for predicting protein interaction interfaces from sequences by using PAM matrix (PIFPAM). Sequence alignments for interacting proteins were constructed and parsed into segments using sliding windows. By calculating distance matrix for each segment, the correlation coefficients between segments were estimated. The interaction interfaces were predicted by extracting highly correlated segment pairs from the correlation map. The predictions achieved an accuracy 0.41-0.71 for eight intraprotein interaction examples, and 0.07-0.60 for four interprotein interaction examples. Compared with three previously published methods, PIFPAM predicted more contacting site pairs for 11 out of the 12 example proteins, and predicted at least 34% more contacting site pairs for eight proteins of them. The factors affecting the predictions were also analyzed. Since PIFPAM uses only the alignments of the two interacting proteins as input, it is especially useful when no three-dimensional protein structure data are available.
Resumo:
The enzyme UDP-galactose 4'-epimerase (GALE) catalyses the reversible epimerisation of both UDP-galactose and UDP-N-acetyl-galactosamine. Deficiency of the human enzyme (hGALE) is associated with type III galactosemia. The majority of known mutations in hGALE are missense and private thus making clinical guidance difficult. In this study a bioinformatics approach was employed to analyse the structural effects due to each mutation using both the UDP-glucose and UDP-N-acetylglucosamine bound structures of the wild-type protein. Changes to the enzyme's overall stability, substrate/cofactor binding and propensity to aggregate were also predicted. These predictions were found to be in good agreement with previous in vitro and in vivo studies when data was available and allowed for the differentiation of those mutants that severely impair the enzyme's activity against UDP-galactose. Next this combination of techniques were applied to another twenty-six reported variants from the NCBI dbSNP database that have yet to be studied to predict their effects. This identified p.I14T, p.R184H and p.G302R as likely severely impairing mutations. Although severely impaired mutants were predicted to decrease the protein's stability, overall predicted stability changes only weakly correlated with residual activity against UDP-galactose. This suggests other protein functions such as changes in cofactor and substrate binding may also contribute to the mechanism of impairment. Finally this investigation shows that this combination of different in silico approaches is useful in predicting the effects of mutations and that it could be the basis of an initial prediction of likely clinical severity when new hGALE mutants are discovered.
Resumo:
Fasciolosis is an important foodborne, zoonotic disease of livestock and humans, with global annual health and economic losses estimated at several billion US$. Fasciola hepatica is the major species in temperate regions, while F. gigantica dominates in the tropics. In the absence of commercially available vaccines to control fasciolosis, increasing reports of resistance to current chemotherapeutic strategies and the spread of fasciolosis into new areas, new functional genomics approaches are being used to identify potential new drug targets and vaccine candidates. The glutathione transferase (GST) superfamily is both a candidate drug and vaccine target. This study reports the identification of a putatively novel Sigma class GST, present in a water-soluble cytosol extract from the tropical liver fluke F. gigantica. The GST was cloned and expressed as an enzymically active recombinant protein. This GST shares a greater identity with the human schistosomiasis GST vaccine currently at Phase II clinical trials than previously discovered F. gigantica GSTs, stimulating interest in its immuno-protective properties. In addition, in silico analysis of the GST superfamily of both F. gigantica and F. hepatica has revealed an additional Mu class GST, Omega class GSTs, and for the first time, a Zeta class member.
Resumo:
Cardiac myocyte hypertrophy involves changes in cell structure and alterations in protein expression regulated at both the transcriptional and translational levels. Hypertrophic G protein-coupled receptor (GPCR) agonists such as endothelin-(ET-1) and phenylephrine stimulate a number of protein kinase cascades in the heart. Mitogen-activated protein kinase (MAPK) cascades stimulated include the extracellularly regulated kinase cascade, the stress-activated protein kinase/c-Jun N-terminal kinase cascade, and the p38 MAPK cascade. All 3 pathways have been implicated in hypertrophy, but recent ex vivo evidence also suggests that there may be additional effects on cell survival. ET-1 and phenylephrine also stimulate the protein kinase B pathway, and this may be involved in the regulation of protein synthesis by these agonists. Thus, protein kinase-mediated signaling may be important in the regulation of the development of myocyte hypertrophy.
Resumo:
Stimulation of phosphatidylinositol 3'-kinase (PI3K) and protein kinase B (PKB) is implicated in the regulation of protein synthesis in various cells. One mechanism involves PI3K/PKB-dependent phosphorylation of 4E-BP1, which dissociates from eIF4E, allowing initiation of translation from the 7-methylGTP cap of mRNAs. We examined the effects of insulin and H(2)O(2) on this pathway in neonatal cardiac myocytes. Cardiac myocyte protein synthesis was increased by insulin, but was inhibited by H(2)O(2). PI3K inhibitors attenuated basal levels of protein synthesis and inhibited the insulin-induced increase in protein synthesis. Insulin or H(2)O(2) increased the phosphorylation (activation) of PKB through PI3K, but, whereas insulin induced a sustained response, the response to H(2)O(2) was transient. 4E-BP1 was phosphorylated in unstimulated cells, and 4E-BP1 phosphorylation was increased by insulin. H(2)O(2) stimulated dephosphorylation of 4E-BP1 by increasing protein phosphatase (PP1/PP2A) activity. This increased the association of 4E-BP1 with eIF4E, consistent with H(2)O(2) inhibition of protein synthesis. The effects of H(2)O(2) were sufficient to override the stimulation of protein synthesis and 4E-BP1 phosphorylation induced by insulin. These results indicate that PI3K and PKB are important regulators of protein synthesis in cardiac myocytes, but other factors, including phosphatase activity, modulate the overall response.
Resumo:
Aquafeed production faces global issues related to availability of feed ingredients. Feed manufacturers require greater flexibility in order to develop nutritional and cost-effective formulations that take into account nutrient content and availability of ingredients. The search for appropriate ingredients requires detailed screening of their potential nutritional value and variability at the industrial level. In vitro digestion of feedstuffs by enzymes extracted from the target species has been correlated with apparent protein digestibility (APD) in fish and shrimp species. The present study verified the relationship between APD and in vitro degree of protein hydrolysis (DH) with Litopenaeus vannamei hepatopancreas enzymes in several different ingredients (n = 26): blood meals, casein, corn gluten meal, crab meal, distiller`s dried grains with solubles, feather meal, fish meals, gelatin, krill meals, poultry by-product meal, soybean meals, squid meals and wheat gluten. The relationship between APD and DH was further verified in diets formulated with these ingredients at 30% inclusion into a reference diet. APD was determined in vivo (30.1 +/- 0.5 degrees C, 32.2 +/- 0.4%.) with juvenile L vannamei (9 to 12 g) after placement of test ingredients into a reference diet (35 g kg(-1) CP: 8.03 g kg(-1) lipid; 2.01 kcal g(-1)) with chromic oxide as the inert marker. In vitro DH was assessed in ingredients and diets with standardized hepatopancreas enzymes extracted from pond-reared shrimp. The DH of ingredients was determined under different assay conditions to check for the most suitable in vitro protocol for APD prediction: different batches of enzyme extracts (HPf5 or HPf6), temperatures (25 or 30 degrees C) and enzyme activity (azocasein): crude protein ratios (4 U: 80 mg CP or 4 U: 40 mg CP). DH was not affected by ingredient proximate composition. APD was significantly correlated to DH in regressions considering either ingredients or diets. The relationships between APD and DH of the ingredients could be suitably adjusted to a Rational Function (y = (a + bx)/(1 + cx + dx2), n = 26. Best in vitro APD predictions were obtained at 25 degrees C, 4 U: 80 mg CP both for ingredients (R(2) = 0.86: P = 0.001) and test diets (R(2) = 0.96; P = 0.007). The regression model including all 26 ingredients generated higher prediction residuals (i.e., predicted APD - determined APD) for corn gluten meal, feather meal. poultry by-product meal and krill flour. The remaining test ingredients presented mean prediction residuals of 3.5 points. A model including only ingredients with APD>80% showed higher prediction precision (R(2) = 0.98: P = 0.000004; n = 20) with average residual of 1.8 points. Predictive models including only ingredients from the same origin (e.g., marine-based, R(2) = 0.98; P = 0.033) also displayed low residuals. Since in vitro techniques have been usually validated through regressions against in vivo APD, the DH predictive capacity may depend on the consistency of the in vivo methodology. Regressions between APD and DH suggested a close relationship between peptide bond breakage by hepatopancreas digestive proteases and the apparent nitrogen assimilation in shrimp, and this may be a useful tool to provide rapid nutritional information. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
The vast majority of known proteins have not yet been experimentally characterized and little is known about their function. The design and implementation of computational tools can provide insight into the function of proteins based on their sequence, their structure, their evolutionary history and their association with other proteins. Knowledge of the three-dimensional (3D) structure of a protein can lead to a deep understanding of its mode of action and interaction, but currently the structures of <1% of sequences have been experimentally solved. For this reason, it became urgent to develop new methods that are able to computationally extract relevant information from protein sequence and structure. The starting point of my work has been the study of the properties of contacts between protein residues, since they constrain protein folding and characterize different protein structures. Prediction of residue contacts in proteins is an interesting problem whose solution may be useful in protein folding recognition and de novo design. The prediction of these contacts requires the study of the protein inter-residue distances related to the specific type of amino acid pair that are encoded in the so-called contact map. An interesting new way of analyzing those structures came out when network studies were introduced, with pivotal papers demonstrating that protein contact networks also exhibit small-world behavior. In order to highlight constraints for the prediction of protein contact maps and for applications in the field of protein structure prediction and/or reconstruction from experimentally determined contact maps, I studied to which extent the characteristic path length and clustering coefficient of the protein contacts network are values that reveal characteristic features of protein contact maps. Provided that residue contacts are known for a protein sequence, the major features of its 3D structure could be deduced by combining this knowledge with correctly predicted motifs of secondary structure. In the second part of my work I focused on a particular protein structural motif, the coiled-coil, known to mediate a variety of fundamental biological interactions. Coiled-coils are found in a variety of structural forms and in a wide range of proteins including, for example, small units such as leucine zippers that drive the dimerization of many transcription factors or more complex structures such as the family of viral proteins responsible for virus-host membrane fusion. The coiled-coil structural motif is estimated to account for 5-10% of the protein sequences in the various genomes. Given their biological importance, in my work I introduced a Hidden Markov Model (HMM) that exploits the evolutionary information derived from multiple sequence alignments, to predict coiled-coil regions and to discriminate coiled-coil sequences. The results indicate that the new HMM outperforms all the existing programs and can be adopted for the coiled-coil prediction and for large-scale genome annotation. Genome annotation is a key issue in modern computational biology, being the starting point towards the understanding of the complex processes involved in biological networks. The rapid growth in the number of protein sequences and structures available poses new fundamental problems that still deserve an interpretation. Nevertheless, these data are at the basis of the design of new strategies for tackling problems such as the prediction of protein structure and function. Experimental determination of the functions of all these proteins would be a hugely time-consuming and costly task and, in most instances, has not been carried out. As an example, currently, approximately only 20% of annotated proteins in the Homo sapiens genome have been experimentally characterized. A commonly adopted procedure for annotating protein sequences relies on the "inheritance through homology" based on the notion that similar sequences share similar functions and structures. This procedure consists in the assignment of sequences to a specific group of functionally related sequences which had been grouped through clustering techniques. The clustering procedure is based on suitable similarity rules, since predicting protein structure and function from sequence largely depends on the value of sequence identity. However, additional levels of complexity are due to multi-domain proteins, to proteins that share common domains but that do not necessarily share the same function, to the finding that different combinations of shared domains can lead to different biological roles. In the last part of this study I developed and validate a system that contributes to sequence annotation by taking advantage of a validated transfer through inheritance procedure of the molecular functions and of the structural templates. After a cross-genome comparison with the BLAST program, clusters were built on the basis of two stringent constraints on sequence identity and coverage of the alignment. The adopted measure explicity answers to the problem of multi-domain proteins annotation and allows a fine grain division of the whole set of proteomes used, that ensures cluster homogeneity in terms of sequence length. A high level of coverage of structure templates on the length of protein sequences within clusters ensures that multi-domain proteins when present can be templates for sequences of similar length. This annotation procedure includes the possibility of reliably transferring statistically validated functions and structures to sequences considering information available in the present data bases of molecular functions and structures.
Resumo:
It is known that the phospholipids of the brain cells of fish are altered during cold adaptation. In particular, the 1-monounsaturated 2-polyunsaturated phosphatidylethanolamines (PEs) increase 2- to 3-fold upon adaptation to cold. One of the most striking changes is in the 18:1/22:6 species of PE. We determined how this lipid affected the bilayer-to-hexagonal-phase transition temperature of 16:1/16:1 PE. We found that it was more effective in lowering this transition temperature than were other, less unsaturated, PE species. In addition, it was not simply the presence of the 18:1/22:6 acyl chains which caused this effect, since the 18:1/22:6 species of phosphatidylcholine had the opposite effect on this transition temperature. Zwitterionic substances that lower the bilayer-to-hexagonal-phase transition temperature often cause an increase in the activity of protein kinase C (PKC). Indeed, the 18:1/22:6 PE caused an increase in the rate of histone phosphorylation by PKC which was greater than that caused by other, less unsaturated, PEs. The 18:1/22:6 phosphatidylcholine had no effect on this enzyme. The stimulation of the activity of PKC by the 18:1/22:6 PE is a consequence of this lipid's increasing the partitioning of PKC to the membrane.
Resumo:
Protein molecular motors are natural nano-machines that convert the chemical energy from the hydrolysis of adenosine triphosphate into mechanical work. These efficient machines are central to many biological processes, including cellular motion, muscle contraction and cell division. The remarkable energetic efficiency of the protein molecular motors coupled with their nano-scale has prompted an increasing number of studies focusing on their integration in hybrid micro- and nanodevices, in particular using linear molecular motors. The translation of these tentative devices into technologically and economically feasible ones requires an engineering, design-orientated approach based on a structured formalism, preferably mathematical. This contribution reviews the present state of the art in the modelling of protein linear molecular motors, as relevant to the future design-orientated development of hybrid dynamic nanodevices. © 2009 The Royal Society of Chemistry.