842 resultados para in-silico
Resumo:
CXCL-8 (Interleukin 8) is a CXC chemokine with a central role in the human immune response. We have undertaken extensive in silico analyses to elucidate the interactions of CXCL-8 with its various binding partners, which are crucial for its biological function. Sequence and structure analyses showed that residues in the thirdq β-sheet and basic residues in the heparin binding site are highly variable, while residues in the second β-sheet are highly conserved. Molecular dynamics simulations in aqueous solution of dimeric CXCL-8 have been performed with starting geometries from both X-ray and NMR structures showed shearing movements between the two antiparallel C-terminal helices. Dynamic conservation analyses of these simulations agreed with experimental data indicating that structural differences between the two structures at quaternary level arise from changes in the secondary structure of the N-terminal loop, the 310-helix, the 30s, 40s, and 50s loops and the third β-sheet, resulting in a different interhelical separation. Nevertheless, the observation of these different states indicates that CXCL-8 has the potential to undergo conformational changes, and it seems likely that this feature is relevant to the mode of binding of glycosaminoglycan (GAG) mimetics such as cyclitols. Simulations of the receptor peptide fragment−CXCL-8 complex identified several specific interactions of the receptor peptide with CXCL-8 that could be exploited in the structure-based design of competitive peptides and nonpeptidic molecules targeting CXCL-8 for combating inflammatory diseases. Simulations of the CXCL-8 dimer complexed with a 24-mer heparin fragment and of the CXCL-8−receptor peptide complex revealed that Arg60, Lys64, and Arg68 in the dimer bind to cyclitols in a horseshoe pattern, defining a region which is spatially distinct from the receptor binding site. There appears to be an optimum number of sulfates and an optimum length of alkyl spacers required for the interaction of cyclitol inhibitors with the dimeric form of CXCL-8. Calculation of the binding affinities of cyclitol inhibitors reflected satisfactorily the ranking of experimentally determined inhibitory potencies. The findings of these molecular modeling studies will help in the search for inhibitors which can modulate various CXCL-8 biological activities and serve as an excellent model system to study CXC-inhibitor interactions.
Resumo:
The incidence of human infections by the fungal pathogen Candida species has been increasing in recent years. Enolase is an essential protein in fungal metabolism. Sequence data is available for human and a number of medically important fungal species. An understanding of the structural and functional features of fungal enolases may provide the structural basis for their use as a target for the development of new anti-fungal drugs. We have obtained the sequence of the enolase of Candida krusei (C. krusei), as it is a significant medically important fungal pathogen. We have then used multiple sequence alignments with various enolase isoforms in order to identify C. krusei specific amino acid residues. The phylogenetic tree of enolases shows that the C. krusei enolase assembles on the tree with the fungal genes. Importantly, C. krusei lacks four amino acids in the active site compared to human enolase, as revealed by multiple sequence alignments. These differences in the substrate binding site may be exploited for the design of new anti-fungal drugs to selectively block this enzyme. The lack of the important amino acids in the active site also indicates that C. krusei enolase might have evolved as a member of a mechanistically diverse enolase superfamily catalying somewhat different reactions.
Resumo:
Genetic engineering of Bacillus thuringiensis (Bt) Cry proteins has resulted in the synthesis of various novel toxin proteins with enhanced insecticidal activity and specificity towards different insect pests. In this study, a fusion protein consisting of the DI–DII domains of Cry1Ac and garlic lectin (ASAL) has been designed in silico by replacing the DIII domain of Cry1Ac with ASAL. The binding interface between the DI–DII domains of Cry1Ac and lectin has been identified using protein–protein docking studies. Free energy of binding calculations and interaction profiles between the Cry1Ac and lectin domains confirmed the stability of fusion protein. A total of 18 hydrogen bonds was observed in the DI–DII–lectin fusion protein compared to 11 hydrogen bonds in the Cry1Ac (DI–DII–DIII) protein. Molecular mechanics/Poisson–Boltzmann (generalized-Born) surface area [MM/PB (GB) SA] methods were used for predicting free energy of interactions of the fusion proteins. Protein–protein docking studies based on the number of hydrogen bonds, hydrophobic interactions, aromatic–aromatic, aromatic–sulphur, cation–pi interactions and binding energy of Cry1Ac/fusion proteins with the aminopeptidase N (APN) of Manduca sexta rationalised the higher binding affinity of the fusion protein with the APN receptor compared to that of the Cry1Ac–APN complex, as predicted by ZDOCK, Rosetta and ClusPro analysis. The molecular binding interface between the fusion protein and the APN receptor is well packed, analogously to that of the Cry1Ac–APN complex. These findings offer scope for the design and development of customized fusion molecules for improved pest management in crop plants.
Resumo:
Antitubercular treatment is directed against actively replicating organisms. There is an urgent need to develop drugs targeting persistent subpopulations of Mycobacterium tuberculosis. The DevR response regulator is believed to play a key role in bacterial dormancy adaptation during hypoxia. We developed a homology-based model of DevR and used it for the rational design of inhibitors. A phenylcoumarin derivative (compound 10) identified by in silico pharmacophore-based screening of 2.5 million compounds employing protocols with some novel features including a water-based pharmacophore query, was characterized further. Compound 10 inhibited DevR binding to target DNA, down-regulated dormancy genes transcription, and drastically reduced survival of hypoxic but not nutrient-starved dormant bacteria or actively growing organ ` isms. Our findings suggest that compound 10 ``locks'' DevR in an inactive conformation that is unable to bind cognate DNA and induce the dormancy regulon. These results provide proof-of-concept for DevR as a novel target to develop molecules with sterilizing activity against tubercle bacilli.
Resumo:
The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool `PromPredict'. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.
Resumo:
Aiming to identify novel genetic variants and to confirm previously identified genetic variants associated with bone mineral density (BMD), we conducted a three-stage genome-wide association (GWA) meta-analysis in 27 061 study subjects. Stage 1 meta-analyzed seven GWA samples and 11 140 subjects for BMDs at the lumbar spine, hip and femoral neck, followed by a Stage 2 in silico replication of 33 SNPs in 9258 subjects, and by a Stage 3 de novo validation of three SNPs in 6663 subjects. Combining evidence from all the stages, we have identified two novel loci that have not been reported previously at the genome-wide significance (GWS; 5.0 × 10-8) level: 14q24.2 (rs227425, P-value 3.98 × 10-13, SMOC1) in the combined sample of males and females and 21q22.13 (rs170183, P-value 4.15 × 10-9, CLDN14) in the female-specific sample. The two newly identified SNPs were also significant in the GEnetic Factors for OSteoporosis consortium (GEFOS, n 5 32 960) summary results. We have also independently confirmed 13 previously reported loci at the GWS level: 1p36.12 (ZBTB40), 1p31.3 (GPR177), 4p16.3 (FGFRL1), 4q22.1 (MEPE), 5q14.3 (MEF2C), 6q25.1 (C6orf97, ESR1), 7q21.3 (FLJ42280, SHFM1), 7q31.31 (FAM3C, WNT16), 8q24.12 (TNFRSF11B), 11p15.3 (SOX6), 11q13.4 (LRP5), 13q14.11 (AKAP11) and 16q24 (FOXL1). Gene expression analysis in osteogenic cells implied potential functional association of the two candidate genes (SMOC1 and CLDN14) in bone metabolism. Our findings independently confirm previously identified biological pathways underlying bone metabolism and contribute to the discovery of novel pathways, thus providing valuable insights into the intervention and treatment of osteoporosis. © The Author 2013. Published by Oxford University Press.
Resumo:
Lysophosphatidic acid (LPA) acts as a signaling molecule that regulates diverse cellular processes and it can rapidly be metabolized by phosphatase and acyltransferase LPA phosphatase gene has not been identified and characterized in plants so far The BLAST search revealed that the At3g03520 is similar to phospholipase family. and distantly related to bacterial phosphatases The conserved motif. (J)4XXXNXSFD, was identified in both At3g03520 like phospholipases and acid phosphatases In silico expression analysis of At3g03520 revealed a high expression during phosphate starvation and abiotic stresses. This gene was overexpressed in Escherichia coli and shown to posses LPA specific phosphatase activity These results Suggest that this gene possibly plays a role in signal transduction and storage lipid synthesis.
Resumo:
Computational modelling of mechanisms underlying processes in the real world can be of great value in understanding complex biological behaviours. Uptake in general biology and ecology has been rapid. However, it often requires specific data sets that are overly costly in time and resources to collect. The aim of the current study was to test whether a generic behavioural ecology model constructed using published data could give realistic outputs for individual species. An individual-based model was developed using the Pattern-Oriented Modelling (POM) strategy and protocol, based on behavioural rules associated with insect movement choices. Frugivorous Tephritidae (fruit flies) were chosen because of economic significance in global agriculture and the multiple published data sets available for a range of species. The Queensland fruit fly (Qfly), Bactrocera tryoni, was identified as a suitable individual species for testing. Plant canopies with modified architecture were used to run predictive simulations. A field study was then conducted to validate our model predictions on how plant architecture affects fruit flies’ behaviours. Characteristics of plant architecture such as different shapes, e.g., closed-canopy and vase-shaped, affected fly movement patterns and time spent on host fruit. The number of visits to host fruit also differed between the edge and centre in closed-canopy plants. Compared to plant architecture, host fruit has less contribution to effects on flies’ movement patterns. The results from this model, combined with our field study and published empirical data suggest that placing fly traps in the upper canopy at the edge should work best. Such a modelling approach allows rapid testing of ideas about organismal interactions with environmental substrates in silico rather than in vivo, to generate new perspectives. Using published data provides a saving in time and resources. Adjustments for specific questions can be achieved by refinement of parameters based on targeted experiments.
Resumo:
Hantaviruses, members of the genus Hantavirus in the Bunyaviridae family, are enveloped single-stranded RNA viruses with tri-segmented genome of negative polarity. In humans, hantaviruses cause two diseases, hemorrhagic fever with renal syndrome (HFRS) and hantavirus pulmonary syndrome (HPS), which vary in severity depending on the causative agent. Each hantavirus is carried by a specific rodent host and is transmitted to humans through excreta of infected rodents. The genome of hantaviruses encodes four structural proteins: the nucleocapsid protein (N), the glycoproteins (Gn and Gc), and the polymerase (L) and also the nonstructural protein (NSs). This thesis deals with the functional characterization of hantavirus N protein with regard to its structure. Structural studies of the N protein have progressed slowly and the crystal structure of the whole protein is still not available, therefore biochemical assays coupled with bioinformatical modeling proved essential for studying N protein structure and functions. Presumably, during RNA encapsidation, the N protein first forms intermediate trimers and then oligomers. First, we investigated the role of N-terminal domain in the N protein oligomerization. The results suggested that the N-terminal region of the N protein forms a coiled-coil, in which two antiparallel alpha helices interact via their hydrophobic seams. Hydrophobic residues L4, I11, L18, L25 and V32 in the first helix and L44, V51, L58 and L65 in the second helix were crucial for stabilizing the structure. The results were consistent with the head-to-head, tail-to-tail model for hantavirus N protein trimerization. We demonstrated that an intact coiled-coil structure of the N terminus is crucial for the oligomerization capacity of the N protein. We also added new details to the head-to-head, tail-to-tail model of trimerization by suggesting that the initial step is based on interaction(s) between intact intra-molecular coiled-coils of the monomers. We further analyzed the importance of charged aa residues located within the coiled-coil for the N protein oligomerization. To predict the interacting surfaces of the monomers we used an upgraded in silico model of the coiled-coil domain that was docked into a trimer. Next the predicted target residues were mutated. The results obtained using the mammalian two-hybrid assay suggested that conserved charged aa residues within the coiled-coil make a substantial contribution to the N protein oligomerization. This contribution probably involves the formation of interacting surfaces of the N monomers and also stabilization of the coiled-coil via intramolecular ionic bridging. We proposed that the tips of the coiled-coils are the first to come into direct contact and thus initiate tight packing of the three monomers into a compact structure. This was in agreement with the previous results showing that an increase in ionic strength abolished the interaction between N protein molecules. We also showed that residues having the strongest effect on the N protein oligomerization are not scattered randomly throughout the coiled-coil 3D model structure, but form clusters. Next we found evidence for the hantaviral N protein interaction with the cytoplasmic tail of the glycoprotein Gn. In order to study this interaction we used the GST pull-down assay in combination with mutagenesis technique. The results demonstrated that intact, properly folded zinc fingers of the Gn protein cytoplasmic tail as well as the middle domain of the N protein (that includes aa residues 80 248 and supposedly carries the RNA-binding domain) are essential for the interaction. Since hantaviruses do not have a matrix protein that mediates the packaging of the viral RNA in other negatve stranded viruses (NSRV), hantaviral RNPs should be involved in a direct interaction with the intraviral domains of the envelope-embedded glycoproteins. By showing the N-Gn interaction we provided the evidence for one of the crucial steps in the virus replication at which RNPs are directed to the site of the virus assembly. Finally we started analysis of the N protein RNA-binding region, which is supposedly located in the middle domain of the N protein molecule. We developed a model for the initial step of RNA-binding by the hantaviral N protein. We hypothesized that the hantaviral N protein possesses two secondary structure elements that initiate the RNA encapsidation. The results suggest that amino acid residues (172-176) presumably act as a hook to catch vRNA and that the positively charged interaction surface (aa residues 144-160) enhances the initial N-RNA interacation. In conclusion, we elucidated new functions of hantavirus N protein. Using in silico modeling we predicted the domain structure of the protein and using experimental techniques showed that each domain is responsible for executing certain function(s). We showed that intact N terminal coiled-coil domain is crucial for oligomerization and charged residues located on its surface form a interaction surface for the N monomers. The middle domain is essential for interaction with the cytoplasmic tail of the Gn protein and RNA binding.
Resumo:
Veri-aivoeste suojelee aivoja verenkierron vierasaineilta. Veri-aivoestettä tutkivia in vivo ja in vitro -menetelmiä on raportoitu laajasti kirjallisuudessa. Yhdisteiden farmakokinetiikka aivoissa kuvaavia tietokonemalleja on esitetty vain muutamia. Tässä tutkimuksessa kerättiin kirjallisuudesta aineisto eri in vitro ja in vivo -menetelmillä määritetyistä veri-aivoesteen permeabiliteettikertoimista. Lisäksi tutkimuksessa rakennettiin kaksi veri-aivoesteen farmakokineettista tietokonemallia, mikrodialyysimalli ja efluksimalli. Mikrodialyysimalli on yksinkertainen kahdesta tilasta (verenkierto ja aivot) koostuva farmakokineettinen malli. Mikrodialyysimallilla simuloitiin in vivo määritettyjen parametrien perusteella viiden yhdisteen pitoisuuksia rotan aivoissa ja verenkierrossa. Mallilla ei saatu täsmällisesti in vivo -tilannetta vastaavia pitoisuuskuvaajia johtuen mallin rakenteessa tehdyistä yksinkertaistuksista, kuten aivokudostilan ja kuljetinproteiinien kinetiikan puuttuminen. Efluksimallissa on kolme tilaa, verenkierto, veri-aivoesteen endoteelisolutila ja aivot. Efluksimallilla tutkittiin teoreettisten simulaatioiden avulla veri-aivoesteen luminaalisella membraanilla sijaitsevan aktiivisen efluksiproteiinin ja passiivisen permeaation merkitystä yhdisteen pitoisuuksiin aivojen solunulkoisessa nesteessä. Tutkittava parametri oli vapaan yhdisteen pitoisuuksien suhde aivojen ja verenkierron välillä vakaassa tilassa (Kp,uu). Tuloksissa havaittiin efluksiproteiinin vaikutus pitoisuuksiin Michaelis-Mentenin kinetiikan mukaisesti. Efluksimalli sopii hyvin teoreettisten simulaatioiden tekemiseen. Malliin voidaan lisätä aktiivisia kuljettimia. Teoreettisten simulaatioiden avulla voidaan yhdistää in vitro ja in vivo tutkimuksien tuloksia ja osatekijöitä voidaan tutkia yhdessä simulaatiossa.
Resumo:
QSPR-malli kuvaa kvantitatiivista riippuvuutta muuttujien ja biologisen ominaisuuden välillä. Näin ollen QSPR mallit ovat käyttökelpoisia lääkekehityksen apuvälineitä. Kirjallisessa osassa kerrotaan sarveiskalvon, suoliston ja veriaivoesteen permeabiliteetin malleista. Useimmin käytettyjä muuttujia ovat yhdisteen rasvaliukoisuus, polaarinen pinta-ala, vetysidosten muodostuminen ja varaus. Myös yhdisteen koko vaikuttaa läpäisevyyteen, vaikka tutkimuksissa onkin erilaista tietoa tämän merkittävyydestä. Malliin vaikuttaa myös muiden kuin mallissa mukana olevien muuttujien suuruusluokka esimerkkinä Lipinskin ‖rule of 5‖ luokittelu. Tässä luokittelussa yhdisteen ominaisuus ei saa ylittää tiettyjä raja-arvoja. Muussa tapauksessa sen imeytyminen suun kautta otettuna todennäköisesti vaarantuu. Lisäksi kirjallisessa osassa tutustuttiin kuljetinproteiineihin ja niiden toimintaan silmän sarveiskalvossa, suolistossa ja veriaivoesteessä. Nykyisin on kehitetty erilaisia QSAR-malleja kuljetinproteiineille ennustamaan mahdollisten substraatittien tai inhibiittorien vuorovaikutuksia kuljetinproteiinin kanssa. Kokeellisen osan tarkoitus oli rakentaa in silico -malli sarveiskalvon passiiviselle permeabiliteetille. Työssä tehtiin QSPR-malli 54 yhdisteen ACDLabs-ohjelmalla laskettujen muuttujien arvojen avulla. Permeabiliteettikertoimien arvot saatiin kirjallisuudesta kanin sarveiskalvon läpäisevyystutkimuksista. Lopullisen mallin muuttujina käytettiin oktanoli-vesijakaantumiskerrointa (logD) pH:ssa 7,4 ja vetysidosatomien kokonaismäärää. Yhtälö oli muotoa log10(permeabiliteettikerroin) = -3,96791 - 0,177842Htotal + 0,311963logD(pH7,4). R2-korrelaatiokerroin oli 0,77 ja Q2-korrelaatiokerroin oli 0,75. Lopullisen mallin hyvyyttä arvioitiin 15 yhdisteen ulkoisella testijoukolla, jolloin ennustettua permeabiliteettia verrattiin kokeelliseen permeabiliteettiin. QSPR-malli arvioitiin myös farmakokineettisen simulaation avulla. Simulaatiossa laskettiin seitsemän yhdisteen kammionestepitoisuudet in vivo vakaassa tilassa käyttäen simulaatioissa QSPR mallilla ennustettuja permeabiliteettikertoimia. Lisäksi laskettiin sarveiskalvon imeytymisen nopeusvakio (Kc) 13 yhdisteelle farmakokineettisen simulaation avulla ja verrattiin tätä lopullisella mallilla ennustettuun permeabiliteettiin. Tulosten perusteella saatiin tilastollisesti hyvä QSPR-malli kuvaamaan sarveiskalvon passiivista permeabiliteettia, jolloin tätä mallia voidaan käyttää lääkekehityksen alkuvaiheessa. QSPR-malli ennusti permeabiliteettikertoimet hyvin, mikä nähtiin vertaamalla mallilla ennustettuja arvoja kokeellisiin tuloksiin. Lisäksi yhdisteiden kammionestepitoisuudet voitiin simuloida käyttäen apuna QSPR-mallilla ennustettuja permeabiliteettikertoimien arvoja.
Resumo:
The blood-brain barrier (BBB) is a unique barrier that strictly regulates the entry of endogenous substrates and xenobiotics into the brain. This is due to its tight junctions and the array of transporters and metabolic enzymes that are expressed. The determination of brain concentrations in vivo is difficult, laborious and expensive which means that there is interest in developing predictive tools of brain distribution. Predicting brain concentrations is important even in early drug development to ensure efficacy of central nervous system (CNS) targeted drugs and safety of non-CNS drugs. The literature review covers the most common current in vitro, in vivo and in silico methods of studying transport into the brain, concentrating on transporter effects. The consequences of efflux mediated by p-glycoprotein, the most widely characterized transporter expressed at the BBB, is also discussed. The aim of the experimental study was to build a pharmacokinetic (PK) model to describe p-glycoprotein substrate drug concentrations in the brain using commonly measured in vivo parameters of brain distribution. The possibility of replacing in vivo parameter values with their in vitro counterparts was also studied. All data for the study was taken from the literature. A simple 2-compartment PK model was built using the Stella™ software. Brain concentrations of morphine, loperamide and quinidine were simulated and compared with published studies. Correlation of in vitro measured efflux ratio (ER) from different studies was evaluated in addition to studying correlation between in vitro and in vivo measured ER. A Stella™ model was also constructed to simulate an in vitro transcellular monolayer experiment, to study the sensitivity of measured ER to changes in passive permeability and Michaelis-Menten kinetic parameter values. Interspecies differences in rats and mice were investigated with regards to brain permeability and drug binding in brain tissue. Although the PK brain model was able to capture the concentration-time profiles for all 3 compounds in both brain and plasma and performed fairly well for morphine, for quinidine it underestimated and for loperamide it overestimated brain concentrations. Because the ratio of concentrations in brain and blood is dependent on the ER, it is suggested that the variable values cited for this parameter and its inaccuracy could be one explanation for the failure of predictions. Validation of the model with more compounds is needed to draw further conclusions. In vitro ER showed variable correlation between studies, indicating variability due to experimental factors such as test concentration, but overall differences were small. Good correlation between in vitro and in vivo ER at low concentrations supports the possibility of using of in vitro ER in the PK model. The in vitro simulation illustrated that in the simulation setting, efflux is significant only with low passive permeability, which highlights the fact that the cell model used to measure ER must have low enough paracellular permeability to correctly mimic the in vivo situation.
Resumo:
The cis-regulatory regions on DNA serve as binding sites for proteins such as transcription factors and RNA polymerase. The combinatorial interaction of these proteins plays a crucial role in transcription initiation, which is an important point of control in the regulation of gene expression. We present here an analysis of the performance of an in silico method for predicting cis-regulatory regions in the plant genomes of Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) on the basis of free energy of DNA melting. For protein-coding genes, we achieve recall and precision of 96% and 42% for Arabidopsis and 97% and 31% for rice, respectively. For noncoding RNA genes, the program gives recall and precision of 94% and 75% for Arabidopsis and 95% and 90% for rice, respectively. Moreover, 96% of the false-positive predictions were located in noncoding regions of primary transcripts, out of which 20% were found in the first intron alone, indicating possible regulatory roles. The predictions for orthologous genes from the two genomes showed a good correlation with respect to prediction scores and promoter organization. Comparison of our results with an existing program for promoter prediction in plant genomes indicates that our method shows improved prediction capability.
Resumo:
An in silico approach was adopted to identify potential cyclooxygenase-2 inhibitors through molecular docking studies. The in vivo studies indicated that synthetic palmitoyl derivatives of salicylic acid, para amino phenol, para amino benzoic acid, and anthranilic acid possessed significant pharmacological activities like anti-inflammatory, analgesic, and antipyretic activities. None of the tested substances produced any significant gastric lesions in experimental animals. In an attempt to understand the ligandprotein interactions in terms of the binding affinity, the above synthetic molecules were subjected to docking analysis using AutoDock. The palmitoyl derivatives palmitoyl anthranilic acid, palmitoyl para amino benzoic acid, palmitoyl para amino phenol, and palmitoyl salicylic acid showed better binding energy than the known inhibitor diclofenac bound to 1PXX. All the palmitoyl derivatives made similar interactions with the binding site residues of cyclooxygenase-2 as compared to that of the known inhibitor. Thus, structure-based drug discovery approach was successfully employed to identify some promising pro-drugs for the treatment of pain and inflammation.
Resumo:
Convergence of the vast sequence space of proteins into a highly restricted fold/conformational space suggests a simple yet unique underlying mechanism of protein folding that has been the subject of much debate in the last several decades. One of the major challenges related to the understanding of protein folding or in silico protein structure prediction is the discrimination of non-native structures/decoys from the native structure. Applications of knowledge-based potentials to attain this goal have been extensively reported in the literature. Also, scoring functions based on accessible surface area and amino acid neighbourhood considerations were used in discriminating the decoys from native structures. In this article, we have explored the potential of protein structure network (PSN) parameters to validate the native proteins against a large number of decoy structures generated by diverse methods. We are guided by two principles: (a) the PSNs capture the local properties from a global perspective and (b) inclusion of non-covalent interactions, at all-atom level, including the side-chain atoms, in the network construction accommodates the sequence dependent features. Several network parameters such as the size of the largest cluster, community size, clustering coefficient are evaluated and scored on the basis of the rank of the native structures and the Z-scores. The network analysis of decoy structures highlights the importance of the global properties contributing to the uniqueness of native structures. The analysis also exhibits that the network parameters can be used as metrics to identify the native structures and filter out non-native structures/decoys in a large number of data-sets; thus also has a potential to be used in the protein `structure prediction' problem.