903 resultados para Scoring
Resumo:
We have developed CowLog, which is open-source software for recording behaviors from digital video and is easy to use and modify. CowLog tracks the time code from digital video files. The program is suitable for coding any digital video, but the authors have used it in animal research. The program has two main windows: a coding window, which is a graphical user interface used for choosing video files and defining output files that also has buttons for scoring behaviors, and a video window, which displays the video used for coding. The windows can be used in separate displays. The user types the key codes for the predefined behavioral categories, and CowLog transcribes their timing from the video time code to a data file. CowLog comes with an additional feature, an R package called Animal, for elementary analyses of the data files. With the analysis package, the user can calculate the frequencies, bout durations, and total durations of the coded behaviors and produce summary plots from the data.
Resumo:
The main purpose of revascularization procedures for critical limb ischaemia (CLI) is to preserve the leg and sustain the patient s ambulatory status. Other goals are ischaemic pain relief and healing of ischaemic ulcers. Patients with CLI are usually old and have several comorbidities affecting the outcome. Revascularization for CLI is meaningless unless both life and limb are preserved. Therefore, the knowledge of both patient- and bypass-related risk factors is of paramount importance in clinical decision-making, patient selection and resource allocation. The aim of this study was to identify patient- and graft-related predictors of impaired outcome after infrainguinal bypass for CLI. The purpose was to assess the outcome of high-risk patients undergoing infrainguinal bypass and to evaluate the usefulness of specific risk scoring methods. The results of bypasses in the absence of optimal vein graft material were also evaluated, and the feasibility of the new method of scaffolding suboptimal vein grafts was assessed. The results of this study showed that renal insufficiency - not only renal failure but also moderate impairment in renal function - seems to be a significant risk factor for both limb loss and death after infrainguinal bypass in patients with CLI. Low estimated GFR (PIENEMPI KUIN 30 ml/min/1.73 m2) is a strong independent marker of poor prognosis. Furthermore, estimated GFR is a more accurate predictor of survival and leg salvage after infrainguinal bypass in CLI patients than serum creatinine level alone. We also found out that the life expectancy of octogenarians with CLI is short. In this patient group endovascular revascularization is associated with a better outcome than bypass in terms of survival, leg salvage and amputation-free survival especially in presence of coronary artery disease. This study was the first one to demonstrate that Finnvasc and modified Prevent III risk scoring methods both predict the long-term outcome of patients undergoing both surgical and endovascular infrainguinal revascularization for CLI. Both risk scoring methods are easy to use and might be helpful in clinical practice as an aid in preoperative patient selection and decision-making. Similarly than in previous studies, we found out that a single-segment great saphenous vein graft is superior to any other autologous vein graft in terms of mid-term patency and leg salvage. However, if optimal vein graft is lacking, arm vein conduits are superior to prosthetic grafts especially in infrapopliteal bypasses for CLI. We studied also the new method of scaffolding suboptimal quality vein grafts and found out that this method may enable the use of vein grafts of compromised quality otherwise unsuitable for bypass grafting. The remarkable finding was that patients with the combination of high operative risk due to severe comorbidities and risk graft have extremely poor survival, suggesting that only relatively fit patients should undergo complex bypasses with risk grafts. The results of this study can be used in clinical practice as an aid in preoperative patient selection and decision-making. In the future, the need of vascular surgery will increase significantly as the elderly and diabetic population increases, which emphasises the importance of focusing on those patients that will gain benefit from infrainguinal bypass. Therefore, the individual risk of the patient, ambulatory status, outcome expectations, the risk of bypass procedure as well as technical factors such as the suitability of outflow anatomy and the available vein material should all be assessed and taken into consideration when deciding on the best revascularization strategy.
Resumo:
We propose an efficient and parameter-free scoring criterion, the factorized conditional log-likelihood (ˆfCLL), for learning Bayesian network classifiers. The proposed score is an approximation of the conditional log-likelihood criterion. The approximation is devised in order to guarantee decomposability over the network structure, as well as efficient estimation of the optimal parameters, achieving the same time and space complexity as the traditional log-likelihood scoring criterion. The resulting criterion has an information-theoretic interpretation based on interaction information, which exhibits its discriminative nature. To evaluate the performance of the proposed criterion, we present an empirical comparison with state-of-the-art classifiers. Results on a large suite of benchmark data sets from the UCI repository show that ˆfCLL-trained classifiers achieve at least as good accuracy as the best compared classifiers, using significantly less computational resources.
Resumo:
Kalateollisuus ja kalakauppa tarvitsisivat menetelmän, jolla kalan säilyvyyttä voitaisiin arvioida reaaliaikaisesti ja luotettavasti. Laatuindeksimenetelmä QIM® (engl. Quality Index Method) on käytössä jo useassa Euroopan maassa useille eri kalalajeille. QIM® pyrkii lajikohtaisesti ennustamaan aistinvaraisten ominaisuuksien muutoksien perusteella jäljellä olevaa säilyvyysaikaa. Työn tavoitteena oli luoda QIM® meressä kasvatetulle siialle. Tämä on ensimmäinen suomalaiselle kalalle tehtävä QIM®, ja tavoitteena on saada meressä kasvatetun siian QIM® myös viralliseen QIM®-käsikirjaan. Tutkimus tehtiin Elintarviketurvallisuusvirasto Eviralle. Tutkittavat kalat Evira osti Kalatukku E. Eriksson Oy:ltä. QIM®-tuloksen tueksi tutkittiin yhden erän pH ja indikoitiin pilaantuminen myös mikrobiologisesti. Luotiin myös kasvatetun siian profiili (arvioijia 13). Itse tutkimusosassa kaksi profiiliraatia (n = 9) ja QIM®-raati (n = 5) arvioivat raa’an ja kypsän kalan. Tulos varmistettiin myös aistinvaraisella kalan laadunarviointimenetelmällä (Evira 8001). QIM®-raati loi QIM®-luonnoksen ja luonnoksen toimivuutta testattiin. Kiinteänä osana työn toteutusta oli myös eri vaiheiden muutosten valokuvaus. Tämän tutkimuksen mukaan luotu QIM®-luonnos on toimiva pohja validoitaessaa QIM®-menetelmää siialle. Voidaan myös todeta, että QIM® soveltuu meressä kasvatetulle siialle. Kypsän kalan aistinvaraisella laadunarviointimenetelmällä (Evira 8001) analysoitiin säilytysajankohta, jolloin kypsästä kalasta voitiin todeta kalan kauppakelvottomuus – tätä pidettiin ajanhetkenä, jolloin raa’an kalan tutkiminen voitiin lopettaa. Tutkimuksessa käytetty mikrobiologinen menetelmä ”Mikrobien lukumäärän määrittäminen” (Evira 3420/1) korreloi QIM®-tuloksen kanssa; kalanäyte oli tässä tutkimuksessa niin mikrobiologisesti arvioituna kuin laatuindeksinkin mukaan käyttökelvotonta viidentenätoista päivänä. Tutkittujen kalojen pH-arvoja ei voitu verrata laatuindeksiin, sillä tässä tutkimuksessa mitattujen pH-arvojen tuloksista ei voitu päätellä pilaantumisen etenemistä. Kun QIM® meressä kasvatetulle siialle on validoitu (Evira), valmista meressä kasvatetun siian QI-menetelmää voidaan hyödyntää jatkossa Suomen kalateollisuudessa ja -markkinoilla. Olisi hyvä, jos QIM® luotaisiin myös muille Suomen yleisimmille kauppakaloille, jotta pakkauksiin merkityt viimeiset käyttöpäivät perustuisivat yhteen yhteiseen menetelmään ja näin viimeisellä käyttöpäivällä olisi tieteellinen pohja.
Resumo:
A successful protein-protein docking study culminates in identification of decoys at top ranks with near-native quaternary structures. However, this task remains enigmatic because no generalized scoring functions exist that effectively infer decoys according to the similarity to near-native quaternary structures. Difficulties arise because of the highly irregular nature of the protein surface and the significant variation of the nonbonding and solvation energies based on the chemical composition of the protein-protein interface. In this work, we describe a novel method combining an interface-size filter, a regression model for geometric compatibility (based on two correlated surface and packing parameters), and normalized interaction energy (calculated from correlated nonbonded and solvation energies), to effectively rank decoys from a set of 10,000 decoys. Tests on 30 unbound binary protein-protein complexes show that in 16 cases we can identify at least one decoy in top three ranks having <= 10 angstrom backbone root mean square deviation from true binding geometry. Comparisons with other state-of-art methods confirm the improved ranking power of our method without the use of any experiment-guided restraints, evolutionary information, statistical propensities, or modified interaction energy equations. Tests on 118 less-difficult bound binary protein-protein complexes with <= 35% sequence redundancy at the interface showed that in 77% cases, at least 1 in 10,000 decoys were identified with <= 5 angstrom backbone root mean square deviation from true geometry at first rank. The work will promote the use of new concepts where correlations among parameters provide more robust scoring models. It will facilitate studies involving molecular interactions, including modeling of large macromolecular assemblies and protein structure prediction. (C) 2010 Wiley Periodicals, Inc. J Comput Chem 32: 787-796, 2011.
Resumo:
The protein-protein docking programs typically perform four major tasks: (i) generation of docking poses, (ii) selecting a subset of poses, (iii) their structural refinement and (iv) scoring, ranking for the final assessment of the true quaternary structure. Although the tasks can be integrated or performed in a serial order, they are by nature modular, allowing an opportunity to substitute one algorithm with another. We have implemented two modular web services, (i) PRUNE: to select a subset of docking poses generated during sampling search (http://pallab.serc.iisc.ernet.in/prune) and (ii) PROBE: to refine, score and rank them (http://pallab.serc.iisc.ernet.in/probe). The former uses a new interface area based edge-scoring function to eliminate > 95% of the poses generated during docking search. In contrast to other multi-parameter-based screening functions, this single parameter based elimination reduces the computational time significantly, in addition to increasing the chances of selecting native-like models in the top rank list. The PROBE server performs ranking of pruned poses, after structure refinement and scoring using a regression model for geometric compatibility, and normalized interaction energy. While web-service similar to PROBE is infrequent, no web-service akin to PRUNE has been described before. Both the servers are publicly accessible and free for use.
Resumo:
Membrane proteins are involved in a number of important biological functions. Yet, they are poorly understood from the structure and folding point of view. The external environment being drastically different from that of globular proteins, the intra-protein interactions in membrane proteins are also expected to be different. Hence, statistical potentials representing the features of inter-residue interactions based exclusively on the structures of membrane proteins are much needed. Currently, a reasonable number of structures are available, making it possible to undertake such an analysis on membrane proteins. In this study we have examined the inter-residue interaction propensities of amino acids in the membrane spanning regions of the alpha-helical membrane (HM) proteins. Recently we have shown that valuable information can be obtained on globular proteins by the evaluation of the pair-wise interactions of amino acids by classifying them into different structural environments, based on factors such as the secondary structure or the number of contacts that a residue can make. Here we have explored the possible ways of classifying the intra-protein environment of HM proteins and have developed scoring functions based on different classification schemes. On evaluation of different schemes, we find that the scheme which classifies amino acids to different intra-contact environment is the most promising one. Based on this classification scheme, we also redefine the hydrophobicity scale of amino acids in HM proteins.
Resumo:
A fundamental task in bioinformatics involves a transfer of knowledge from one protein molecule onto another by way of recognizing similarities. Such similarities are obtained at different levels, that of sequence, whole fold, or important substructures. Comparison of binding sites is important to understand functional similarities among the proteins and also to understand drug cross-reactivities. Current methods in literature have their own merits and demerits, warranting exploration of newer concepts and algorithms, especially for large-scale comparisons and for obtaining accurate residue-wise mappings. Here, we report the development of a new algorithm, PocketAlign, for obtaining structural superpositions of binding sites. The software is available as a web-service at http://proline.physicslisc.emetin/pocketalign/. The algorithm encodes shape descriptors in the form of geometric perspectives, supplemented by chemical group classification. The shape descriptor considers several perspectives with each residue as the focus and captures relative distribution of residues around it in a given site. Residue-wise pairings are computed by comparing the set of perspectives of the first site with that of the second, followed by a greedy approach that incrementally combines residue pairings into a mapping. The mappings in different frames are then evaluated by different metrics encoding the extent of alignment of individual geometric perspectives. Different initial seed alignments are computed, each subsequently extended by detecting consequential atomic alignments in a three-dimensional grid, and the best 500 stored in a database. Alignments are then ranked, and the top scoring alignments reported, which are then streamed into Pymol for visualization and analyses. The method is validated for accuracy and sensitivity and benchmarked against existing methods. An advantage of PocketAlign, as compared to some of the existing tools available for binding site comparison in literature, is that it explores different schemes for identifying an alignment thus has a better potential to capture similarities in ligand recognition abilities. PocketAlign, by finding a detailed alignment of a pair of sites, provides insights as to why two sites are similar and which set of residues and atoms contribute to the similarity.
Resumo:
Convergence of the vast sequence space of proteins into a highly restricted fold/conformational space suggests a simple yet unique underlying mechanism of protein folding that has been the subject of much debate in the last several decades. One of the major challenges related to the understanding of protein folding or in silico protein structure prediction is the discrimination of non-native structures/decoys from the native structure. Applications of knowledge-based potentials to attain this goal have been extensively reported in the literature. Also, scoring functions based on accessible surface area and amino acid neighbourhood considerations were used in discriminating the decoys from native structures. In this article, we have explored the potential of protein structure network (PSN) parameters to validate the native proteins against a large number of decoy structures generated by diverse methods. We are guided by two principles: (a) the PSNs capture the local properties from a global perspective and (b) inclusion of non-covalent interactions, at all-atom level, including the side-chain atoms, in the network construction accommodates the sequence dependent features. Several network parameters such as the size of the largest cluster, community size, clustering coefficient are evaluated and scored on the basis of the rank of the native structures and the Z-scores. The network analysis of decoy structures highlights the importance of the global properties contributing to the uniqueness of native structures. The analysis also exhibits that the network parameters can be used as metrics to identify the native structures and filter out non-native structures/decoys in a large number of data-sets; thus also has a potential to be used in the protein `structure prediction' problem.
Resumo:
Over the past two decades, many ingenious efforts have been made in protein remote homology detection. Because homologous proteins often diversify extensively in sequence, it is challenging to demonstrate such relatedness through entirely sequence-driven searches. Here, we describe a computational method for the generation of `protein-like' sequences that serves to bridge gaps in protein sequence space. Sequence profile information, as embodied in a position-specific scoring matrix of multiply aligned sequences of bona fide family members, serves as the starting point in this algorithm. The observed amino acid propensity and the selection of a random number dictate the selection of a residue for each position in the sequence. In a systematic manner, and by applying a `roulette-wheel' selection approach at each position, we generate parent family-like sequences and thus facilitate an enlargement of sequence space around the family. When generated for a large number of families, we demonstrate that they expand the utility of natural intermediately related sequences in linking distant proteins. In 91% of the assessed examples, inclusion of designed sequences improved fold coverage by 5-10% over searches made in their absence. Furthermore, with several examples from proteins adopting folds such as TIM, globin, lipocalin and others, we demonstrate that the success of including designed sequences in a database positively sensitized methods such as PSI-BLAST and Cascade PSI-BLAST and is a promising opportunity for enormously improved remote homology recognition using sequence information alone.
Resumo:
Information diffusion and influence maximization are important and extensively studied problems in social networks. Various models and algorithms have been proposed in the literature in the context of the influence maximization problem. A crucial assumption in all these studies is that the influence probabilities are known to the social planner. This assumption is unrealistic since the influence probabilities are usually private information of the individual agents and strategic agents may not reveal them truthfully. Moreover, the influence probabilities could vary significantly with the type of the information flowing in the network and the time at which the information is propagating in the network. In this paper, we use a mechanism design approach to elicit influence probabilities truthfully from the agents. Our main contribution is to design a scoring rule based mechanism in the context of the influencer-influencee model. In particular, we show the incentive compatibility of the mechanisms and propose a reverse weighted scoring rule based mechanism as an appropriate mechanism to use.
Resumo:
Ferric uptake regulator (Fur) is a transcriptional regulator controlling the expression of genes involved in iron homeostasis and plays an important role in pathogenesis. Fur-regulated sRNAs/CDSs were found to have upstream Fur Binding Sites (FBS). We have constructed a Positional Weight Matrix from 100 known FBS (19 nt) and tracked the `Orphan' FBSs. Possible Fur regulated sRNAs and CDSs were identified by comparing their genomic locations with the `Orphan' FBSs identified. Thirty-eight `novel' and all known Fur regulated sRNAs in nine proteobacteria were identified. In addition, we identified high scoring FBSs in the promoter regions of the 304 CDSs and 68 of them were involved in siderophore biosynthesis, iron-transporters, two-component system, starch/sugar metabolism, sulphur/methane metabolism, etc. The present study shows that the Fur regulator controls the expression of genes involved in diverse metabolic activities and it is not limited to iron metabolism alone. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
Most of the biological processes are governed through specific protein-ligand interactions. Discerning different components that contribute toward a favorable protein-ligand interaction could contribute significantly toward better understanding protein function, rationalizing drug design and obtaining design principles for protein engineering. The Protein Data Bank (PDB) currently hosts the structure of similar to 68 000 protein-ligand complexes. Although several databases exist that classify proteins according to sequence and structure, a mere handful of them annotate and classify protein-ligand interactions and provide information on different attributes of molecular recognition. In this study, an exhaustive comparison of all the biologically relevant ligand-binding sites (84 846 sites) has been conducted using PocketMatch: a rapid, parallel, in-house algorithm. PocketMatch quantifies the similarity between binding sites based on structural descriptors and residue attributes. A similarity network was constructed using binding sites whose PocketMatch scores exceeded a high similarity threshold (0.80). The binding site similarity network was clustered into discrete sets of similar sites using the Markov clustering (MCL) algorithm. Furthermore, various computational tools have been used to study different attributes of interactions within the individual clusters. The attributes can be roughly divided into (i) binding site characteristics including pocket shape, nature of residues and interaction profiles with different kinds of atomic probes, (ii) atomic contacts consisting of various types of polar, hydrophobic and aromatic contacts along with binding site water molecules that could play crucial roles in protein-ligand interactions and (iii) binding energetics involved in interactions derived from scoring functions developed for docking. For each ligand-binding site in each protein in the PDB, site similarity information, clusters they belong to and description of site attributes are provided as a relational database-protein-ligand interaction clusters (PLIC).
Resumo:
m-AMSA, an established inhibitor of eukaryotic type II topoisomerases, exerts its cidal effect by binding to the enzyme-DNA complex thus inhibiting the DNA religation step. The molecule and its analogues have been successfully used as chemotherapeutic agents against different forms of cancer. After virtual screening using a homology model of the Mycobacterium tuberculosis topoisomerase I, we identified m-AMSA as a high scoring hit. We demonstrate that m-AMSA can inhibit the DNA relaxation activity of topoisomerase I from M. tuberculosis and Mycobacterium smegmatis. In a whole cell assay, m-AMSA inhibited the growth of both the mycobacteria. (C) 2014 Elsevier Inc. All rights reserved.
Resumo:
The problem of bipartite ranking, where instances are labeled positive or negative and the goal is to learn a scoring function that minimizes the probability of mis-ranking a pair of positive and negative instances (or equivalently, that maximizes the area under the ROC curve), has been widely studied in recent years. A dominant theoretical and algorithmic framework for the problem has been to reduce bipartite ranking to pairwise classification; in particular, it is well known that the bipartite ranking regret can be formulated as a pairwise classification regret, which in turn can be upper bounded using usual regret bounds for classification problems. Recently, Kotlowski et al. (2011) showed regret bounds for bipartite ranking in terms of the regret associated with balanced versions of the standard (non-pairwise) logistic and exponential losses. In this paper, we show that such (non-pairwise) surrogate regret bounds for bipartite ranking can be obtained in terms of a broad class of proper (composite) losses that we term as strongly proper. Our proof technique is much simpler than that of Kotlowski et al. (2011), and relies on properties of proper (composite) losses as elucidated recently by Reid and Williamson (2010, 2011) and others. Our result yields explicit surrogate bounds (with no hidden balancing terms) in terms of a variety of strongly proper losses, including for example logistic, exponential, squared and squared hinge losses as special cases. An important consequence is that standard algorithms minimizing a (non-pairwise) strongly proper loss, such as logistic regression and boosting algorithms (assuming a universal function class and appropriate regularization), are in fact consistent for bipartite ranking; moreover, our results allow us to quantify the bipartite ranking regret in terms of the corresponding surrogate regret. We also obtain tighter surrogate bounds under certain low-noise conditions via a recent result of Clemencon and Robbiano (2011).