982 resultados para functional prediction
Resumo:
Bioinformatics, in the last few decades, has played a fundamental role to give sense to the huge amount of data produced. Obtained the complete sequence of a genome, the major problem of knowing as much as possible of its coding regions, is crucial. Protein sequence annotation is challenging and, due to the size of the problem, only computational approaches can provide a feasible solution. As it has been recently pointed out by the Critical Assessment of Function Annotations (CAFA), most accurate methods are those based on the transfer-by-homology approach and the most incisive contribution is given by cross-genome comparisons. In the present thesis it is described a non-hierarchical sequence clustering method for protein automatic large-scale annotation, called “The Bologna Annotation Resource Plus” (BAR+). The method is based on an all-against-all alignment of more than 13 millions protein sequences characterized by a very stringent metric. BAR+ can safely transfer functional features (Gene Ontology and Pfam terms) inside clusters by means of a statistical validation, even in the case of multi-domain proteins. Within BAR+ clusters it is also possible to transfer the three dimensional structure (when a template is available). This is possible by the way of cluster-specific HMM profiles that can be used to calculate reliable template-to-target alignments even in the case of distantly related proteins (sequence identity < 30%). Other BAR+ based applications have been developed during my doctorate including the prediction of Magnesium binding sites in human proteins, the ABC transporters superfamily classification and the functional prediction (GO terms) of the CAFA targets. Remarkably, in the CAFA assessment, BAR+ placed among the ten most accurate methods. At present, as a web server for the functional and structural protein sequence annotation, BAR+ is freely available at http://bar.biocomp.unibo.it/bar2.0.
Resumo:
High-throughput assays, such as yeast two-hybrid system, have generated a huge amount of protein-protein interaction (PPI) data in the past decade. This tremendously increases the need for developing reliable methods to systematically and automatically suggest protein functions and relationships between them. With the available PPI data, it is now possible to study the functions and relationships in the context of a large-scale network. To data, several network-based schemes have been provided to effectively annotate protein functions on a large scale. However, due to those inherent noises in high-throughput data generation, new methods and algorithms should be developed to increase the reliability of functional annotations. Previous work in a yeast PPI network (Samanta and Liang, 2003) has shown that the local connection topology, particularly for two proteins sharing an unusually large number of neighbors, can predict functional associations between proteins, and hence suggest their functions. One advantage of the work is that their algorithm is not sensitive to noises (false positives) in high-throughput PPI data. In this study, we improved their prediction scheme by developing a new algorithm and new methods which we applied on a human PPI network to make a genome-wide functional inference. We used the new algorithm to measure and reduce the influence of hub proteins on detecting functionally associated proteins. We used the annotations of the Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG) as independent and unbiased benchmarks to evaluate our algorithms and methods within the human PPI network. We showed that, compared with the previous work from Samanta and Liang, our algorithm and methods developed in this study improved the overall quality of functional inferences for human proteins. By applying the algorithms to the human PPI network, we obtained 4,233 significant functional associations among 1,754 proteins. Further comparisons of their KEGG and GO annotations allowed us to assign 466 KEGG pathway annotations to 274 proteins and 123 GO annotations to 114 proteins with estimated false discovery rates of <21% for KEGG and <30% for GO. We clustered 1,729 proteins by their functional associations and made pathway analysis to identify several subclusters that are highly enriched in certain signaling pathways. Particularly, we performed a detailed analysis on a subcluster enriched in the transforming growth factor β signaling pathway (P<10-50) which is important in cell proliferation and tumorigenesis. Analysis of another four subclusters also suggested potential new players in six signaling pathways worthy of further experimental investigations. Our study gives clear insight into the common neighbor-based prediction scheme and provides a reliable method for large-scale functional annotations in this post-genomic era.
Resumo:
Pseudomonas sp. strain B13 is a bacterium known to degrade chloroaromatic compounds. The properties to use 3- and 4-chlorocatechol are determined by a self-transferable DNA element, the clc element, which normally resides at two locations in the cell's chromosome. Here we report the complete nucleotide sequence of the clc element, demonstrating the unique catabolic properties while showing its relatedness to genomic islands and integrative and conjugative elements rather than to other known catabolic plasmids. As far as catabolic functions, the clc element harbored, in addition to the genes for chlorocatechol degradation, a complete functional operon for 2-aminophenol degradation and genes for a putative aromatic compound transport protein and for a multicomponent aromatic ring dioxygenase similar to anthranilate hydroxylase. The genes for catabolic functions were inducible under various conditions, suggesting a network of catabolic pathway induction. For about half of the open reading frames (ORFs) on the clc element, no clear functional prediction could be given, although some indications were found for functions that were similar to plasmid conjugation. The region in which these ORFs were situated displayed a high overall conservation of nucleotide sequence and gene order to genomic regions in other recently completed bacterial genomes or to other genomic islands. Most notably, except for two discrete regions, the clc element was almost 100% identical over the whole length to a chromosomal region in Burkholderia xenovorans LB400. This indicates the dynamic evolution of this type of element and the continued transition between elements with a more pathogenic character and those with catabolic properties.
Resumo:
CONTEXT: Recent magnetic resonance imaging studies have attempted to relate volumetric brain measurements in early schizophrenia to clinical and functional outcome some years later. These studies have generally been negative, perhaps because gray and white matter volumes inaccurately assess the underlying dysfunction that might be predictive of outcome. OBJECTIVE: To investigate the predictive value of frontal and temporal spectroscopy measures for outcome in patients with first-episode psychoses. DESIGN: Left prefrontal cortex and left mediotemporal lobe voxels were assessed using proton magnetic resonance spectroscopy to provide the ratio of N-acetylaspartate (NAA) and choline-containing compounds to creatine and phosphocreatine (Cr) (NAA/Cr ratio). These data were used to predict outcome at 18 months after admission, as assessed by a systematic medical record audit. SETTING: Early psychosis clinic. PARTICIPANTS: Forty-six patients with first-episode psychosis. MAIN OUTCOME MEASURES: We used regression models that included age at imaging and duration of untreated psychosis to predict outcome scores on the Global Assessment of Functioning Scale, Clinical Global Impression scales, and Social and Occupational Functional Assessment Scale, as well as the number of admissions during the treatment period. We then further considered the contributions of premorbid function and baseline level of negative symptoms. RESULTS: The only spectroscopic predictor of outcome was the NAA/Cr ratio in the prefrontal cortex. Low scores on this variable were related to poorer outcome on all measures. In addition, the frontal NAA/Cr ratio explained 17% to 30% of the variance in outcome. CONCLUSIONS: Prefrontal neuronal dysfunction is an inconsistent feature of early psychosis; rather, it is an early marker of poor prognosis across the first years of illness. The extent to which this can be used to guide treatment and whether it predicts outcome some years after first presentation are questions for further research.
Resumo:
Overall introduction.- Longitudinal studies have been designed to investigate prospectively, from their beginning, the pathway leading from health to frailty and to disability. Knowledge about determinants of healthy ageing and health behaviour (resources) as well as risks of functional decline is required to propose appropriate preventative interventions. The functional status in older people is important considering clinical outcome in general, healthcare need and mortality. Part I.- Results and interventions from lucas (longitudinal urban cohort ageing study). Authors.- J. Anders, U. Dapp, L. Neumann, F. Pröfener, C. Minder, S. Golgert, A. Daubmann, K. Wegscheider,. W. von Renteln-Kruse Methods.- The LUCAS core project is a longitudinal cohort of urban community-dwelling people 60 years and older, recruited in 2000/2001. Further LUCAS projects are cross-sectional comparative and interventional studies (RCT). Results.- The emphasis will be on geriatric medical care in a population-based approach, discussing different forms of access, too. (Dapp et al. BMC Geriatrics 2012, 12:35; http://www.biomedcentral.com/1471-2318/12/35): - longitudinal data from the LUCAS urban cohort (n = 3.326) will be presented covering 10 years of observation, including the prediction of functional decline, need of nursing care, and mortality by using a self-filling screening tool; - interventions to prevent functional decline do focus on first (pre-clinical) signs of pre-frailty before entering the frailty-cascade ("Active Health Promotion in Old Age", "geriatric mobility centre") or disability ("home visits"). Conclusions.- The LUCAS research consortium was established to study particular aspects of functional competence, its changes with ageing, to detect pre-clinical signs of functional decline, and to address questions on how to maintain functional competence and to prevent adverse outcome in different settings. The multidimensional data base allows the exploration of several further questions. Gait performance was exmined by GAITRite®-System. Supported by the Federal Ministry for Education and Research (BMBF Funding No. 01ET1002A). Part II.- Selected results from the lausanne cohort 65+ (Lc65 + ) Study (Switzerland). Authors.- Prof Santos-Eggimann Brigitte, Dr Seematter-Bagnoud Laurence, Prof Büla Christophe, Dr Rochat Stéphane. Methods.- The Lc65+ cohort was launched in 2004 with the random selection of 3054 eligible individuals aged 65 to 70 (birth year 1934-1938) in the non-institutionalized population of Lausanne (Switzerland). Results.- Information is collected about life course social and health-related events, socio-economics, medical and psychosocial dimensions, lifestyle habits, limitations in activities of daily living, mobility impairments, and falls. Gait performance are objectively measured using body-fixed sensors. Frailty is assessed using Fried's frailty phenotype. Follow-up consists in annual self-completed questionnaires, as well as physical examination and physical and mental performance tests every three years. - Lausanne cohort 65+ (Lc65 + ): design and longitudinal outcomes. The baseline data collection was completed among 1422 participants in 2004-2005 through self-completed questionnaires, face-to-face interviews, physical examination and tests of mental and physical performances. Information about institutionalization, self-reported health services utilization, and death is also assessed. An additional random sample (n = 1525) of 65-70 years old subjects was recruited in 2009 (birth year 1939-1943). - lecture no 4: alcohol intake and gait parameters: prevalent and longitudinal association in the Lc65+ study. The association between alcohol intake and gait performance was investigated.
Resumo:
High-throughput prioritization of cancer-causing mutations (drivers) is a key challenge of cancer genome projects, due to the number of somatic variants detected in tumors. One important step in this task is to assess the functional impact of tumor somatic mutations. A number of computational methods have been employed for that purpose, although most were originally developed to distinguish disease-related nonsynonymous single nucleotide variants (nsSNVs) from polymorphisms. Our new method, transformed Functional Impact score for Cancer (transFIC), improves the assessment of the functional impact of tumor nsSNVs by taking into account the baseline tolerance of genes to functional variants.
Resumo:
Motivation: We compare phylogenetic approaches for inferring functional gene links. The approaches detect independent instances of the correlated gain and loss of pairs of genes from species' genomes. We investigate the effect on results of basing evidence of correlations on two phylogenetic approaches, Dollo parsminony and maximum likelihood (ML). We further examine the effect of constraining the ML model by fixing the rate of gene gain at a low value, rather than estimating it from the data. Results: We detect correlated evolution among a test set of pairs of yeast (Saccharomyces cerevisiae) genes, with a case study of 21 eukaryotic genomes and test data derived from known yeast protein complexes. If the rate at which genes are gained is constrained to be low, ML achieves by far the best results at detecting known functional links. The model then has fewer parameters but it is more realistic by preventing genes from being gained more than once. Availability: BayesTraits by M. Pagel and A. Meade, and a script to configure and repeatedly launch it by D. Barker and M. Pagel, are available at http://www.evolution.reading.ac.uk .
Resumo:
An automatic method for recognizing natively disordered regions from amino acid sequence is described and benchmarked against predictors that were assessed at the latest critical assessment of techniques for protein structure prediction (CASP) experiment. The method attains a Wilcoxon score of 90.0, which represents a statistically significant improvement on the methods evaluated on the same targets at CASP. The classifier, DISOPRED2, was used to estimate the frequency of native disorder in several representative genomes from the three kingdoms of life. Putative, long (>30 residue) disordered segments are found to occur in 2.0% of archaean, 4.2% of eubacterial and 33.0% of eukaryotic proteins. The function of proteins with long predicted regions of disorder was investigated using the gene ontology annotations supplied with the Saccharomyces genome database. The analysis of the yeast proteome suggests that proteins containing disorder are often located in the cell nucleus and are involved in the regulation of transcription and cell signalling. The results also indicate that native disorder is associated with the molecular functions of kinase activity and nucleic acid binding.
Resumo:
World-wide structural genomics initiatives are rapidly accumulating structures for which limited functional information is available. Additionally, state-of-the art structural prediction programs are now capable of generating at least low resolution structural models of target proteins. Accurate detection and classification of functional sites within both solved and modelled protein structures therefore represents an important challenge. We present a fully automatic site detection method, FuncSite, that uses neural network classifiers to predict the location and type of functionally important sites in protein structures. The method is designed primarily to require only backbone residue positions without the need for specific side-chain atoms to be present. In order to highlight effective site detection in low resolution structural models FuncSite was used to screen model proteins generated using mGenTHREADER on a set of newly released structures. We found effective metal site detection even for moderate quality protein models illustrating the robustness of the method.
Resumo:
In this paper, we present the results of the prediction of the high-pressure adsorption equilibrium of supercritical. gases (Ar, N-2, CH4, and CO2) on various activated carbons (BPL, PCB, and Norit R1 extra) at various temperatures using a density-functional-theory-based finite wall thickness (FWT) model. Pore size distribution results of the carbons are taken from our recent previous work 1,2 using this approach for characterization. To validate the model, isotherms calculated from the density functional theory (DFT) approach are comprehensively verified against those determined by grand canonical Monte Carlo (GCMC) simulation, before the theoretical adsorption isotherms of these investigated carbons calculated by the model are compared with the experimental adsorption measurements of the carbons. We illustrate the accuracy and consistency of the FWT model for the prediction of adsorption isotherms of the all investigated gases. The pore network connectivity problem occurring in the examined carbons is also discussed, and on the basis of the success of the predictions assuming a similar pore size distribution for accessible and inaccessible regions, it is suggested that this is largely related to the disordered nature of the carbon.
Resumo:
Scorpion toxins are common experimental tools for studies of biochemical and pharmacological properties of ion channels. The number of functionally annotated scorpion toxins is steadily growing, but the number of identified toxin sequences is increasing at much faster pace. With an estimated 100,000 different variants, bioinformatic analysis of scorpion toxins is becoming a necessary tool for their systematic functional analysis. Here, we report a bioinformatics-driven system involving scorpion toxin structural classification, functional annotation, database technology, sequence comparison, nearest neighbour analysis, and decision rules which produces highly accurate predictions of scorpion toxin functional properties. (c) 2005 Elsevier Inc. All rights reserved.
Resumo:
Background. Exercise therapy improves functional capacity in CHF, but selection and individualization of training would be helped by a simple non-invasive marker of peak VO2. Peak VO2 in these pts is difficult to predict without direct measurement, and LV ejection fraction is a poor predictor. Myocardial tissue velocities are less load-dependent, and may be predictive of the exercise response in CHF pts. We sought to use tissue velocity as a predictor of peak VO2 in CHF pts. Methods. Resting 2D-echocardiography and tissue Doppler imaging were performed in 182 CHF pts (159 male, age 62±10 years) before and after metabolic exercise testing. The majority of these patients (129, 71%) had an ischemic cardiomyopathy, with resting EF of 35±13% and a peak VO2 of 13.5±4.7 ml/kg/min. Results. Neither resting EF (r=0.15) nor peak EF (r=0.18, both p=NS) were correlated with peak VO2. However, peak VO2 correlated with peak systolic velocity in septal (Vss, r=0.31) and lateral walls (Vsl, r=0.26, both p=0.01). In a general linear model (r2 = 0.25), peak VO2 was calculated from the following equation: 9.6 + 0.68*Vss - 0.09*age + 0.06*maximum HR. This model proved to be a superior predictor of peak VO2 (r=0.51, p=0.01) than the standard prediction equations of Wasserman (r= -0.12, p=0.01). Conclusions. Resting tissue Doppler, age and maximum heart rate may be used to predict functional capacity in CHF patients. This may be of use in selecting and following the response to therapy, including for exercise training.
Resumo:
A periodic density functional theory method using the B3LYP hybrid exchange-correlation potential is applied to the Prussian blue analogue RbMn[Fe(CN)6] to evaluate the suitability of the method for studying, and predicting, the photomagnetic behavior of Prussian blue analogues and related materials. The method allows correct description of the equilibrium structures of the different electronic configurations with regard to the cell parameters and bond distances. In agreement with the experimental data, the calculations have shown that the low-temperature phase (LT; Fe(2+)(t(6)2g, S = 0)-CN-Mn(3+)(t(3)2g e(1)g, S = 2)) is the stable phase at low temperature instead of the high-temperature phase (HT; Fe(3+)(t(5)2g, S = 1/2)-CN-Mn(2+)(t(3)2g e(2)g, S = 5/2)). Additionally, the method gives an estimation for the enthalpy difference (HT LT) with a value of 143 J mol(-1) K(-1). The comparison of our calculations with experimental data from the literature and from our calorimetric and X-ray photoelectron spectroscopy measurements on the Rb0.97Mn[Fe(CN)6]0.98 x 1.03 H2O compound is analyzed, and in general, a satisfactory agreement is obtained. The method also predicts the metastable nature of the electronic configuration of the high-temperature phase, a necessary condition to photoinduce that phase at low temperatures. It gives a photoactivation energy of 2.36 eV, which is in agreement with photoinduced demagnetization produced by a green laser.
Resumo:
We use the density functional theory/local-density approximation (DFT/LDA)-1/2 method [L. G. Ferreira , Phys. Rev. B 78, 125116 (2008)], which attempts to fix the electron self-energy deficiency of DFT/LDA by half-ionizing the whole Bloch band of the crystal, to calculate the band offsets of two Si/SiO(2) interface models. Our results are similar to those obtained with a ""state-of-the-art"" GW approach [R. Shaltaf , Phys. Rev. Lett. 100, 186401 (2008)], with the advantage of being as computationally inexpensive as the usual DFT/LDA. Our band gap and band offset predictions are in excellent agreement with experiments.