Biblioteca Digital

57 resultados para Bioinformatics

The automation of nested clade phylogeographic analysis

Relevância:

10.00% 10.00%

Publicador:

Resumo:

ANeCA is a fully automated implementation of Nested Clade Phylogeographic Analysis. This was originally developed by Templeton and colleagues, and has been used to infer, from the pattern of gene sequence polymorphisms in a geographically structured population, the historical demographic processes that have shaped its evolution. Until now it has been necessary to perform large parts of the procedure manually. We provide a program that will take data in Nexus sequential format, and directly output a set of inferences. The software also includes TCS v1.18 and GeoDis v2.2 as part of automation.

Analysing the ability to retain sidechain hydrogen-bonds in mutant proteins

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Motivation: Hydrogen bonds are one of the most important inter-atomic interactions in biology. Previous experimental, theoretical and bioinformatics analyses have shown that the hydrogen bonding potential of amino acids is generally satisfied and that buried unsatisfied hydrogen-bond-capable residues are destabilizing. When studying mutant proteins, or introducing mutations to residues involved in hydrogen bonding, one needs to know whether a hydrogen bond can be maintained. Our aim, therefore, was to develop a rapid method to evaluate whether a sidechain can form a hydrogen-bond. Results: A novel knowledge-based approach was developed in which the conformations accessible to the residues involved are taken into account. Residues involved in hydrogen bonds in a set of high resolution crystal structures were analyzed and this analysis is then applied to a given protein. The program was applied to assess mutations in the tumour-suppressor protein, p53. This raised the number of distinct mutations identified as disrupting sidechain-sidechain hydrogen bonding from 181 in our previous analysis to 202 in this analysis.

An extensible automated protein annotation tool: standardizing input and output using validated XML

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Motivation: There is a frequent need to apply a large range of local or remote prediction and annotation tools to one or more sequences. We have created a tool able to dispatch one or more sequences to assorted services by defining a consistent XML format for data and annotations. Results: By analyzing annotation tools, we have determined that annotations can be described using one or more of the six forms of data: numeric or textual annotation of residues, domains (residue ranges) or whole sequences. With this in mind, XML DTDs have been designed to store the input and output of any server. Plug-in wrappers to a number of services have been written which are called from a master script. The resulting APATML is then formatted for display in HTML. Alternatively further tools may be written to perform post-analysis.

High throughput profile-profile based fold recognition for the entire human proteome

Relevância:

10.00% 10.00%

Publicador:

Resumo:

BACKGROUND: In order to maintain the most comprehensive structural annotation databases we must carry out regular updates for each proteome using the latest profile-profile fold recognition methods. The ability to carry out these updates on demand is necessary to keep pace with the regular updates of sequence and structure databases. Providing the highest quality structural models requires the most intensive profile-profile fold recognition methods running with the very latest available sequence databases and fold libraries. However, running these methods on such a regular basis for every sequenced proteome requires large amounts of processing power.In this paper we describe and benchmark the JYDE (Job Yield Distribution Environment) system, which is a meta-scheduler designed to work above cluster schedulers, such as Sun Grid Engine (SGE) or Condor. We demonstrate the ability of JYDE to distribute the load of genomic-scale fold recognition across multiple independent Grid domains. We use the most recent profile-profile version of our mGenTHREADER software in order to annotate the latest version of the Human proteome against the latest sequence and structure databases in as short a time as possible. RESULTS: We show that our JYDE system is able to scale to large numbers of intensive fold recognition jobs running across several independent computer clusters. Using our JYDE system we have been able to annotate 99.9% of the protein sequences within the Human proteome in less than 24 hours, by harnessing over 500 CPUs from 3 independent Grid domains. CONCLUSION: This study clearly demonstrates the feasibility of carrying out on demand high quality structural annotations for the proteomes of major eukaryotic organisms. Specifically, we have shown that it is now possible to provide complete regular updates of profile-profile based fold recognition models for entire eukaryotic proteomes, through the use of Grid middleware such as JYDE.

Predicting functional gene links from phylogenetic-statistical analyses of whole genomes

Relevância:

10.00% 10.00%

Publicador:

Resumo:

An important element of the developing field of proteomics is to understand protein-protein interactions and other functional links amongst genes. Across-species correlation methods for detecting functional links work on the premise that functionally linked proteins will tend to show a common pattern of presence and absence across a range of genomes. We describe a maximum likelihood statistical model for predicting functional gene linkages. The method detects independent instances of the correlated gain or loss of pairs of proteins on phylogenetic trees, reducing the high rates of false positives observed in conventional across-species methods that do not explicitly incorporate a phylogeny. We show, in a dataset of 10,551 protein pairs, that the phylogenetic method improves by up to 35% on across-species analyses at identifying known functionally linked proteins. The method shows that protein pairs with at least two to three correlated events of gain or loss are almost certainly functionally linked. Contingent evolution, in which one gene's presence or absence depends upon the presence of another, can also be detected phylogenetically, and may identify genes whose functional significance depends upon its interaction with other genes. Incorporating phylogenetic information improves the prediction of functional linkages. The improvement derives from having a lower rate of false positives and from detecting trends that across-species analyses miss. Phylogenetic methods can easily be incorporated into the screening of large-scale bioinformatics datasets to identify sets of protein links and to characterise gene networks.

Modelling human genetic history

Relevância:

10.00% 10.00%

Publicador:

Kinetic and crystallographic studies of glucopyranose spirohydantoin and glucopyranosylamine analogs inhibitors of glycogen phosphorylase

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Glycogen phosphorylase (GP) is currently exploited as a target for inhibition of hepatic glycogenolysis under high glucose conditions. Spirohydantoin of glucopyranose and N-acetyl-beta-D-glucopyranosylamine have been identified as the most potent inhibitors of GP that bind at the catalytic site. Four spirohydantoin and three beta-D-glucopyranosylamine analogs have been designed, synthesized and tested for inhibition of GP in kinetic experiments. Depending on the functional group introduced, the K(i) values varied from 16.5 microM to 1200 microM. In order to rationalize the kinetic results, we determined the crystal structures of the analogs in complex with GP. All the inhibitors bound at the catalytic site of the enzyme, by making direct and water-mediated hydrogen bonds with the protein and by inducing minor movements of the side chains of Asp283 and Asn284, of the 280s loop that blocks access of the substrate glycogen to the catalytic site, and changes in the water structure in the vicinity of the site. The differences observed in the Ki values of the analogs can be interpreted in terms of variations in hydrogen bonding and van der Waals interactions, desolvation effects, ligand conformational entropy, and displacement of water molecules on ligand binding to the catalytic site.

LVB: parsimony and simulated annealing in the search for phylogenetic trees

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Summary: The program LVB seeks parsimonious phylogenies from nucleotide alignments, using the simulated annealing heuristic. LVB runs fast and gives high quality results.

PDBSprotEC: A web-accessible database linking PDB chains to EC numbers via SwissProt

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A mapping between chains in the Protein Databank and Enzyme Classification numbers is invaluable for research into structure-function relationships. Mapping at the chain level is a non-trivial problem and we present an automatically updated Web-server, which provides this link in a queryable form and as a downloadable XML or flat file.

Is it functional?

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This report describes the first scientific meeting of the British Society for Proteome Research (BSPR), which was organised jointly with the European Bioinformatics Institute (EBI) and held in July 2004. The focus of the conference was functional proteomics with an emphasis on possible clinical application. The main subjects described here are: the need to simplify samples, the use of biological fluids verses tissue, consideration of biological and experimental variation and the creation of databases to achieve menaingful functional analysis.

Creating frameworks to support interoperability of biodiversity data

Relevância:

10.00% 10.00%

Publicador:

The challenges for molecular nutrition research 1: linking genotype to healthy nutrition

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Nutrition science finds itself at a major crossroad. On the one hand we can continue the current path, which has resulted in some substantial advances, but also many conflicting messages which impair the trust of the general population, especially those who are motivated to improve their health through diet. The other road is uncharted and is being built over the many exciting new developments in life sciences. This new era of nutrition recognizes the complex relation between the health of the individual, its genome, and the life-long dietary exposure, and has lead to the realisation that nutrition is essentially a gene - environment interaction science. This review on the relation between genotype, diet and health is the first of a series dealing with the major challenges in molecular nutrition, analyzing the foundations of nutrition research. With the unravelling of the human genome and the linking of its variability to a multitude of phenotypes from " healthy'' to an enormously complex range of predispositions, the dietary modulation of these propensities has become an area of active research. Classical genetic approaches applied so far in medical genetics have steered away from incorporating dietary effects in their models and paradoxically, most genetic studies analyzing diet-associated phenotypes and diseases simply ignore diet. Yet, a modest but increasing number of studies are accounting for diet as a modulator of genetic associations. These range from observational cohorts to intervention studies with prospectively selected genotypes. New statistical and bioinformatics approaches are becoming available to aid in design and evaluation of these studies. This review discusses the various approaches used and provides concrete recommendations for future research.

Topographic map of gammaproteobacteria using 16 s rRNA gene sequence

Relevância:

10.00% 10.00%

Publicador:

FunFOLD: an improved automated method for the prediction of ligand binding residues using 3D models of proteins

Relevância:

10.00% 10.00%

Publicador:

Biomarker discovery and redundancy reduction towards classification using a multi-factorial MALDI-TOF MS T2DM mouse model dataset

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Diabetes like many diseases and biological processes is not mono-causal. On the one hand multifactorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics.

«
1
2
3
4
»