79 resultados para Computational biology and bioinformatics
Resumo:
In recent times computational algorithms inspired by biological processes and evolution are gaining much popularity for solving science and engineering problems. These algorithms are broadly classified into evolutionary computation and swarm intelligence algorithms, which are derived based on the analogy of natural evolution and biological activities. These include genetic algorithms, genetic programming, differential evolution, particle swarm optimization, ant colony optimization, artificial neural networks, etc. The algorithms being random-search techniques, use some heuristics to guide the search towards optimal solution and speed-up the convergence to obtain the global optimal solutions. The bio-inspired methods have several attractive features and advantages compared to conventional optimization solvers. They also facilitate the advantage of simulation and optimization environment simultaneously to solve hard-to-define (in simple expressions), real-world problems. These biologically inspired methods have provided novel ways of problem-solving for practical problems in traffic routing, networking, games, industry, robotics, economics, mechanical, chemical, electrical, civil, water resources and others fields. This article discusses the key features and development of bio-inspired computational algorithms, and their scope for application in science and engineering fields.
Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences
Resumo:
Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.
Resumo:
Flap dynamics of HIV-1 protease (HIV-pr) controls the entry of inhibitors and substrates to the active site. Dynamical models from previous simulations are not all consistent with each other and not all are supported by the NMR results. In the present work, the er effect of force field on the dynamics of HIV-pr is investigated by MD simulations using three AMBER force fields ff99, ff99SB, and ff03. The generalized order parameters for amide backbone are calculated from the three force fields and compared with the NMR S2 values. We found that the ff99SB and ff03 force field calculated order parameters agree reasonably well with the NMR S2 values, whereas ff99 calculated values deviate most from the NMR order parameters. Stereochemical geometry of protein models from each force field also agrees well with the remarks from NMR S2 values. However, between ff99SB and ff03, there are several differences, most notably in the loop regions. It is found that these loops are, in general, more flexible in the ff03 force field. This results in a larger active site cavity in the simulation with the ff03 force field. The effect of this difference in computer-aided drug design against flexible receptors is discussed.
Resumo:
This article is concerned with the evolution of haploid organisms that reproduce asexually. In a seminal piece of work, Eigen and coauthors proposed the quasispecies model in an attempt to understand such an evolutionary process. Their work has impacted antiviral treatment and vaccine design strategies. Yet, predictions of the quasispecies model are at best viewed as a guideline, primarily because it assumes an infinite population size, whereas realistic population sizes can be quite small. In this paper we consider a population genetics-based model aimed at understanding the evolution of such organisms with finite population sizes and present a rigorous study of the convergence and computational issues that arise therein. Our first result is structural and shows that, at any time during the evolution, as the population size tends to infinity, the distribution of genomes predicted by our model converges to that predicted by the quasispecies model. This justifies the continued use of the quasispecies model to derive guidelines for intervention. While the stationary state in the quasispecies model is readily obtained, due to the explosion of the state space in our model, exact computations are prohibitive. Our second set of results are computational in nature and address this issue. We derive conditions on the parameters of evolution under which our stochastic model mixes rapidly. Further, for a class of widely used fitness landscapes we give a fast deterministic algorithm which computes the stationary distribution of our model. These computational tools are expected to serve as a framework for the modeling of strategies for the deployment of mutagenic drugs.
Resumo:
In systems biology, questions concerning the molecular and cellular makeup of an organism are of utmost importance, especially when trying to understand how unreliable components-like genetic circuits, biochemical cascades, and ion channels, among others-enable reliable and adaptive behaviour. The repertoire and speed of biological computations are limited by thermodynamic or metabolic constraints: an example can be found in neurons, where fluctuations in biophysical states limit the information they can encode-with almost 20-60% of the total energy allocated for the brain used for signalling purposes, either via action potentials or by synaptic transmission. Here, we consider the imperatives for neurons to optimise computational and metabolic efficiency, wherein benefits and costs trade-off against each other in the context of self-organised and adaptive behaviour. In particular, we try to link information theoretic (variational) and thermodynamic (Helmholtz) free-energy formulations of neuronal processing and show how they are related in a fundamental way through a complexity minimisation lemma.
Resumo:
A balance between excitatory and inhibitory synaptic currents is thought to be important for several aspects of information processing in cortical neurons in vivo, including gain control, bandwidth and receptive field structure. These factors will affect the firing rate of cortical neurons and their reliability, with consequences for their information coding and energy consumption. Yet how balanced synaptic currents contribute to the coding efficiency and energy efficiency of cortical neurons remains unclear. We used single compartment computational models with stochastic voltage-gated ion channels to determine whether synaptic regimes that produce balanced excitatory and inhibitory currents have specific advantages over other input regimes. Specifically, we compared models with only excitatory synaptic inputs to those with equal excitatory and inhibitory conductances, and stronger inhibitory than excitatory conductances (i.e. approximately balanced synaptic currents). Using these models, we show that balanced synaptic currents evoke fewer spikes per second than excitatory inputs alone or equal excitatory and inhibitory conductances. However, spikes evoked by balanced synaptic inputs are more informative (bits/spike), so that spike trains evoked by all three regimes have similar information rates (bits/s). Consequently, because spikes dominate the energy consumption of our computational models, approximately balanced synaptic currents are also more energy efficient than other synaptic regimes. Thus, by producing fewer, more informative spikes approximately balanced synaptic currents in cortical neurons can promote both coding efficiency and energy efficiency.
Resumo:
The most spectacular applications of crystallography are currently concerned with biological macromolecules like proteins and their assemblies. Macromolecular crystallography originated in England in the thirties of the last century, but definitive results began to appear only around 1960. Since then macromolecular crystallography has grown to become central to modern biology. India has a long tradition in crystallography starting with the work of K. Banerjee in the thirties. In addition to their contributions to crystallography, G.N. Ramachandran and his colleagues gave a head start to India in computational biology, molecular modeling and what we now call bioinformatics. However, attempts to initiate macromolecular crystallography in India started only in the seventies. The work took off the ground after the Department of Science and Technology handsomely supported the group at Indian Institute of Science, Bangalore in 1983. The Bangalore group was also recognized as a national nucleus for the development of the area in the country. Since then macromolecular crystallography, practiced in more than 30 institutions in the country, has grown to become an important component of scientific research in India. The articles in this issue provide a flavor of activities in the area in the country. The area is still in an expanding phase and is poised to scale greater heights.
Resumo:
Information is encoded in neural circuits using both graded and action potentials, converting between them within single neurons and successive processing layers. This conversion is accompanied by information loss and a drop in energy efficiency. We investigate the biophysical causes of this loss of information and efficiency by comparing spiking neuron models, containing stochastic voltage-gated Na+ and K+ channels, with generator potential and graded potential models lacking voltage-gated Na+ channels. We identify three causes of information loss in the generator potential that are the by-product of action potential generation: (1) the voltage-gated Na+ channels necessary for action potential generation increase intrinsic noise and (2) introduce non-linearities, and (3) the finite duration of the action potential creates a `footprint' in the generator potential that obscures incoming signals. These three processes reduce information rates by similar to 50% in generator potentials, to similar to 3 times that of spike trains. Both generator potentials and graded potentials consume almost an order of magnitude less energy per second than spike trains. Because of the lower information rates of generator potentials they are substantially less energy efficient than graded potentials. However, both are an order of magnitude more efficient than spike trains due to the higher energy costs and low information content of spikes, emphasizing that there is a two-fold cost of converting analogue to digital; information loss and cost inflation.
Resumo:
Translation initiation in Hepatitis C Virus (HCV) is mediated by Internal Ribosome Entry Site (IRES), which is independent of cap-structure and uses a limited number of canonical initiation factors. During translation initiation IRES-40S complex formation depends on high affinity interaction of IRES with ribosomal proteins. Earlier, it has been shown that ribosomal protein S5 (RPS5) interacts with HCV IRES. Here, we have extensively characterized the HCV IRES-RPS5 interaction and demonstrated its role in IRES function. Computational modelling and RNA-protein interaction studies demonstrated that the beta hairpin structure within RPS5 is critically required for the binding with domains II and IV. Mutations disrupting IRES-RPS5 interaction drastically reduced the 80S complex formation and the corresponding IRES activity. Computational analysis and UV cross-linking experiments using various IRES-mutants revealed interplay between domains II and IV mediated by RPS5. In addition, present study demonstrated that RPS5 interaction is unique to HCV IRES and is not involved in 40S-3 ` UTR interaction. Further, partial silencing of RPS5 resulted in preferential inhibition of HCV RNA translation. However, global translation was marginally affected by partial silencing of RPS5. Taken together, results provide novel molecular insights into IRES-RPS5 interaction and unravel its functional significance in mediating internal initiation of translation.
Resumo:
In recent times, zebrafish has garnered lot of popularity as model organism to study human cancers. Despite high evolutionary divergence from humans, zebrafish develops almost all types of human tumors when induced. However, mechanistic details of tumor formation have remained largely unknown. Present study is aimed at analysis of repertoire of kinases in zebrafish proteome to provide insights into various cellular components. Annotation using highly sensitive remote homology detection methods revealed ``substantial expansion'' of Ser/Thr/Tyr kinase family in zebrafish compared to humans, constituting over 3% of proteome. Subsequent classification of kinases into subfamilies revealed presence of large number of CAMK group of kinases, with massive representation of PIM kinases, important for cell cycle regulation and growth. Extensive sequence comparison between human and zebrafish PIM kinases revealed high conservation of functionally important residues with a few organism specific variations. There are about 300 PIM kinases in zebrafish kinome, while human genome codes for only about 500 kinases altogether. PIM kinases have been implicated in various human cancers and are currently being targeted to explore their therapeutic potentials. Hence, in depth analysis of PIM kinases in zebrafish has opened up new avenues of research to verify the model organism status of zebrafish.
Resumo:
One of the unexplored, yet important aspects of the biology of acyl carrier proteins (ACPs) is the self-acylation and malonyl transferase activities dedicated to ACPs in polyketide synthesis. Our studies demonstrate the existence of malonyl transferase activity in ACPs involved in type II fatty acid biosynthesis from Plasmodium falciparum and Escherichia coli. We also show that the catalytic malonyl transferase activity is intrinsic to an individual ACP. Mutational analysis implicates an arginine/lysine in loop II and an arginine/glutamine in helix III as the catalytic residues for transferase function. The hydrogen bonding properties of these residues appears to be indispensable for the transferase reaction. Complementation of fabD(Ts) E. coli highlights the putative physiological role of this process. Our studies thus shed light on a key aspect of ACP biology and provide insights into the mechanism involved therein.
Resumo:
We explore here the acceleration of convergence of iterative methods for the solution of a class of quasilinear and linear algebraic equations. The specific systems are the finite difference form of the Navier-Stokes equations and the energy equation for recirculating flows. The acceleration procedures considered are: the successive over relaxation scheme; several implicit methods; and a second-order procedure. A new implicit method—the alternating direction line iterative method—is proposed in this paper. The method combines the advantages of the line successive over relaxation and alternating direction implicit methods. The various methods are tested for their computational economy and accuracy on a typical recirculating flow situation. The numerical experiments show that the alternating direction line iterative method is the most economical method of solving the Navier-Stokes equations for all Reynolds numbers in the laminar regime. The usual ADI method is shown to be not so attractive for large Reynolds numbers because of the loss of diagonal dominance. This loss can however be restored by a suitable choice of the relaxation parameter, but at the cost of accuracy. The accuracy of the new procedure is comparable to that of the well-tested successive overrelaxation method and to the available results in the literature. The second-order procedure turns out to be the most efficient method for the solution of the linear energy equation.
Resumo:
4-Methyl-5-beta-hydroxyethylthiazole kinase (ThiK) catalyses the phosphorylation of the hydroxyl group of 4-methyl-5-beta-hydroxyethylthiazole. This work reports the first crystal structure of an archaeal ThiK: that from Pyrococcus horikoshii OT3 (PhThiK) at 1.85 angstrom resolution with a phosphate ion occupying the position of the beta-phosphate of the nucleotide. The topology of this enzyme shows the typical ribokinase fold of an alpha/beta protein. The overall structure of PhThiK is similar to those of Bacillus subtilis ThiK (BsThiK) and Enterococcus faecalis V583 ThiK (EfThiK). Sequence analysis of ThiK enzymes from various sources indicated that three-quarters of the residues involved in interfacial regions are conserved. It also revealed that the amino-acid residues in the nucleotide-binding, magnesium ion-binding and substrate-binding sites are conserved. Binding of the nucleotide and substrate to the ThiK enzyme do not influence the quaternary association (trimer) as revealed by the crystal structure of PhThiK.
Resumo:
Molecular dynamics simulations have been carried out on all the jacalin-carbohydrate complexes of known structure, models of unliganded molecules derived from the complexes and also models of relevant complexes where X-ray structures are not available. Results of the simulations and the available crystal structures involving jacalin permit delineation of the relatively rigid and flexible regions of the molecule and the dynamical variability of the hydrogen bonds involved in stabilizing the structure. Local flexibility appears to be related to solvent accessibility. Hydrogen bonds involving side chains and water bridges involving buried water molecules appear to be important in the stabilization of loop structures. The lectin-carbohydrate interactions observed in crystal structures, the average parameters pertaining to them derived from simulations, energetic contribution of the stacking residue estimated from quantum mechanical calculations, and the scatter of the locations of carbohydrate and carbohydrate-binding residues are consistent with the known thermodynamic parameters of jacalin-carbohydrate interactions. The simulations, along with X-ray results, provide a fuller picture of carbohydrate binding by jacalin than provided by crystallographic analysis alone. The simulations confirm that in the unliganded structures water molecules tend to occupy the positions occupied by carbohydrate oxygens in the lectin-carbohydrate complexes. Population distributions in simulations of the free lectin, the ligands, and the complexes indicate a combination of conformational selection and induced fit. Proteins 2009; 77:760-777.