Biblioteca Digital

999 resultados para DNA Modeling

Derivation of Context-free Stochastic L-Grammar Rules for Promoter Sequence Modeling Using Support Vector Machine

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Formal grammars can used for describing complex repeatable structures such as DNA sequences. In this paper, we describe the structural composition of DNA sequences using a context-free stochastic L-grammar. L-grammars are a special class of parallel grammars that can model the growth of living organisms, e.g. plant development, and model the morphology of a variety of organisms. We believe that parallel grammars also can be used for modeling genetic mechanisms and sequences such as promoters. Promoters are short regulatory DNA sequences located upstream of a gene. Detection of promoters in DNA sequences is important for successful gene prediction. Promoters can be recognized by certain patterns that are conserved within a species, but there are many exceptions which makes the promoter recognition a complex problem. We replace the problem of promoter recognition by induction of context-free stochastic L-grammar rules, which are later used for the structural analysis of promoter sequences. L-grammar rules are derived automatically from the drosophila and vertebrate promoter datasets using a genetic programming technique and their fitness is evaluated using a Support Vector Machine (SVM) classifier. The artificial promoter sequences generated using the derived L- grammar rules are analyzed and compared with natural promoter sequences.

Modeling the quantitative specificity of DNA-binding proteins from example binding sites

Relevância:

40.00% 40.00%

Publicador:

Modeling the control of DNA replication in fission yeast

Relevância:

40.00% 40.00%

Publicador:

Resumo:

A central event in the eukaryotic cell cycle is the decision to commence DNA replication (S phase). Strict controls normally operate to prevent repeated rounds of DNA replication without intervening mitoses (“endoreplication”) or initiation of mitosis before DNA is fully replicated (“mitotic catastrophe”). Some of the genetic interactions involved in these controls have recently been identified in yeast. From this evidence we propose a molecular mechanism of “Start” control in Schizosaccharomyces pombe. Using established principles of biochemical kinetics, we compare the properties of this model in detail with the observed behavior of various mutant strains of fission yeast: wee1− (size control at Start), cdc13Δ and rum1OP (endoreplication), and wee1− rum1Δ (rapid division cycles of diminishing cell size). We discuss essential features of the mechanism that are responsible for characteristic properties of Start control in fission yeast, to expose our proposal to crucial experimental tests.

Modeling the distribution of crossovers and interference with mice DNA data

Relevância:

40.00% 40.00%

Publicador:

Resumo:

Chiasma and crossover are two related biological processes of great importance in the understanding genetic variation. The study of these processes is straightforward in organisms where all products of meiosis are recovered and can be observed. This is not the case in mammals. Our understanding of these processes depends on our ability to model them. In this study I describe the biological processes that underline chiasma and crossover as well as the two main inference problems associated with these processes: i) in mammals we only recover one of the four products of meiosis and, ii) in general, we do not observe where the crossovers actually happen, but we find an interval containing type-2 censored information. NPML estimate was proposed and used in this work and used to compare chromosome length and chromosome expansion through the crosses.

On The Consistency Of Monte Carlo Track Structure Dna Damage Simulations.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Monte Carlo track structures (MCTS) simulations have been recognized as useful tools for radiobiological modeling. However, the authors noticed several issues regarding the consistency of reported data. Therefore, in this work, they analyze the impact of various user defined parameters on simulated direct DNA damage yields. In addition, they draw attention to discrepancies in published literature in DNA strand break (SB) yields and selected methodologies. The MCTS code Geant4-DNA was used to compare radial dose profiles in a nanometer-scale region of interest (ROI) for photon sources of varying sizes and energies. Then, electron tracks of 0.28 keV-220 keV were superimposed on a geometric DNA model composed of 2.7 × 10(6) nucleosomes, and SBs were simulated according to four definitions based on energy deposits or energy transfers in DNA strand targets compared to a threshold energy ETH. The SB frequencies and complexities in nucleosomes as a function of incident electron energies were obtained. SBs were classified into higher order clusters such as single and double strand breaks (SSBs and DSBs) based on inter-SB distances and on the number of affected strands. Comparisons of different nonuniform dose distributions lacking charged particle equilibrium may lead to erroneous conclusions regarding the effect of energy on relative biological effectiveness. The energy transfer-based SB definitions give similar SB yields as the one based on energy deposit when ETH ≈ 10.79 eV, but deviate significantly for higher ETH values. Between 30 and 40 nucleosomes/Gy show at least one SB in the ROI. The number of nucleosomes that present a complex damage pattern of more than 2 SBs and the degree of complexity of the damage in these nucleosomes diminish as the incident electron energy increases. DNA damage classification into SSB and DSB is highly dependent on the definitions of these higher order structures and their implementations. The authors' show that, for the four studied models, different yields are expected by up to 54% for SSBs and by up to 32% for DSBs, as a function of the incident electrons energy and of the models being compared. MCTS simulations allow to compare direct DNA damage types and complexities induced by ionizing radiation. However, simulation results depend to a large degree on user-defined parameters, definitions, and algorithms such as: DNA model, dose distribution, SB definition, and the DNA damage clustering algorithm. These interdependencies should be well controlled during the simulations and explicitly reported when comparing results to experiments or calculations.

Molecular determinants of improved cathepsin B inhibition by new cystatins obtained by DNA shuffling

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Background: Cystatins are inhibitors of cysteine proteases. The majority are only weak inhibitors of human cathepsin B, which has been associated with cancer, Alzheimer's disease and arthritis. Results: Starting from the sequences of oryzacystatin-1 and canecystatin-1, a shuffling library was designed and a hybrid clone obtained, which presented higher inhibitory activity towards cathepsin B. This clone presented two unanticipated point mutations as well as an N-terminal deletion. Reversing each point mutation independently or both simultaneously abolishes the inhibitory activity towards cathepsin B. Homology modeling together with experimental studies of the reverse mutants revealed the likely molecular determinants of the improved inhibitory activity to be related to decreased protein stability. Conclusion: A combination of experimental approaches including gene shuffling, enzyme assays and reverse mutation allied to molecular modeling has shed light upon the unexpected inhibitory properties of certain cystatin mutants against Cathepsin B. We conclude that mutations disrupting the hydrophobic core of phytocystatins increase the flexibility of the N-terminus, leading to an increase in inhibitory activity. Such mutations need not affect the inhibitory site directly but may be observed distant from it and manifest their effects via an uncoupling of its three components as a result of increased protein flexibility.

Constraint-based modeling of minimum set covering: application to species differentation

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Work presented in the context of the European Master in Computational Logics, as partial requisit for the graduation as Master in Computational Logics

Development of computational tools for the integrated analysis of DNA microarray data with applications in cancer research

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The MAP-i Doctoral Program of the Universities of Minho, Aveiro and Porto

Modeling sequencing errors by combining Hidden Markov models.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Among the largest resources for biological sequence data is the large amount of expressed sequence tags (ESTs) available in public and proprietary databases. ESTs provide information on transcripts but for technical reasons they often contain sequencing errors. Therefore, when analyzing EST sequences computationally, such errors must be taken into account. Earlier attempts to model error prone coding regions have shown good performance in detecting and predicting these while correcting sequencing errors using codon usage frequencies. In the research presented here, we improve the detection of translation start and stop sites by integrating a more complex mRNA model with codon usage bias based error correction into one hidden Markov model (HMM), thus generalizing this error correction approach to more complex HMMs. We show that our method maintains the performance in detecting coding sequences.

MAMOT: hidden Markov modeling tool.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Hidden Markov models (HMMs) are probabilistic models that are well adapted to many tasks in bioinformatics, for example, for predicting the occurrence of specific motifs in biological sequences. MAMOT is a command-line program for Unix-like operating systems, including MacOS X, that we developed to allow scientists to apply HMMs more easily in their research. One can define the architecture and initial parameters of the model in a text file and then use MAMOT for parameter optimization on example data, decoding (like predicting motif occurrence in sequences) and the production of stochastic sequences generated according to the probabilistic model. Two examples for which models are provided are coiled-coil domains in protein sequences and protein binding sites in DNA. A wealth of useful features include the use of pseudocounts, state tying and fixing of selected parameters in learning, and the inclusion of prior probabilities in decoding. AVAILABILITY: MAMOT is implemented in C++, and is distributed under the GNU General Public Licence (GPL). The software, documentation, and example model files can be found at http://bcf.isb-sib.ch/mamot

Design of potent inhibitors of human RAD51 recombinase based on BRC motifs of BRCA2 protein: modeling and experimental validation of a chimera peptide.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

We have previously shown that a 28-amino acid peptide derived from the BRC4 motif of BRCA2 tumor suppressor inhibits selectively human RAD51 recombinase (HsRad51). With the aim of designing better inhibitors for cancer treatment, we combined an in silico docking approach with in vitro biochemical testing to construct a highly efficient chimera peptide from eight existing human BRC motifs. We built a molecular model of all BRC motifs complexed with HsRad51 based on the crystal structure of the BRC4 motif-HsRad51 complex, computed the interaction energy of each residue in each BRC motif, and selected the best amino acid residue at each binding position. This analysis enabled us to propose four amino acid substitutions in the BRC4 motif. Three of these increased the inhibitory effect in vitro, and this effect was found to be additive. We thus obtained a peptide that is about 10 times more efficient in inhibiting HsRad51-ssDNA complex formation than the original peptide.

Estimating dormant and active hematopoietic stem cell kinetics through extensive modeling of bromodeoxyuridine label-retaining cell dynamics.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Bone marrow hematopoietic stem cells (HSCs) are responsible for both lifelong daily maintenance of all blood cells and for repair after cell loss. Until recently the cellular mechanisms by which HSCs accomplish these two very different tasks remained an open question. Biological evidence has now been found for the existence of two related mouse HSC populations. First, a dormant HSC (d-HSC) population which harbors the highest self-renewal potential of all blood cells but is only induced into active self-renewal in response to hematopoietic stress. And second, an active HSC (a-HSC) subset that by and large produces the progenitors and mature cells required for maintenance of day-to-day hematopoiesis. Here we present computational analyses further supporting the d-HSC concept through extensive modeling of experimental DNA label-retaining cell (LRC) data. Our conclusion that the presence of a slowly dividing subpopulation of HSCs is the most likely explanation (amongst the various possible causes including stochastic cellular variation) of the observed long term Bromodeoxyuridine (BrdU) retention, is confirmed by the deterministic and stochastic models presented here. Moreover, modeling both HSC BrdU uptake and dilution in three stages and careful treatment of the BrdU detection sensitivity permitted improved estimates of HSC turnover rates. This analysis predicts that d-HSCs cycle about once every 149-193 days and a-HSCs about once every 28-36 days. We further predict that, using LRC assays, a 75%-92.5% purification of d-HSCs can be achieved after 59-130 days of chase. Interestingly, the d-HSC proportion is now estimated to be around 30-45% of total HSCs - more than twice that of our previous estimate.

High-throughput SELEX SAGE method for quantitative modeling of transcription-factor binding sites.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

The ability to determine the location and relative strength of all transcription-factor binding sites in a genome is important both for a comprehensive understanding of gene regulation and for effective promoter engineering in biotechnological applications. Here we present a bioinformatically driven experimental method to accurately define the DNA-binding sequence specificity of transcription factors. A generalized profile was used as a predictive quantitative model for binding sites, and its parameters were estimated from in vitro-selected ligands using standard hidden Markov model training algorithms. Computer simulations showed that several thousand low- to medium-affinity sequences are required to generate a profile of desired accuracy. To produce data on this scale, we applied high-throughput genomics methods to the biochemical problem addressed here. A method combining systematic evolution of ligands by exponential enrichment (SELEX) and serial analysis of gene expression (SAGE) protocols was coupled to an automated quality-controlled sequence extraction procedure based on Phred quality scores. This allowed the sequencing of a database of more than 10,000 potential DNA ligands for the CTF/NFI transcription factor. The resulting binding-site model defines the sequence specificity of this protein with a high degree of accuracy not achieved earlier and thereby makes it possible to identify previously unknown regulatory sequences in genomic DNA. A covariance analysis of the selected sites revealed non-independent base preferences at different nucleotide positions, providing insight into the binding mechanism.

Investigations on the cation induced liquid crystalline phases of high molecular weight DNA

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Liquid Crystalline DNA is emerging as an active area of research, due to its potential applications in diverse fields, ranging from nanoelectronics to therapeutics. Since, counter ion neutralization is an essential requirement for the expression of LC DNA, and the present level of understanding on the LC phase behavior of high molecular weight DNA is inadequate, a thorough investigation is required to understand the nature and stability of these phases under the influence of various cationic species. The present study is, therefore mainly focused on a comparative investigation of the effect of metal ions of varying charge, size, hydration and binding modes on the LC phase behavior of high molecular weight DNA. The main objectives of the works are investigations on the induction and stabilization of LC phases of high molecular weight DNA by alkali metal ions, investigations on the induction and stabilization of LC phases of high molecular weight DNA by alkaline earth metal ions, effects of multivalent, transition and heavy metal ions on the LC phase behavior of high molecular weight DNA and investigations on spermine induced LC behavior of high molecular weight DNA in the presence of alkali and alkaline earth metal ions. The critical DNA concentration (CD) required for the expression of LC phases, phase transitions and their stability varied considerably when the binding site of the metal ions changed from phosphate groups to the nitrogenous bases of DNA, with Li+ giving the highest stability. Multiple LC phases with different textures, sometimes diffused and unstable or otherwise mainly distinct and clear, were observed on mixing metal ions with DNA solutions, which in turn depended on the charge, size, hydration factor, binding modes, concentration of the metal ions and time. Molecular modeling studies on binding of selected metal ions to DNA supported the experimental findings

Syntheses of Cross-Linking Agents, Molecular Modeling of Oligonucleotides and Polypeptide-Ion Complexes

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Each section of this thesis will be subdivided into three parts encompassing all of the research in which I have been involved during the past three years. These will be referred to under the headings "Syntheses:' "Molecular Modeling," and "Cross-linking Efficiencies." Each of these subdivisions may have divisions within them when necessary in order to fully detail the research.

«
1
2
3
4
5
6
7
8
...
66
67
»