2 resultados para Generation from examples

em Bucknell University Digital Commons - Pensilvania - USA


Relevância:

40.00% 40.00%

Publicador:

Resumo:

The goals of this article are to (1) provide further validation of the Glycam06 force field, specifically for its use in implicit solvent molecular dynamic (MD) simulations, and (2) to present the extension of G.N. Ramachandran's idea of plotting amino acid phi and psi angles to the glycosidic phi, psi, and omega angles formed between carbohydrates. As in traditional Ramachandran plots, these carbohydrate Ramachandran-type (carb-Rama) plots reveal the coupling between the glycosidic angles by displaying the allowed and disallowed conformational space. Considering two-bond glycosidic linkages, there are 18 possible conformational regions that can be defined by (α, ϕ, ψ) and (β, ϕ, ψ), whereas for three-bond linkages, there are 54 possible regions that can be defined by (α, ϕ, ψ, ω) and (β, ϕ, ψ, ω). Illustrating these ideas are molecular dynamic simulations on an implicitly hydrated oligosaccharide (700 ns) and its eight constituent disaccharides (50 ns/disaccharide). For each linkage, we compare and contrast the oligosaccharide and respective disaccharide carb-Rama plots, validate the simulations and the Glycam06 force field through comparison to experimental data, and discuss the general trends observed in the plots.

Relevância:

40.00% 40.00%

Publicador:

Resumo:

With the advent of cheaper and faster DNA sequencing technologies, assembly methods have greatly changed. Instead of outputting reads that are thousands of base pairs long, new sequencers parallelize the task by producing read lengths between 35 and 400 base pairs. Reconstructing an organism’s genome from these millions of reads is a computationally expensive task. Our algorithm solves this problem by organizing and indexing the reads using n-grams, which are short, fixed-length DNA sequences of length n. These n-grams are used to efficiently locate putative read joins, thereby eliminating the need to perform an exhaustive search over all possible read pairs. Our goal was develop a novel n-gram method for the assembly of genomes from next-generation sequencers. Specifically, a probabilistic, iterative approach was utilized to determine the most likely reads to join through development of a new metric that models the probability of any two arbitrary reads being joined together. Tests were run using simulated short read data based on randomly created genomes ranging in lengths from 10,000 to 100,000 nucleotides with 16 to 20x coverage. We were able to successfully re-assemble entire genomes up to 100,000 nucleotides in length.