899 resultados para SMT tools
Suite of tools for statistical N-gram language modeling for pattern mining in whole genome sequences
Resumo:
Genome sequences contain a number of patterns that have biomedical significance. Repetitive sequences of various kinds are a primary component of most of the genomic sequence patterns. We extended the suffix-array based Biological Language Modeling Toolkit to compute n-gram frequencies as well as n-gram language-model based perplexity in windows over the whole genome sequence to find biologically relevant patterns. We present the suite of tools and their application for analysis on whole human genome sequence.
Resumo:
A theoretical study has been carried out at the B3LYP/LANL2DZ level to compare the reactivity of phenyl isocyanate and phenyl isothiocyanate towards titanium(IV) alkoxides. Isocyanates are shown to favour both mono insertion and double insertion reactions. Double insertion in a head-to-tail fashion is shown to be more exothermic than double insertion in a head-to-head fashion. The head-to-head double insertion leads to the metathesis product, a carbodiimide, after the extrusion of carbon dioxide. In the case of phenyl isothiocyanate, calculations favour the formation of only mono insertion products. Formation of a double insertion product is highly unfavourable. Further, these studies indicate that the reverse reaction involving the metathesis of N,N-'-diphenyl carbodiimide with carbon dioxide is likely to proceed more efficiently than the metathesis reaction with carbon disulphide. This is in excellent agreement with experimental results as metathesis with carbon disulphide fails to occur. In a second study, multilayer MM/QM calculations are carried out on intermediates generated from reduction of titanium(IV) alkoxides to investigate the effect of alkoxy bridging on the reactivity of multinuclear Ti species. Bimolecular coupling of imines initiated by Ti(III) species leads to a mixture of diastereomers and not diastereoselective coupling of the imine. However if the reaction is carried out by a trimeric biradical species, diastereoselective coupling of the imine is predicted. The presence of alkoxy bridges greatly favours the formation of the d,l (+/-) isomer, whereas the intermediate without alkoxy bridges favours the more stable meso isomer. As a bridged trimeric species, stabilized by bridging alkoxy groups, correctly explains the diastereoselective reaction, it is the most likely intermediate in the reaction.
Resumo:
Position-dependent gene expression is a critical aspect of the development and behaviour of multicellular organisms. It requires a complex series of interactions to occur between different cell types in addition to intracellular signalling cascades. We used Escherichia coli to study the properties of an artificial signalling system at the interface between two expanding cell populations. We genetically engineered one population to produce a diffusible acyl-homoserine lactone (AHL) signal, and another population to respond to it. Our experiments demonstrate how such a signal can be used to reproducibly generate simple visible patterns with high accuracy in swimming agar. The producing and responding cassettes of two such signalling systems can be linked to produce a symmetric interface for bidirectional communication that can be used to visualise molecular logic. Intracellular feedback between these two cassettes would then create a framework for self-organised patterning of higher complexity. Adapting the experiments of Basu et al. (Basu et al., 2005) using cell motility, rather than a differential response to AHL concentrations as a way to define zones of response, we noted how the interaction of sender and receiver cell populations on a swimming plate could lead to complex pattern formation. Equipping highly motile strains such as E. coli MC1000 with AHL-mediated auto-inducing systems based on Vibrio fischeri luxI/luxR and Pseudomonas aeruginosa lasI/lasR cassettes would allow the amplification of a response to an AHL signal and its propagation. We designed and synthesised codon-optimised auto-inducing luxI/R and lasI/R cassettes as optimal gene expression is crucial for the generation of robust patterns. We still have to complete and test the entire genetic circuitry, although by modelling the system we were able to demonstrate its feasibility. © 2007 The Institution of Engineering and Technology.