918 resultados para Sequence Diagram


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Alignment-free methods, in which shared properties of sub-sequences (e.g. identity or match length) are extracted and used to compute a distance matrix, have recently been explored for phylogenetic inference. However, the scalability and robustness of these methods to key evolutionary processes remain to be investigated. Here, using simulated sequence sets of various sizes in both nucleotides and amino acids, we systematically assess the accuracy of phylogenetic inference using an alignment-free approach, based on D2 statistics, under different evolutionary scenarios. We find that compared to a multiple sequence alignment approach, D2 methods are more robust against among-site rate heterogeneity, compositional biases, genetic rearrangements and insertions/deletions, but are more sensitive to recent sequence divergence and sequence truncation. Across diverse empirical datasets, the alignment-free methods perform well for sequences sharing low divergence, at greater computation speed. Our findings provide strong evidence for the scalability and the potential use of alignment-free methods in large-scale phylogenomics.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Acinetobacter baumannii isolate A1 was recovered in the United Kingdom in 1982 and belongs to global clone 1 (GC1). Here, we present its complete 3.91-Mbp genome sequence, generated via a combination of short-read sequencing (Illumina), long-read sequencing (PacBio), and manual finishing.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The human genome project was a grand scientific enterprise which attracted both hyperbole and ridicule alike. The project was lauded as “the moon shot of the life sciences”, the “holy grail of man”, “the code of codes”, and “the book of life”. Such rhetoric has also received scorn. President George Bush senior managed to deflate the pretensions of the project with the accidental slip that it was the “human gnome initiative”. In The Sequence, Kevin Davies seeks to go beyond such metaphors, and provide a candid and honest account of the race of the human genome project. The author is indebted to the authoritative book The Gene Wars, which considered the early struggles over the human genome project. Robert Cook-Deegan observes that there was initially much debate over whether there should be a Human Genome Project at all: The debate became one of “big” science versus “small” science. The reliance on systematic technology development and goal-directed gene-mapping efforts presaged a new style for biology, one that elicited excitement from those attracted to whiz-bang technologies but drew gasps of revulsion from those who aspired to cultivate biology on a more modest scale and with decentralized organisation. The battle was, among other things, over whose vision would control the budget and which scientific aesthetic would prevail.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

It's akin to the old Spanish, English and Portuguese explorers. They would take their boats until they found some edge of land, then they would go up and plant the flag of their king or queen. They didn't know what they'd discovered; how big it is, where it goes to - but they would claim it anyway. David Korn of the Association of American Medical Colleges This article analyses recent litigation over patent law and expressed sequence tags (ESTs). In the case of In re Fisher, the United States Court of Appeals for the Federal Circuit engaged in judicial consideration of the revised utility guidelines of the United States Patent and Trademark Office (USPTO). In this matter, the agricultural biotechnology company Monsanto sought to patent ESTs in maize plants. A patent examiner and the Board of Patent Appeals and Interferences had doubted whether the patent application was useful. Monsanto appealed against the rulings of the USPTO. A number of amicus curiae intervened in the matter in support of the USPTO - including Genentech, Affymetrix, Dow AgroSciences, Eli Lilly, the National Academy of Sciences, and the Association of American Medical Colleges. The majority of the Court of Appeals for the Federal Circuit supported the position of the USPTO, and rejected the patent application on the grounds of utility. The split decision highlighted institutional tensions over the appropriate thresholds for patent criteria - such as novelty, non-obviousness, and utility. The litigation raised larger questions about the definition of research tools, the incremental nature of scientific progress, and the role of patent law in innovation policy. The decision of In re Fisher will have significant ramifications for gene patents, in the wake of the human genome project. Arguably, the USPTO utility guidelines need to be reinforced by a tougher application of the standards of novelty and non-obviousness in respect of gene patents.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper addresses the problem of predicting the outcome of an ongoing case of a business process based on event logs. In this setting, the outcome of a case may refer for example to the achievement of a performance objective or the fulfillment of a compliance rule upon completion of the case. Given a log consisting of traces of completed cases, given a trace of an ongoing case, and given two or more possible out- comes (e.g., a positive and a negative outcome), the paper addresses the problem of determining the most likely outcome for the case in question. Previous approaches to this problem are largely based on simple symbolic sequence classification, meaning that they extract features from traces seen as sequences of event labels, and use these features to construct a classifier for runtime prediction. In doing so, these approaches ignore the data payload associated to each event. This paper approaches the problem from a different angle by treating traces as complex symbolic sequences, that is, sequences of events each carrying a data payload. In this context, the paper outlines different feature encodings of complex symbolic sequences and compares their predictive accuracy on real-life business process event logs.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Idiomarina sp. strain 28-8 is an aerobic, Gram-negative, flagellar bacterium isolated from the bodies of ark shells (Scapharca broughtonii) collected from underwater sediments in Gangjin Bay, South Korea. Here, we present the draft genome sequence of Idiomarina sp. 28-8 (2,971,606 bp, with a G+C content of 46.9%), containing 2,795 putative coding sequences.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The native Asian oyster, Crassostrea ariakensis is one of the most common and important Crassostrea species that occur naturally along the coast of East Asia. Molecular species diagnosis is a prerequisite for population genetic analysis of wild oyster populations because oyster species cannot be discriminated reliably using external morphological characters alone due to character ambiguity. To date there have been few phylogeographic studies of natural edible oyster populations in East Asia, in particular this is true of the common species in Korea C. ariakensis. We therefore assessed the levels and patterns of molecular genetic variation in East Asian wild populations of C. ariakensis from Korea, Japan, and China using DNA sequence analysis of five concatenated mtDNA regions namely; 16S rRNA, cytochrome oxidase I, cytochrome oxidase II, cytochrome oxidase III, and cytochrome b. Two divergent C. ariakensis clades were identified between southern China and remaining sites from the northern region. In addition, hierarchical AMOVA and pairwise UST analyses showed that genetic diversity was discontinuous among wild populations of C. ariakensis in East Asia. Biogeographical and historical sea level changes are discussed as potential factors that may have influenced the genetic heterogeneity of wild C. ariakensis stocks across this region.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Single nucleotide polymorphisms (SNPs) are widely acknowledged as the marker of choice for many genetic and genomic applications because they show co-dominant inheritance, are highly abundant across genomes and are suitable for high-throughput genotyping. Here we evaluated the applicability of SNP markers developed from Crassostrea gigas and C. virginica expressed sequence tags (ESTs) in closely related Crassostrea and Ostrea species. A total of 213 putative interspecific level SNPs were identified from re-sequencing data in six amplicons, yielding on average of one interspecific level SNP per seven bp. High polymorphism levels were observed and the high success rate of transferability show that genic EST-derived SNP markers provide an efficient method for rapid marker development and SNP discovery in closely related oyster species. The six EST-SNP markers identified here will provide useful molecular tools for addressing questions in molecular ecology and evolution studies including for stock analysis (pedigree monitoring) in related oyster taxa.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Molecular phylogenetic studies of homologous sequences of nucleotides often assume that the underlying evolutionary process was globally stationary, reversible, and homogeneous (SRH), and that a model of evolution with one or more site-specific and time-reversible rate matrices (e.g., the GTR rate matrix) is enough to accurately model the evolution of data over the whole tree. However, an increasing body of data suggests that evolution under these conditions is an exception, rather than the norm. To address this issue, several non-SRH models of molecular evolution have been proposed, but they either ignore heterogeneity in the substitution process across sites (HAS) or assume it can be modeled accurately using the distribution. As an alternative to these models of evolution, we introduce a family of mixture models that approximate HAS without the assumption of an underlying predefined statistical distribution. This family of mixture models is combined with non-SRH models of evolution that account for heterogeneity in the substitution process across lineages (HAL). We also present two algorithms for searching model space and identifying an optimal model of evolution that is less likely to over- or underparameterize the data. The performance of the two new algorithms was evaluated using alignments of nucleotides with 10 000 sites simulated under complex non-SRH conditions on a 25-tipped tree. The algorithms were found to be very successful, identifying the correct HAL model with a 75% success rate (the average success rate for assigning rate matrices to the tree's 48 edges was 99.25%) and, for the correct HAL model, identifying the correct HAS model with a 98% success rate. Finally, parameter estimates obtained under the correct HAL-HAS model were found to be accurate and precise. The merits of our new algorithms were illustrated with an analysis of 42 337 second codon sites extracted from a concatenation of 106 alignments of orthologous genes encoded by the nuclear genomes of Saccharomyces cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii, S. castellii, S. kluyveri, S. bayanus, and Candida albicans. Our results show that second codon sites in the ancestral genome of these species contained 49.1% invariable sites, 39.6% variable sites belonging to one rate category (V1), and 11.3% variable sites belonging to a second rate category (V2). The ancestral nucleotide content was found to differ markedly across these three sets of sites, and the evolutionary processes operating at the variable sites were found to be non-SRH and best modeled by a combination of eight edge-specific rate matrices (four for V1 and four for V2). The number of substitutions per site at the variable sites also differed markedly, with sites belonging to V1 evolving slower than those belonging to V2 along the lineages separating the seven species of Saccharomyces. Finally, sites belonging to V1 appeared to have ceased evolving along the lineages separating S. cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii, and S. bayanus, implying that they might have become so selectively constrained that they could be considered invariable sites in these species.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The koala (Phascolarctos cinereus) is an Australian marsupial that continues to experience significant population declines. Infectious diseases caused by pathogens such as Chlamydia are proposed to have a major role. Very few species-specific immunological reagents are available, severely hindering our ability to respond to the threat of infectious diseases in the koala. In this study, we utilise data from the sequencing of the koala transcriptome to identify key immunological markers of the koala adaptive immune response and cytokines known to be important in the host response to chlamydial infection in other species. This report describes the identification and preliminary sequence analysis of (1) T lymphocyte glycoprotein markers (CD4, CD8); (2) IL-4, a marker for the Th2 response; (3) cytokines such as IL-6, IL-12 and IL-1β, that have been shown to have a role in chlamydial clearance and pathology in other hosts; and (4) the sequences for the koala immunoglobulins, IgA, IgG, IgE and IgM. These sequences will enable the development of a range of immunological reagents for understanding the koala’s innate and adaptive immune responses, while also providing a resource that will enable continued investigations into the origin and evolution of the marsupial immune system.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Chlamydia pecorum is globally associated with several ovine diseases including keratoconjunctivitis and polyarthritis. The exact relationship between the variety of C. pecorum strains reported and the diseases described in sheep remains unclear, challenging efforts to accurately diagnose and manage infected flocks. In the present study, we applied C. pecorum multi-locus sequence typing (MLST) to C. pecorum positive samples collected from sympatric flocks of Australian sheep presenting with conjunctivitis, conjunctivitis with polyarthritis, or polyarthritis only and with no clinical disease (NCD) in order to elucidate the exact relationships between the infecting strains and the range of diseases. Using Bayesian phylogenetic and cluster analyses on 62 C. pecorum positive ocular, vaginal and rectal swab samples from sheep presenting with a range of diseases and in a comparison to C. pecorum sequence types (STs) from other hosts, one ST (ST 23) was recognised as a globally distributed strain associated with ovine and bovine diseases such as polyarthritis and encephalomyelitis. A second ST (ST 69) presently only described in Australian animals, was detected in association with ovine as well as koala chlamydial infections. The majority of vaginal and rectal C. pecorum STs from animals with NCD and/or anatomical sites with no clinical signs of disease in diseased animals, clustered together in a separate group, by both analyses. Furthermore, 8/13 detected STs were novel. This study provides a platform for strain selection for further research into the pathogenic potential of C. pecorum in animals and highlights targets for potential strain-specific diagnostic test development.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We have identified strong topoisomerase sites (STS) for Mycobacteruim smegmatis topoisomerase I in double-stranded DNA context using electrophoretic mobility shift assay of enzyme-DNA covalent complexes; Mg2+, an essential component for DNA relaxation activity of the enzyme, is not required for binding to DNA, The enzyme makes single-stranded nicks, with transient covalent interaction at the 5'-end of the broken DNA strand, a characteristic akin to prokaryotic topoisomerases. More importantly, the enzyme binds to duplex DNA having a preferred site with high affinity, a. property similar to the eukaryotic type I topoisomerases, The preferred cleavage site is mapped on a 65 bp duplex DNA and found to be CG/TCTT. Thus, the enzyme resembles other prokaryotic type I topoisomerases in mechanistics of the reaction, but is similar to eukaryotic enzymes in DNA recognition properties.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Treatment of bromoketals 2, derived from allyl alcohols 1, with tributyltin chloride, sodium cyanoborohydride and AIBN furnishes the tetrahydrofurannulated products 3 via a 5-exo-trig radical cyclisation reaction followed by reductive cleavage of ketal 4.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Jacalin and artocarpin, the two lectins from jackfruit (Artocarpus integrifolia) seeds, have different physicochemical properties and carbohydrate-binding specificities. However, comparison of the partial amino-acid sequence of artocarpin with the known sequence of jacalin indicates close to 50% sequence identity. Artocarpin crystallizes in two forms, both monoclinic P2(1), with one and two tetramic molecules, respectively, in the asymmetric units of form I (a = 69.9, b = 73.7, c = 60.6 Angstrom and beta = 95.1 degrees) and form II (a = 87.6, b = 72.2, c = 92.6 Angstrom and beta = 101.1 degrees). Both the crystal structures have been solved by the molecular replacement method using the known structure of jacalin as the search model and ope of them partially refined, confirming that the two lectins are indeed homologous.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Elucidation of the detailed structural features and sequence requirements for iv helices of various lengths could be very important in understanding secondary structure formation in proteins and, hence. in the protein folding mechanism. An algorithm to characterize the geometry of an alpha helix from its C-alpha coordinates has been developed and used to analyze the structures of long cu helices (number of residues greater than or equal to 25) found in globular proteins, the crystal structure coordinates of which are available from the Brookhaven Protein Data Bank, Ail long a helices can be unambiguously characterized as belonging to one of three classes: linear, curved, or kinked, with a majority being curved. Analysis of the sequences of these helices reveals that the long alpha helices have unique sequence characteristics that distinguish them from the short alpha helices in globular proteins, The distribution and statistical propensities of individual amino acids to occur in long alpha heices are different from those found in short alpha helices, with amino acids having longer side chains and/or having a greater number of functional groups occurring more frequently in these helices, The sequences of the long alpha helices can be correlated with their gross structural features, i.e., whether they are curved, linear, or kinked, and in case of the curved helices, with their curvature.