991 resultados para sequence database


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Species identification based on short sequences of DNA markers, that is, DNA barcoding, has emerged as an integral part of modern taxonomy. However, software for the analysis of large and multilocus barcoding data sets is scarce. The Basic Local Alignment Search Tool (BLAST) is currently the fastest tool capable of handling large databases (e.g. >5000 sequences), but its accuracy is a concern and has been criticized for its local optimization. However, current more accurate software requires sequence alignment or complex calculations, which are time-consuming when dealing with large data sets during data preprocessing or during the search stage. Therefore, it is imperative to develop a practical program for both accurate and scalable species identification for DNA barcoding. In this context, we present VIP Barcoding: a user-friendly software in graphical user interface for rapid DNA barcoding. It adopts a hybrid, two-stage algorithm. First, an alignment-free composition vector (CV) method is utilized to reduce searching space by screening a reference database. The alignment-based K2P distance nearest-neighbour method is then employed to analyse the smaller data set generated in the first stage. In comparison with other software, we demonstrate that VIP Barcoding has (i) higher accuracy than Blastn and several alignment-free methods and (ii) higher scalability than alignment-based distance methods and character-based methods. These results suggest that this platform is able to deal with both large-scale and multilocus barcoding data with accuracy and can contribute to DNA barcoding for modern taxonomy. VIP Barcoding is free and available at http://msl.sls.cuhk.edu.hk/vipbarcoding/.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Chlamydia pecorum is globally associated with several ovine diseases including keratoconjunctivitis and polyarthritis. The exact relationship between the variety of C. pecorum strains reported and the diseases described in sheep remains unclear, challenging efforts to accurately diagnose and manage infected flocks. In the present study, we applied C. pecorum multi-locus sequence typing (MLST) to C. pecorum positive samples collected from sympatric flocks of Australian sheep presenting with conjunctivitis, conjunctivitis with polyarthritis, or polyarthritis only and with no clinical disease (NCD) in order to elucidate the exact relationships between the infecting strains and the range of diseases. Using Bayesian phylogenetic and cluster analyses on 62 C. pecorum positive ocular, vaginal and rectal swab samples from sheep presenting with a range of diseases and in a comparison to C. pecorum sequence types (STs) from other hosts, one ST (ST 23) was recognised as a globally distributed strain associated with ovine and bovine diseases such as polyarthritis and encephalomyelitis. A second ST (ST 69) presently only described in Australian animals, was detected in association with ovine as well as koala chlamydial infections. The majority of vaginal and rectal C. pecorum STs from animals with NCD and/or anatomical sites with no clinical signs of disease in diseased animals, clustered together in a separate group, by both analyses. Furthermore, 8/13 detected STs were novel. This study provides a platform for strain selection for further research into the pathogenic potential of C. pecorum in animals and highlights targets for potential strain-specific diagnostic test development.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We have identified strong topoisomerase sites (STS) for Mycobacteruim smegmatis topoisomerase I in double-stranded DNA context using electrophoretic mobility shift assay of enzyme-DNA covalent complexes; Mg2+, an essential component for DNA relaxation activity of the enzyme, is not required for binding to DNA, The enzyme makes single-stranded nicks, with transient covalent interaction at the 5'-end of the broken DNA strand, a characteristic akin to prokaryotic topoisomerases. More importantly, the enzyme binds to duplex DNA having a preferred site with high affinity, a. property similar to the eukaryotic type I topoisomerases, The preferred cleavage site is mapped on a 65 bp duplex DNA and found to be CG/TCTT. Thus, the enzyme resembles other prokaryotic type I topoisomerases in mechanistics of the reaction, but is similar to eukaryotic enzymes in DNA recognition properties.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Treatment of bromoketals 2, derived from allyl alcohols 1, with tributyltin chloride, sodium cyanoborohydride and AIBN furnishes the tetrahydrofurannulated products 3 via a 5-exo-trig radical cyclisation reaction followed by reductive cleavage of ketal 4.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Jacalin and artocarpin, the two lectins from jackfruit (Artocarpus integrifolia) seeds, have different physicochemical properties and carbohydrate-binding specificities. However, comparison of the partial amino-acid sequence of artocarpin with the known sequence of jacalin indicates close to 50% sequence identity. Artocarpin crystallizes in two forms, both monoclinic P2(1), with one and two tetramic molecules, respectively, in the asymmetric units of form I (a = 69.9, b = 73.7, c = 60.6 Angstrom and beta = 95.1 degrees) and form II (a = 87.6, b = 72.2, c = 92.6 Angstrom and beta = 101.1 degrees). Both the crystal structures have been solved by the molecular replacement method using the known structure of jacalin as the search model and ope of them partially refined, confirming that the two lectins are indeed homologous.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Elucidation of the detailed structural features and sequence requirements for iv helices of various lengths could be very important in understanding secondary structure formation in proteins and, hence. in the protein folding mechanism. An algorithm to characterize the geometry of an alpha helix from its C-alpha coordinates has been developed and used to analyze the structures of long cu helices (number of residues greater than or equal to 25) found in globular proteins, the crystal structure coordinates of which are available from the Brookhaven Protein Data Bank, Ail long a helices can be unambiguously characterized as belonging to one of three classes: linear, curved, or kinked, with a majority being curved. Analysis of the sequences of these helices reveals that the long alpha helices have unique sequence characteristics that distinguish them from the short alpha helices in globular proteins, The distribution and statistical propensities of individual amino acids to occur in long alpha heices are different from those found in short alpha helices, with amino acids having longer side chains and/or having a greater number of functional groups occurring more frequently in these helices, The sequences of the long alpha helices can be correlated with their gross structural features, i.e., whether they are curved, linear, or kinked, and in case of the curved helices, with their curvature.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The genomic sequences of several RNA plant viruses including cucumber mosaic virus, brome mosaic virus, alfalfa mosaic virus and tobacco mosaic virus have become available recently. The former two viruses are icosahedral while the latter two are bullet and rod shaped, respectively in particle morphology. The non-structural 3a proteins of cucumber mosaic virus and brome mosaic virus have an amino acid sequence homology of 35% and hence are evolutionarily related. In contrast, the coat proteins exhibit little homology, although the circular dichroism spectrum of these viruses are similar. The non-coding regions of the genome also exhibit variable but extensive homology. Comparison of the brome mosaic virus and alfalfa mosaic virus sequences reveals that they are probably related although with a much larger evolutionary distance. The polypeptide folds of the coat protein of three biologically distinct isometric plant viruses, tomato bushy stunt virus, southern bean mosaic virus and satellite tobacco necrosis virus have been shown to display a striking resemblance. All of them consist of a topologically similar 8-standard β-barrel. The implications of these studies to the understanding of the evolution of plant viruses will be discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The authors report an in vivo human examination of carotid atheroma by using the inversion-recovery ON resonance (IRON) sequence, which is able to produce positive contrast after the infusion of an ultrasmall super paramagnetic iron oxide (USPIO) contrast medium. This technique provides a method of potentially identifying inflammatory burden within carotid atheroma. This may be particularly useful in patients who currently do not meet criteria for intervention (ie, moderate symptomatic stenosis or <70% asymptomatic stenosis) to further risk-stratify this important patient cohort. A 63-year-old man was imaged at 1.5 T before and 36 hours after USPIO infusion by using the IRON sequence. Regions of interest showing profound signal loss at T2*-weighted imaging corresponded well with regions of positive contrast at IRON imaging after the administration of USPIO. These regions also showed a profound decrease in T2* measurements after USPIO infusion, whereas surrounding tissue did not. It has been shown that such strong signal loss on T2*-weighted images after USPIO infusion is indicative of USPIO uptake.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This study investigates the use of unsupervised features derived from word embedding approaches and novel sequence representation approaches for improving clinical information extraction systems. Our results corroborate previous findings that indicate that the use of word embeddings significantly improve the effectiveness of concept extraction models; however, we further determine the influence that the corpora used to generate such features have. We also demonstrate the promise of sequence-based unsupervised features for further improving concept extraction.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes the design and implementation of a high-level query language called Generalized Query-By-Rule (GQBR) which supports retrieval, insertion, deletion and update operations. This language, based on the formalism of database logic, enables the users to access each database in a distributed heterogeneous environment, without having to learn all the different data manipulation languages. The compiler has been implemented on a DEC 1090 system in Pascal.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This paper describes the design and implementation of ADAMIS (‘A database for medical information systems’). ADAMIS is a relational database management system for a general hospital environment. Apart from the usual database (DB) facilities of data definition and data manipulation, ADAMIS supports a query language called the ‘simplified medical query language’ (SMQL) which is completely end-user oriented and highly non-procedural. Other features of ADAMIS include provision of facilities for statistics collection and report generation. ADAMIS also provides adequate security and integrity features and has been designed mainly for use on interactive terminals.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Database management systems offer a very reliable and attractive data organization for fast and economical information storage and processing for diverse applications. It is much more important that the information should be easily accessible to users with varied backgrounds, professional as well as casual, through a suitable data sublanguage. The language adopted here (APPLE) is one such language for relational database systems and is completely nonprocedural and well suited to users with minimum or no programming background. This is supported by an access path model which permits the user to formulate completely nonprocedural queries expressed solely in terms of attribute names. The data description language (DDL) and data manipulation language (DML) features of APPLE are also discussed. The underlying relational database has been implemented with the help of the DATATRIEVE-11 utility for record and domain definition which is available on the PDP-11/35. The package is coded in Pascal and MACRO-11. Further, most of the limitations of the DATATRIEVE-11 utility have been eliminated in the interface package.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

This research examined the influence of tectonic activity on submarine sedimentation processes, through a deposit-based analysis of turbidites in outcrop. A comprehensive field study of the Miocene Whakataki Formation yielded significant data that was analysed using methods of process-sedimentology, stratigraphy, and ichnology. Signatures of the tectonically active depositional environment were identifiable at very high resolution, from grain composition and texture to trace-fossil assemblages, as well as on a broader-scale in stratigraphic stacking patterns and structural deformation. From these results and environmental interpretations, an original facies characterisation and conceptual depositional model have been established.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

In this paper the main features of ARDBID (A Relational Database for Interactive Design) have been described. An overview of the organization of the database has been presented and a detailed description of the data definition and manipulation languages has been given. These have been implemented on a DEC 1090 system.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

To identify genes involved in papaya fruit ripening, a total of 1171 expressed sequence tags (ESTs) were generated from randomly selected clones of two independent fruit cDNA libraries derived from yellow and red-fleshed fruit varieties. The most abundant sequences encoded: chitinase, 1-aminocyclopropane-1-carboxylic acid (ACC) oxidase, catalase and methionine synthase, respectively. DNA sequence comparisons identified ESTs with significant similarity to genes associated with fruit softening, aroma and colour biosynthesis. Putative cell wall hydrolases, cell membrane hydrolases, and ethylene synthesis and regulation sequences were identified with predicted roles in fruit softening. Expressed papaya genes associated with fruit aroma included isoprenoid biosynthesis and shikimic acid pathway genes and proteins associated with acyl lipid catabolism. Putative fruit colour genes were identified due to their similarity with carotenoid and chlorophyll biosynthesis genes from other plant species.