894 resultados para SEQUENCE DATABASES
Resumo:
This paper presents a process of mining research & development abstract databases to profile current status and to project potential developments for target technologies, The process is called "technology opportunities analysis." This article steps through the process using a sample data set of abstracts from the INSPEC database on the topic o "knowledge discovery and data mining." The paper offers a set of specific indicators suitable for mining such databases to understand innovation prospects. In illustrating the uses of such indicators, it offers some insights into the status of knowledge discovery research*.
Resumo:
A T(2) magnetization-preparation (T(2) Prep) sequence is proposed that is insensitive to B(1) field variations and simultaneously provides fat suppression without any further increase in specific absorption rate (SAR). Increased B(1) inhomogeneity at higher magnetic field strength (B(0) > or = 3T) necessitates a preparation sequence that is less sensitive to B(1) variations. For the proposed technique, T(2) weighting in the image is achieved using a segmented B(1)-insensitive rotation (BIR-4) adiabatic pulse by inserting two equally long delays, one after the initial reverse adiabatic half passage (AHP), and the other before the final AHP segment of a BIR-4 pulse. This sequence yields T(2) weighting with both B(1) and B(0) insensitivity. To simultaneously suppress fat signal (at the cost of B(0) insensitivity), the second delay is prolonged so that fat accumulates additional phase due to its chemical shift. Numerical simulations as well as phantom and in vivo image acquisitions were performed to show the efficacy of the proposed technique.
Resumo:
Although the molecular typing of Pseudomonas aeruginosa is important to understand the local epidemiology of this opportunistic pathogen, it remains challenging. Our aim was to develop a simple typing method based on the sequencing of two highly variable loci. Single-strand sequencing of three highly variable loci (ms172, ms217, and oprD) was performed on a collection of 282 isolates recovered between 1994 and 2007 (from patients and the environment). As expected, the resolution of each locus alone [number of types (NT) = 35-64; index of discrimination (ID) = 0.816-0.964] was lower than the combination of two loci (NT = 78-97; ID = 0.966-0.971). As each pairwise combination of loci gave similar results, we selected the most robust combination with ms172 [reverse; R] and ms217 [R] to constitute the double-locus sequence typing (DLST) scheme for P. aeruginosa. This combination gave: (i) a complete genotype for 276/282 isolates (typability of 98%), (ii) 86 different types, and (iii) an ID of 0.968. Analysis of multiple isolates from the same patients or taps showed that DLST genotypes are generally stable over a period of several months. The high typability, discriminatory power, and ease of use of the proposed DLST scheme makes it a method of choice for local epidemiological analyses of P. aeruginosa. Moreover, the possibility to give unambiguous definition of types allowed to develop an Internet database ( http://www.dlst.org ) accessible by all.
Resumo:
We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.
Resumo:
The malic enzyme (ME) gene is a target for both thyroid hormone receptors and peroxisome proliferator-activated receptors (PPAR). Within the ME promoter, two direct repeat (DR)-1-like elements, MEp and MEd, have been identified as putative PPAR response elements (PPRE). We demonstrate that only MEp and not MEd is able to bind PPAR/retinoid X receptor (RXR) heterodimers and mediate peroxisome proliferator signaling. Taking advantage of the close sequence resemblance of MEp and MEd, we have identified crucial determinants of a PPRE. Using reciprocal mutation analyses of these two elements, we show the preference for adenine as the spacing nucleotide between the two half-sites of the PPRE and demonstrate the importance of the two first bases flanking the core DR1 in 5'. This latter feature of the PPRE lead us to consider the polarity of the PPAR/RXR heterodimer bound to its cognate element. We demonstrate that, in contrast to the polarity of RXR/TR and RXR/RAR bound to DR4 and DR5 elements respectively, PPAR binds to the 5' extended half-site of the response element, while RXR occupies the 3' half-site. Consistent with this polarity is our finding that formation and binding of the PPAR/RXR heterodimer requires an intact hinge T region in RXR while its integrity is not required for binding of the RXR/TR heterodimer to a DR4.
Resumo:
EMBnet is a consortium of collaborating bioinformatics groups located mainly within Europe (http://www.embnet.org). Each member country is represented by a 'node', a group responsible for the maintenance of local services for their users (e.g. education, training, software, database distribution, technical support, helpdesk). Among these services a web portal with links and access to locally developed and maintained software is essential and different for each node. Our web portal targets biomedical scientists in Switzerland and elsewhere, offering them access to a collection of important sequence analysis tools mirrored from other sites or developed locally. We describe here the Swiss EMBnet node web site (http://www.ch.embnet.org), which presents a number of original services not available anywhere else.
Resumo:
With the dramatic increase in the volume of experimental results in every domain of life sciences, assembling pertinent data and combining information from different fields has become a challenge. Information is dispersed over numerous specialized databases and is presented in many different formats. Rapid access to experiment-based information about well-characterized proteins helps predict the function of uncharacterized proteins identified by large-scale sequencing. In this context, universal knowledgebases play essential roles in providing access to data from complementary types of experiments and serving as hubs with cross-references to many specialized databases. This review outlines how the value of experimental data is optimized by combining high-quality protein sequences with complementary experimental results, including information derived from protein 3D-structures, using as an example the UniProt knowledgebase (UniProtKB) and the tools and links provided on its website ( http://www.uniprot.org/ ). It also evokes precautions that are necessary for successful predictions and extrapolations.
Resumo:
Single amino acid substitution is the type of protein alteration most related to human diseases. Current studies seek primarily to distinguish neutral mutations from harmful ones. Very few methods offer an explanation of the final prediction result in terms of the probable structural or functional effect on the protein. In this study, we describe the use of three novel parameters to identify experimentally-verified critical residues of the TP53 protein (p53). The first two parameters make use of a surface clustering method to calculate the protein surface area of highly conserved regions or regions with high nonlocal atomic interaction energy (ANOLEA) score. These parameters help identify important functional regions on the surface of a protein. The last parameter involves the use of a new method for pseudobinding free-energy estimation to specifically probe the importance of residue side-chains to the stability of protein fold. A decision tree was designed to optimally combine these three parameters. The result was compared to the functional data stored in the International Agency for Research on Cancer (IARC) TP53 mutation database. The final prediction achieved a prediction accuracy of 70% and a Matthews correlation coefficient of 0.45. It also showed a high specificity of 91.8%. Mutations in the 85 correctly identified important residues represented 81.7% of the total mutations recorded in the database. In addition, the method was able to correctly assign a probable functional or structural role to the residues. Such information could be critical for the interpretation and prediction of the effect of missense mutations, as it not only provided the fundamental explanation of the observed effect, but also helped design the most appropriate laboratory experiment to verify the prediction results.
Resumo:
The estrogen-responsive element (ERE) present in the 5'-flanking region of the Xenopus laevis vitellogenin (vit) gene B1 has been characterized by transient expression analysis of chimeric vit-tk-CAT (chloramphenicol acetyltransferase) gene constructs transfected into the human estrogen-responsive MCF-7 cell line. The vit B1 ERE behaves like an inducible enhancer, since it is able to confer estrogen inducibility to the heterologous HSV thymidine kinase (tk) promoter in a relative position- and orientation-independent manner. In this assay, the minimal B1 ERE is 33 bp long and consists of two 13 bp imperfect palindromic elements both of which are required for the enhancer activity. A third imperfect palindromic element is present further upstream within the 5'-flanking region of the gene but is unable to confer hormone responsiveness by itself. Similarly, neither element forming the B1 ERE can alone confer estrogen inducibility to the tk promoter. However, in combinations of two, all three imperfect palindromes can act cooperatively to form a functional ERE. In contrast a single 13 bp perfect palindromic element, GGTCACTGTGACC, such as the one found upstream of the vit gene A2, is itself sufficient to act as a fully active ERE. Single point mutations within this element abolish estrogen inducibility, while a defined combination of two mutations converts this ERE into a glucocorticoid-responsive element.
Resumo:
In the liver of oviparous vertebrates vitellogenin gene expression is controlled by estrogen. The nucleotide sequence of the 5' flanking region of the Xenopus laevis vitellogenin genes A1, A2, B1 and B2 has been determined. These sequences have been compared to each other and to the equivalent region of the chicken vitellogenin II and apo-VLDLII genes which are also expressed in the liver in response to estrogen. The homology between the 5' flanking region of the Xenopus genes B1 and B2 is higher than between the corresponding regions of the other closely related genes A1 and A2. Four short blocks of sequence homology which are present at equivalent positions in the vitellogenin genes of both Xenopus laevis and chicken are characterized. A short sequence with two-fold rotational symmetry (GGTCANNNTGACC) was found at similar positions upstream of the five vitellogenin genes and is also present in two copies close to the 5' end of the chicken apo-VLDLII gene. The possible functional significance of this sequence, common to liver estrogen-responsive genes, is discussed.
Resumo:
We describe an improved multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) scheme for genotyping Staphylococcus aureus. We compare its performance to those of multilocus sequence typing (MLST) and spa typing in a survey of 309 strains. This collection includes 87 epidemic methicillin-resistant S. aureus (MRSA) strains of the Harmony collection, 75 clinical strains representing the major MLST clonal complexes (CCs) (50 methicillin-sensitive S. aureus [MSSA] and 25 MRSA), 135 nasal carriage strains (133 MSSA and 2 MRSA), and 13 published S. aureus genome sequences. The results show excellent concordance between the techniques' results and demonstrate that the discriminatory power of MLVA is higher than those of both MLST and spa typing. Two hundred forty-two genotypes are discriminated with 14 VNTR loci (diversity index, 0.9965; 95% confidence interval, 0.9947 to 0.9984). Using a cutoff value of 45%, 21 clusters are observed, corresponding to the CCs previously defined by MLST. The variability of the different tandem repeats allows epidemiological studies, as well as follow-up of the evolution of CCs and the identification of potential ancestors. The 14 loci can conveniently be analyzed in two steps, based upon a first-line simplified assay comprising a subset of 10 loci (panel 1) and a second subset of 4 loci (panel 2) that provides higher resolution when needed. In conclusion, the MLVA scheme proposed here, in combination with available on-line genotyping databases (including http://mlva.u-psud.fr/), multiplexing, and automatic sizing, can provide a basis for almost-real-time large-scale population monitoring of S. aureus.
Resumo:
In order to contribute to the debate about southern glacial refugia used by temperate species and more northern refugia used by boreal or cold-temperate species, we examined the phylogeography of a widespread snake species (Vipera berus) inhabiting Europe up to the Arctic Circle. The analysis of the mitochondrial DNA (mtDNA) sequence variation in 1043 bp of the cytochrome b gene and in 918 bp of the noncoding control region was performed with phylogenetic approaches. Our results suggest that both the duplicated control region and cytochrome b evolve at a similar rate in this species. Phylogenetic analysis showed that V. berus is divided into three major mitochondrial lineages, probably resulting from an Italian, a Balkan and a Northern (from France to Russia) refugial area in Eastern Europe, near the Carpathian Mountains. In addition, the Northern clade presents an important substructure, suggesting two sequential colonization events in Europe. First, the continent was colonized from the three main refugial areas mentioned above during the Lower-Mid Pleistocene. Second, recolonization of most of Europe most likely originated from several refugia located outside of the Mediterranean peninsulas (Carpathian region, east of the Carpathians, France and possibly Hungary) during the Mid-Late Pleistocene, while populations within the Italian and Balkan Peninsulas fluctuated only slightly in distribution range, with larger lowland populations during glacial times and with refugial mountain populations during interglacials, as in the present time. The phylogeographical structure revealed in our study suggests complex recolonization dynamics of the European continent by V. berus, characterized by latitudinal as well as altitudinal range shifts, driven by both climatic changes and competition with related species.
Resumo:
The goals of the human genome project did not include sequencing of the heterochromatic regions. We describe here an initial sequence of 1.1 Mb of the short arm of human chromosome 21 (HSA21p), estimated to be 10% of 21p. This region contains extensive euchromatic-like sequence and includes on average one transcript every 100 kb. These transcripts show multiple inter- and intrachromosomal copies, and extensive copy number and sequence variability. The sequencing of the "heterochromatic" regions of the human genome is likely to reveal many additional functional elements and provide important evolutionary information.
Resumo:
Diffusion magnetic resonance studies of the brain are typically performed using volume coils. Although in human brain this leads to a near optimal filling factor, studies of rodent brain must contend with the fact that only a fraction of the head volume can be ascribed to the brain. The use of surface coil as transceiver increases Signal-to-Noise Ratio (SNR), reduces radiofrequency power requirements and opens the possibility of parallel transmit schemes, likely to allow efficient acquisition schemes, of critical importance for reducing the long scan times implicated in diffusion tensor imaging. This study demonstrates the implementation of a semiadiabatic echo planar imaging sequence (echo time=40 ms, four interleaves) at 14.1T using a quadrature surface coil as transceiver. It resulted in artifact free images with excellent SNR throughout the brain. Diffusion tensor derived parameters obtained within the rat brain were in excellent agreement with reported values.