59 results for SEQUENCING DATA


Relevance:

30.00% 30.00%

Publisher:

Abstract:

Clozapine (CLO), an atypical antipsychotic, depends mainly on cytochrome P450 1A2 (CYP1A2) for its metabolic clearance. Four patients treated with CLO, who were smokers, were nonresponders and had low plasma levels while receiving usual doses. Their plasma levels to dose ratios of CLO (median; range, 0.34; 0.22 to 0.40 ng x day/mL x mg) were significantly lower than ratios calculated from another study with 29 patients (0.75; 0.22 to 2.83 ng x day/mL x mg; P < 0.01). These patients were confirmed as being CYP1A2 ultrarapid metabolizers by the caffeine phenotyping test (median systemic caffeine plasma clearance; range, 3.85; 3.33 to 4.17 mL/min/kg) when compared with previous studies (0.3 to 3.33 mL/min/kg). The sequencing of the entire CYP1A2 gene from genomic DNA of these patients suggests that the -164C > A mutation (CYP1A2*1F) in intron 1, which confers a high inducibility of CYP1A2 in smokers, is the most likely explanation for their ultrarapid CYP1A2 activity. A marked (2 patients) or a moderate (2 patients) improvement of the clinical state of the patients occurred after the increase of CLO blood levels above the therapeutic threshold by the increase of CLO doses to very high values (ie, up to 1400 mg/d) or by the introduction of fluvoxamine, a potent CYP1A2 inhibitor, at low dosage (50 to 100 mg/d). Due to the high frequency of smokers among patients with schizophrenia and to the high frequency of the -164C > A polymorphism, CYP1A2 genotyping could have important clinical implications for the treatment of patients with CLO.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Sequencing of pools of individuals (Pool-Seq) represents a reliable and cost-effective approach for estimating genome-wide SNP and transposable element insertion frequencies. However, Pool-Seq does not provide direct information on haplotypes so that, for example, obtaining inversion frequencies has not been possible until now. Here, we have developed a new set of diagnostic marker SNPs for seven cosmopolitan inversions in Drosophila melanogaster that can be used to infer inversion frequencies from Pool-Seq data. We applied our novel marker set to Pool-Seq data from an experimental evolution study and from North American and Australian latitudinal clines. In the experimental evolution data, we find evidence that positive selection has driven the frequencies of In(3R)C and In(3R)Mo to increase over time. In the clinal data, we confirm the existence of frequency clines for In(2L)t, In(3L)P and In(3R)Payne in both North America and Australia and detect a previously unknown latitudinal cline for In(3R)Mo in North America. The inversion markers developed here provide a versatile and robust tool for characterizing inversion frequencies and their dynamics in Pool-Seq data from diverse D. melanogaster populations.
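The idea of reading an inversion frequency off diagnostic marker SNPs can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name `inversion_frequency`, the input layout (read counts per marker SNP), and the use of the median across markers are all assumptions made for the example.

```python
from statistics import median

def inversion_frequency(marker_counts):
    """Estimate an inversion frequency from Pool-Seq read counts.

    marker_counts: list of (inversion_allele_reads, total_reads) tuples,
    one per diagnostic marker SNP (hypothetical input layout).
    Markers without coverage are skipped; the median across markers is
    an assumed summary, chosen here for robustness to outlier markers.
    """
    freqs = [alt / total for alt, total in marker_counts if total > 0]
    if not freqs:
        raise ValueError("no marker SNP has coverage")
    return median(freqs)

# Example: three covered markers and one marker with zero coverage.
example_counts = [(12, 60), (9, 50), (0, 0), (15, 60)]
estimate = inversion_frequency(example_counts)
```

Each covered marker gives its own allele-frequency estimate, so the spread across markers also gives a rough sense of how noisy the inversion estimate is.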

Relevance:

30.00% 30.00%

Publisher:

Abstract:

High-fidelity 'proofreading' polymerases are often used in library construction for next-generation sequencing projects, in an effort to minimize errors in the resulting sequence data. The increased template fidelity of these polymerases can come at the cost of reduced template specificity, and library preparation methods based on the AFLP technique may be particularly susceptible. Here, we compare AFLP profiles generated with standard Taq and two versions of a high-fidelity polymerase. We find that Taq produces fewer and brighter peaks than high-fidelity polymerase, suggesting that Taq performs better at selectively amplifying templates that exactly match the primer sequences. Because the higher accuracy of proofreading polymerases remains important for sequencing applications, we suggest that it may be more effective to use alternative library preparation methods.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Aquatic nymphs of mayflies (Ephemeroptera) colonize all types of freshwaters throughout the world and are extensively used as bio-indicators of water quality. Rhithrogena (Heptageniidae) is the second most species-rich genus of mayflies, and several European species have restricted distributions in sensitive Alpine environments and therefore are of conservation interest. The European Rhithrogena species are arranged into "species groups" that are easily identifiable. However, despite their ecological and conservation importance, ambiguous morphological differences among many species suggest that the current taxonomy may not accurately reflect their evolutionary diversity. Moreover, no information about their relationships, origin, timing of speciation and mechanisms promoting their successful diversification in the Alps is available. We first examined the species status of ca. 50% of European Rhithrogena diversity using a widespread sampling scheme of Alpine species that included 22 type localities, and a general mixed Yule-coalescent (GMYC) model analysis of one standard mitochondrial (cox1) and one newly developed nuclear marker. We observed significant clustering of cox1 into 31 GMYC species, and our results strongly suggest the presence of both cryptic diversity and taxonomic oversplitting in Rhithrogena. Phylogenetic analyses recovered four of the six recognized species groups in our samples as monophyletic. 
The DNA taxonomy developed here lays the groundwork for a future revision of this important but cryptic genus in Europe. Then we conducted a species-level, multiple-gene phylogenetic study of European Rhithrogena. Data from three nuclear and two mitochondrial loci were broadly congruent, and species-level relationships were well resolved within most species groups in a combined analysis. In the absence of external calibration points like fossils, we applied a standard insect molecular clock hypothesis to our mitochondrial data, suggesting an origin of Alpine Rhithrogena at the Oligocene/Miocene boundary. Our results highlighted the preponderant role that Quaternary glaciations played in their diversification, promoting speciation of at least half of the current diversity in the Alps. Madagascar's biodiversity and endemism are among the most extraordinary and endangered in the world. This includes the island's freshwater biodiversity, although detailed knowledge of the diversity, endemism, and biogeographic origin of freshwater invertebrates is lacking. Many mayfly species are thought to be restricted to single river basins (microendemic species) in forested areas, making them particularly sensitive to habitat reduction and degradation. The Heptageniidae are practically unknown in Madagascar except for two described species, Afronurus matitensis and Compsoneuria josettae. Both genera have a disjunct distribution in Africa, Madagascar and Southeast Asia, and a complex taxonomic status still in flux. The standard approach to understanding their diversity, endemism, and origin would require extensive field sampling on several continents and years of taxonomic work. Here we circumvent this using museum collections and freshly collected individuals in a combined approach of DNA taxonomy and phylogeny. The cox1-based GMYC analysis revealed 14 putative species on Madagascar, 70% of which are potentially microendemic. 
A phylogenetic analysis that included African and Asian species and data from two mitochondrial and four nuclear loci indicated the Malagasy Heptageniidae are monophyletic and sister to African Compsoneuria. The observed monophyly and high microendemism highlight their conservation importance. Our results also underline the important role that museum collections can play in molecular studies, especially in critically endangered biodiversity hotspots like Madagascar.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Many eukaryote organisms are polyploid. However, despite their importance, evolutionary inference of polyploid origins and modes of inheritance has been limited by a need for analyses of allele segregation at multiple loci using crosses. The increasing availability of sequence data for nonmodel species now allows the application of established approaches for the analysis of genomic data in polyploids. Here, we ask whether approximate Bayesian computation (ABC), applied to realistic traditional and next-generation sequence data, allows correct inference of the evolutionary and demographic history of polyploids. Using simulations, we evaluate the robustness of evolutionary inference by ABC for tetraploid species as a function of the number of individuals and loci sampled, and the presence or absence of an outgroup. We find that ABC adequately retrieves the recent evolutionary history of polyploid species on the basis of both old and new sequencing technologies. The application of ABC to sequence data from diploid and polyploid species of the plant genus Capsella confirms its utility. Our analysis strongly supports an allopolyploid origin of C. bursa-pastoris about 80 000 years ago. This conclusion runs contrary to previous findings based on the same data set but using an alternative approach and is in agreement with recent findings based on whole-genome sequencing. Our results indicate that ABC is a promising and powerful method for revealing the evolution of polyploid species, without the need to attribute alleles to a homeologous chromosome pair. The approach can readily be extended to more complex scenarios involving higher ploidy levels.
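The core of approximate Bayesian computation is a rejection loop: draw parameters from a prior, simulate data, and keep the draws whose simulated summary statistic lies close to the observed one. The sketch below shows that loop on a deliberately simple toy problem (estimating a success probability from a count), not on the polyploid demographic models of the study. All names (`abc_rejection`, `toy_prior`, `toy_simulate`, `toy_distance`) and the tolerance are assumptions made for illustration.

```python
import random

def abc_rejection(observed, simulate, prior, distance, eps, n_draws, rng):
    """Generic ABC rejection sampler.

    Returns the parameter draws whose simulated summary statistic lies
    within `eps` of the observed one; these approximate the posterior.
    """
    accepted = []
    for _ in range(n_draws):
        theta = prior(rng)                 # draw a candidate parameter
        simulated = simulate(theta, rng)   # simulate a summary statistic
        if distance(simulated, observed) <= eps:
            accepted.append(theta)
    return accepted

# Toy model (hypothetical): number of successes in 50 Bernoulli trials.
def toy_prior(rng):
    return rng.random()                    # uniform prior on [0, 1]

def toy_simulate(theta, rng):
    return sum(rng.random() < theta for _ in range(50))

def toy_distance(a, b):
    return abs(a - b)

rng = random.Random(0)                     # fixed seed for repeatability
posterior = abc_rejection(30, toy_simulate, toy_prior, toy_distance,
                          eps=1, n_draws=5000, rng=rng)
posterior_mean = sum(posterior) / len(posterior)
```

In a real application the simulator would generate sequence data under competing demographic scenarios and the distance would compare vectors of summary statistics; the acceptance logic stays the same.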

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Massively parallel signature sequencing (MPSS) generates millions of short sequence tags corresponding to transcripts from a single RNA preparation. Most MPSS tags can be unambiguously assigned to genes, thereby generating a comprehensive expression profile of the tissue of origin. From the comparison of MPSS data from 32 normal human tissues, we identified 1,056 genes that are predominantly expressed in the testis. Further evaluation by using MPSS tags from cancer cell lines and EST data from a wide variety of tumors identified 202 of these genes as candidates for encoding cancer/testis (CT) antigens. Of these genes, the expression in normal tissues was assessed by RT-PCR in a subset of 166 intron-containing genes, and those with confirmed testis-predominant expression were further evaluated for their expression in 21 cancer cell lines. Thus, 20 CT or CT-like genes were identified, with several exhibiting expression in five or more of the cancer cell lines examined. One of these genes is a member of a CT gene family that we designated as CT45. The CT45 family comprises six highly similar (>98% cDNA identity) genes that are clustered in tandem within a 125-kb region on Xq26.3. CT45 was found to be frequently expressed in both cancer cell lines and lung cancer specimens. Thus, MPSS analysis has resulted in a significant extension of our knowledge of CT antigens, leading to the discovery of a distinctive X-linked CT-antigen gene family.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Automated genome sequencing and annotation, together with large-scale gene expression measurement methods, generate a massive amount of data for model organisms. Searching for gene-specific or organism-specific information throughout all the different databases has become a very difficult task and often results in fragmented and unrelated answers. A database that federates and integrates genomic and transcriptomic data would greatly improve search speed as well as the quality of the results by allowing a direct comparison of expression results obtained by different techniques. The main goal of this project, called the CleanEx database, is thus to provide access to public gene expression data via unique gene names and to represent heterogeneous expression data produced by different technologies in a way that facilitates joint analysis and cross-dataset comparisons. A consistent and up-to-date gene nomenclature is achieved by associating each single gene expression experiment with a permanent target identifier consisting of a physical description of the targeted RNA population or the hybridization reagent used. 
These targets are then mapped at regular intervals to the growing and evolving catalogues of genes from model organisms, such as human and mouse. The completely automatic mapping procedure relies partly on external genome information resources such as UniGene and RefSeq. The central part of CleanEx is a weekly built gene index containing cross-references to all public expression data already incorporated into the system. In addition, the expression target database of CleanEx provides gene mapping and quality control information for various types of experimental resources, such as cDNA clones or Affymetrix probe sets. The Affymetrix mapping files are accessible as text files, for further use in external applications, and as individual entries, via the web-based interfaces. The CleanEx web-based query interfaces offer access to individual entries via text string searches or quantitative expression criteria, as well as cross-dataset analysis tools and cross-chip gene comparison. These tools have proven to be very efficient in expression data comparison and even, to a certain extent, in detection of differentially expressed splice variants. The CleanEx flat files and tools are available online at: http://www.cleanex.isb-sib.ch/.

Relevance:

30.00% 30.00%

Publisher:

Abstract:

To permit the tracking of turbulent flow structures in an Eulerian frame from single-point measurements, we make use of a generalization of conventional two-dimensional quadrant analysis to three-dimensional octants. We characterize flow structures using the sequences of these octants and show how significance may be attached to particular sequences using statistical null models. We analyze an example experiment and show how a particular dominant flow structure can be identified from the conditional probability of octant sequences. The frequency of this structure corresponds to the dominant peak in the velocity spectra and exerts a high proportion of the total shear stress. We link this structure explicitly to the propensity for sediment entrainment and show that greater insight into sediment entrainment can be obtained by disaggregating those octants that occur within the identified macroturbulence structure from those that do not. Hence, this work goes beyond critiques of Reynolds stress approaches to bed load entrainment that highlight the importance of outward interactions, to identifying and prioritizing the quadrants/octants that define particular flow structures. Key Points: (1) A new method for analysing single-point velocity data is presented. (2) Flow structures are identified by a sequence of flow states (termed octants). (3) The identified structure exerts high stresses and causes bed-load entrainment.
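Octant analysis as described above can be sketched in a few lines: each sample of the three velocity fluctuations is assigned to one of eight octants by its sign pattern, consecutive repeats are collapsed into a sequence of flow states, and conditional transition probabilities are estimated from that sequence. The octant numbering (0 to 7), the function names, and the treatment of zero fluctuations as positive are assumptions for this example; the paper's own labelling convention and null models are not reproduced here.

```python
from collections import Counter

def octant(u, v, w):
    """Label a (u', v', w') fluctuation triple 0..7 by its sign pattern.

    Hypothetical numbering: bit 4 = u' >= 0, bit 2 = v' >= 0, bit 1 = w' >= 0.
    """
    return 4 * (u >= 0) + 2 * (v >= 0) + (w >= 0)

def octant_sequence(us, vs, ws):
    """Subtract the means, classify each sample, and collapse repeats."""
    n = len(us)
    mu, mv, mw = sum(us) / n, sum(vs) / n, sum(ws) / n
    labels = [octant(u - mu, v - mv, w - mw) for u, v, w in zip(us, vs, ws)]
    seq = []
    for lab in labels:
        if not seq or seq[-1] != lab:
            seq.append(lab)
    return seq

def transition_probabilities(seq):
    """Estimate P(next octant | current octant) from a state sequence."""
    pairs = Counter(zip(seq, seq[1:]))
    starts = Counter(seq[:-1])
    return {(a, b): c / starts[a] for (a, b), c in pairs.items()}
```

Comparing these estimated probabilities with those from a shuffled or otherwise randomized sequence is one simple way to judge whether a particular octant sequence occurs more often than chance would suggest.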

Relevance:

30.00% 30.00%

Publisher:

Abstract:

Next-generation sequencing (NGS) technologies have become the standard for data generation in studies of population genomics, such as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimations are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing of 930 of the 1092 1000G samples and use this as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of reference allele frequency for the 1000G data, indicating mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using NGS data at other genomic regions of high diversity.
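The two benchmark quantities mentioned above, the fraction of mismatched genotype calls and the error in the reference allele frequency, can be computed as in the sketch below. The data layout (dictionaries keyed by sample and site, genotypes as pairs of alleles) and the function names are assumptions for illustration; the toy values are invented and are not taken from the study.

```python
def genotype_error_rate(calls, truth):
    """Fraction of genotype calls that disagree with the gold standard.

    calls, truth: dicts mapping (sample, site) -> pair of alleles
    (hypothetical layout). Allele order within a genotype is ignored,
    and only (sample, site) keys present in both sets are compared.
    """
    shared = calls.keys() & truth.keys()
    if not shared:
        raise ValueError("no overlapping genotype calls to compare")
    wrong = sum(1 for key in shared
                if sorted(calls[key]) != sorted(truth[key]))
    return wrong / len(shared)

def ref_allele_frequency(genotypes, ref):
    """Frequency of the reference allele among all alleles at one site."""
    alleles = [allele for genotype in genotypes for allele in genotype]
    return alleles.count(ref) / len(alleles)

# Invented toy data: two samples at one site, reference allele "A".
ngs_calls = {("s1", 1): ("A", "G"), ("s2", 1): ("A", "A")}
sanger = {("s1", 1): ("G", "A"), ("s2", 1): ("A", "G")}

error_rate = genotype_error_rate(ngs_calls, sanger)
freq_error = (ref_allele_frequency(ngs_calls.values(), "A")
              - ref_allele_frequency(sanger.values(), "A"))
```

A positive `freq_error` means the NGS calls overestimate the reference allele frequency, which is the direction of bias the abstract attributes to mapping bias.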

Relevance:

20.00% 20.00%

Publisher:

Relevance:

20.00% 20.00%

Publisher:

Abstract:

OBJECTIVES: This study aimed at investigating whether data from medical teleconsultations may contribute to influenza surveillance. METHODS: International Classification of Primary Care 2nd Edition (ICPC-2) codes were used to analyse the proportion of teleconsultations due to influenza-related symptoms. Results were compared with the weekly Swiss Sentinel reports. RESULTS: When using the ICPC-2 code for fever we could reproduce the seasonal influenza peaks of the winter seasons 07/08, 08/09 and 09/10 as depicted by the Sentinel data. For the pandemic influenza 09/10, we detected a much higher first peak in summer 2009 which correlated with a potential underreporting in the Sentinel system. CONCLUSIONS: ICPC-2 data from medical teleconsultations allows influenza surveillance in real time and correlates very well with the Swiss Sentinel system.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

This letter describes a data telemetry biomedical experiment. An implant, consisting of a biometric data sensor, electronics, an antenna, and a biocompatible capsule, is described. All the elements were co-designed in order to maximize the transmission distance. The device was implanted in a pig for an in vivo experiment of temperature monitoring.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

Dramatic improvements in DNA sequencing technologies have led to a more than 1,000-fold reduction in sequencing costs over the past five years. Genome-wide research approaches can thus now be applied beyond medically relevant questions to examine the molecular-genetic basis of behavior, development and unique life histories in almost any organism. A first step for an emerging model organism is usually establishing a reference genome sequence. I offer insight gained from the fire ant genome project. First, I detail how the project came to be and how sequencing, assembly and annotation strategies were chosen. Subsequently, I describe some of the issues linked to working with data from recently sequenced genomes. Finally, I discuss an approach undertaken in a follow-up project based on the fire ant genome sequence.

Relevance:

20.00% 20.00%

Publisher:

Abstract:

To make full use of research data, the bioscience community needs to adopt technologies and reward mechanisms that support interoperability and promote the growth of an open 'data commoning' culture. Here we describe the prerequisites for data commoning and present an established and growing ecosystem of solutions using the shared 'Investigation-Study-Assay' framework to support that vision.