999 results for SNP identification
Abstract:
Simulation-based assessment is a popular and frequently necessary approach to evaluation of statistical procedures. Sometimes overlooked is the ability to take advantage of underlying mathematical relations and we focus on this aspect. We show how to take advantage of large-sample theory when conducting a simulation using the analysis of genomic data as a motivating example. The approach uses convergence results to provide an approximation to smaller-sample results, results that are available only by simulation. We consider evaluating and comparing a variety of ranking-based methods for identifying the most highly associated SNPs in a genome-wide association study, derive integral equation representations of the pre-posterior distribution of percentiles produced by three ranking methods, and provide examples comparing performance. These results are of interest in their own right and set the framework for a more extensive set of comparisons.
Abstract:
Background: This paper describes SeqDoC, a simple, web-based tool to carry out direct comparison of ABI sequence chromatograms. This allows the rapid identification of single nucleotide polymorphisms (SNPs) and point mutations without the need to install or learn more complicated analysis software. Results: SeqDoC produces a subtracted trace showing differences between a reference and test chromatogram, and is optimised to emphasise those characteristic of single base changes. It automatically aligns sequences, and produces straightforward graphical output. The use of direct comparison of the sequence chromatograms means that artefacts introduced by automatic base-calling software are avoided. Homozygous and heterozygous substitutions and insertion/deletion events are all readily identified. SeqDoC successfully highlights nucleotide changes missed by the Staden package 'tracediff' program. Conclusion: SeqDoC is ideal for small-scale SNP identification, for identification of changes in random mutagenesis screens, and for verification of PCR amplification fidelity. Differences are highlighted, not interpreted, allowing the investigator to make the ultimate decision on the nature of the change.
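The idea of a subtracted trace can be illustrated with a minimal sketch. This is not SeqDoC's actual algorithm: it assumes the two chromatograms are already aligned, normalised, and of equal length, and the function names `difference_trace` and `flag_positions`, the dictionary layout, and the threshold are all hypothetical choices made for illustration.

```python
CHANNELS = "ACGT"


def difference_trace(reference, test):
    """Subtract the reference trace from the test trace, channel by channel.

    Both arguments are dicts mapping each base channel ("A", "C", "G", "T")
    to a list of intensities. The traces are assumed to be pre-aligned.
    """
    diff = {}
    for ch in CHANNELS:
        ref, tst = reference[ch], test[ch]
        if len(ref) != len(tst):
            raise ValueError("traces must have equal length (align them first)")
        diff[ch] = [t - r for r, t in zip(ref, tst)]
    return diff


def flag_positions(diff, threshold):
    """Return trace positions where any channel differs by at least `threshold`."""
    n = len(diff["A"])
    return [
        i for i in range(n)
        if any(abs(diff[ch][i]) >= threshold for ch in CHANNELS)
    ]


# Toy data (hypothetical): an A peak in the reference is replaced by a G peak.
reference = {"A": [0, 10, 0], "C": [0, 0, 0], "G": [0, 0, 0], "T": [0, 0, 0]}
test = {"A": [0, 0, 0], "C": [0, 0, 0], "G": [0, 10, 0], "T": [0, 0, 0]}

diff = difference_trace(reference, test)
flagged = flag_positions(diff, threshold=5)
```

In this toy case a substitution shows up as a negative deflection in one channel and a positive deflection in another at the same position, which is the kind of signature the abstract describes as characteristic of single base changes.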
Abstract:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Abstract:
Linkage disequilibrium (LD) is defined as the nonrandom association of alleles at two or more loci in a population and may be a useful tool in a diverse array of applications including disease gene mapping, elucidating the demographic history of populations, and testing hypotheses of human evolution. However, the successful application of LD-based approaches to pertinent genetic questions is hampered by a lack of understanding about the forces that mediate the genome-wide distribution of LD within and between human populations. Delineating the genomic patterns of LD is a complex task that will require interdisciplinary research that transcends traditional scientific boundaries. The research presented in this dissertation is predicated upon the need for interdisciplinary studies, and both theoretical and experimental projects were pursued. In the theoretical studies, I investigated the effect of genotyping errors and SNP identification strategies on estimates of LD. The primary importance of these two chapters is that they provide insights and guidance for the design of future empirical LD studies. Furthermore, I analyzed the allele frequency distribution of 26,530 single nucleotide polymorphisms (SNPs) in three populations and generated the first-generation natural selection map of the human genome, which will be an important resource for explaining and understanding genomic patterns of LD. Finally, in the experimental study, I describe a novel, simple, low-cost, and high-throughput SNP genotyping method. The theoretical analyses and experimental tools developed in this dissertation will facilitate a more complete understanding of patterns of LD in human populations.
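As a brief illustration of the definition of LD given in this abstract, the standard two-locus measures are the coefficient D = p_AB - p_A * p_B and the squared correlation r^2 = D^2 / (p_A (1 - p_A) p_B (1 - p_B)). The sketch below computes both; the function name and the example frequencies are hypothetical and are not taken from the dissertation.

```python
def linkage_disequilibrium(p_ab, p_a, p_b):
    """Return (D, r_squared) for two biallelic loci.

    p_ab : frequency of the haplotype carrying allele A at locus 1
           and allele B at locus 2
    p_a  : frequency of allele A at locus 1
    p_b  : frequency of allele B at locus 2
    """
    d = p_ab - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    # r^2 is undefined when either locus is monomorphic.
    r_squared = d * d / denom if denom > 0 else float("nan")
    return d, r_squared


# Hypothetical frequencies: alleles A and B always occur together.
d_full, r2_full = linkage_disequilibrium(p_ab=0.5, p_a=0.5, p_b=0.5)

# Hypothetical frequencies: alleles associate at random (no LD).
d_none, r2_none = linkage_disequilibrium(p_ab=0.25, p_a=0.5, p_b=0.5)
```

With equal allele frequencies of 0.5, complete association gives D = 0.25 and r^2 = 1, while random association gives D = 0 and r^2 = 0.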
Abstract:
BACKGROUND: Genotypes obtained with commercial SNP arrays have been extensively used in many large case-control or population-based cohorts for SNP-based genome-wide association studies for a multitude of traits. Yet, these genotypes capture only a small fraction of the variance of the studied traits. Genomic structural variants (GSV) such as Copy Number Variation (CNV) may account for part of the missing heritability, but their comprehensive detection requires either next-generation arrays or sequencing. Sophisticated algorithms that infer CNVs by combining the intensities from SNP-probes for the two alleles can already be used to extract a partial view of such GSV from existing data sets. RESULTS: Here we present several advances to facilitate the latter approach. First, we introduce a novel CNV detection method based on a Gaussian Mixture Model. Second, we propose a new algorithm, PCA merge, for combining copy-number profiles from many individuals into consensus regions. We applied both our new methods as well as existing ones to data from 5612 individuals from the CoLaus study who were genotyped on Affymetrix 500K arrays. We developed a number of procedures in order to evaluate the performance of the different methods. These include comparison with previously published CNVs as well as the use of a replication sample of 239 individuals genotyped with Illumina 550K arrays. We also established a new evaluation procedure that exploits the fact that related individuals are expected to share their CNVs more frequently than randomly selected individuals. The ability to detect both rare and common CNVs provides a valuable resource that will facilitate association studies exploring potential phenotypic associations with CNVs.
CONCLUSION: Our new methodologies for CNV detection and their evaluation will help in extracting additional information from the large amount of SNP-genotyping data on various cohorts and in using it to explore structural variants and their impact on complex traits.
Abstract:
The thesis identifies CNV structural variants as possible markers for genomic selection and identifies QTL regions for fatty acid content in the Italian Brown Swiss population. Additionally, it maps the QTL for mastitis resistance in Valdostana Red Pied cattle.
Abstract:
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg., which is native to the Amazon rainforest, is the primary source of natural rubber. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular markers was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.
Abstract:
Candidaemia is the fourth most common cause of bloodstream infection, with a high mortality rate of up to 40%. Identification of host genetic factors that confer susceptibility to candidaemia may aid in designing adjunctive immunotherapeutic strategies. Here we hypothesize that variation in immune genes may predispose to candidaemia. We analyse 118,989 single-nucleotide polymorphisms (SNPs) across 186 loci known to be associated with immune-mediated diseases in the largest candidaemia cohort to date of 217 patients of European ancestry and a group of 11,920 controls. We validate the significant associations by comparison with a disease-matched control group. We observe significant association between candidaemia and SNPs in the CD58 (P = 1.97 × 10^-11; odds ratio (OR) = 4.68), LCE4A-C1orf68 (P = 1.98 × 10^-10; OR = 4.25) and TAGAP (P = 1.84 × 10^-8; OR = 2.96) loci. Individuals carrying two or more risk alleles have an increased risk for candidaemia of 19.4-fold compared with individuals carrying no risk allele. We identify three novel genetic risk factors for candidaemia, which we subsequently validate for their role in antifungal host defence.
Abstract:
Schwann cells synthesize a large amount of membrane that forms a specialized structure called myelin, which surrounds axons and facilitates the transmission of electrical signals along neurons in the peripheral nervous system (PNS). Previous studies demonstrated that both Schwann cell differentiation and de-differentiation (in the situation of a nerve injury or demyelinating disease) are regulated by cell-intrinsic regulators including several transcription factors. In particular, the de-differentiation of mature Schwann cells is driven by the activation of multiple negative regulators of myelination including Sox2, c-Jun, Notch and Pax3, all usually expressed in immature Schwann cells and suppressed at the onset of myelination. In order to identify new regulators of myelination involved in the development of the PNS, we analyzed gene-expression profiling data from developing PNS and from three models of demyelinating neuropathies. This analysis led to the identification of Sox4, a member of the Sox family of transcription factors, as a potential candidate. To characterize the molecular function of Sox4 in the PNS, we generated two transgenic lines of mice that overexpress Sox4 specifically in Schwann cells. Detailed analysis of these mice showed that the overexpression of Sox4 in Schwann cells causes a delay in the progression of myelination between post-natal day 2 (P2) and P5. Our in vitro analysis suggested that Sox4 cDNA can be overexpressed while the protein translation is tightly regulated. Interestingly, we observed that Sox4 protein is stabilized in nerves of the CMT4C mouse, a model of the human neuropathy. We therefore crossed Sox4 transgenic mice with CMT4C mice and observed that Sox4 overexpression exacerbated the neuropathy phenotype in these mice.
While recognized as being crucial for the normal function of both neurons and myelinating glial cells, the processes that regulate the beginning of myelination and the nature of the neuro-glial cross-talk remain mostly unknown. In order to gain insight into the molecular pathways involved in the interactions between neurons and associated glial cells, we developed a neuron-glia co-culture system based on microfluidic chambers and successfully induced myelination in this system with ascorbic acid. Importantly, we observed that in addition to acting on Schwann cells, ascorbic acid also modulates neuronal/axonal NRG1/ErbB2-B3 signalling. The experimental setting used in our study thus allowed us to discover a novel phenomenon of propagation of myelination in vitro. Further characterization of this event led us to identify other compounds able to induce myelination: the ADAM secretase inhibitor GM6001 and cyclic AMP. The results generated during my thesis project are therefore not only important for advancing our understanding of how the PNS works, but may also help to develop new therapies aimed at improving PNS myelination under disease conditions.
Abstract:
Background: A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high-throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results: Of the 7,867 Malus SNP markers on the array, 1,823 (23.2 %) were heterozygous in one of the two parents of the progeny, 1,007 (12.8 %) were heterozygous in both parental genotypes, whilst just 2.8 % of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7 % of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or misassignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions: We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites.
The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
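The percentages and the final marker density reported in this abstract can be reproduced from the stated counts. The short check below is only an arithmetic sketch: the variable names are illustrative, and it assumes the final density is the map length divided by all mapped loci (SNPs, SSRs and the S-locus) and that the 13.7 % figure is taken relative to the 2,272 mapped SNP markers.

```python
# Counts as stated in the abstract.
array_snps = 7867          # Malus SNP markers on the array
het_one_parent = 1823      # heterozygous in one parent only
het_both_parents = 1007    # heterozygous in both parents
mapped_snps = 2272
mapped_ssrs = 306
s_locus = 1
map_length_cm = 1282.2
conflicting = 311          # markers mapping away from their predicted position

pct_one_parent = 100 * het_one_parent / array_snps
pct_both_parents = 100 * het_both_parents / array_snps
pct_conflicting = 100 * conflicting / mapped_snps

# Assumption: density counts every mapped locus, not only the SNPs.
total_loci = mapped_snps + mapped_ssrs + s_locus
density_cm_per_marker = map_length_cm / total_loci

print(f"{pct_one_parent:.1f} % heterozygous in one parent")
print(f"{pct_both_parents:.1f} % heterozygous in both parents")
print(f"{pct_conflicting:.1f} % of mapped SNPs in conflicting positions")
print(f"{density_cm_per_marker:.1f} cM/marker over {total_loci} loci")
```

Under these assumptions the computed values round to the figures given in the abstract (23.2 %, 12.8 %, 13.7 % and 0.5 cM/marker).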
Abstract:
The pyrimidine glycosides, vicine and convicine, limit the use of faba bean (Vicia faba L.) as food and feed. A single recessive gene, vc-, is responsible for a lowered vicine–convicine concentration. The biosynthetic pathway of these closely related compounds is not known, and the nearest available markers are several cM away from vc-. Improved markers would assist breeding and help to identify candidate genes. A segregating population of 210 F5 recombinant inbred lines was developed from the cross of Mélodie/2 (low vicine–convicine) × ILB 938/2 (normal vicine–convicine), and vicine–convicine concentrations were determined twice on each line. The population was genotyped with a set of 188 SNPs. A strong, single QTL for vicine–convicine concentration was identified on chromosome I, flanked by markers 1.0 cM away on one side and 2.6 cM on the other. The interval defined by these markers in the model species Medicago truncatula includes about 340 genes, but no candidate genes were identified. Further fine mapping should lead to the identification of tightly linked markers as well as narrowing down the search for candidate regulatory or biosynthetic genes which could underlie the vc- locus.
Abstract:
Contents: The osteopontin gene may influence the fertility of water buffaloes because its product is a protein present in sperm. The aim of this work was to identify polymorphisms in this gene and associate them with fertility parameters of animals kept under extensive grazing. A total of 306 male buffaloes older than 18 months, from two farms, one in the state of Amapá and the other in the state of Pará, Brazil, were used in the study. Seven SNPs were identified in the regions studied. The polymorphisms were at gene positions 1478, 1513 and 1611 in the 5′ upstream region and at positions 6690, 6737, 6925 and 6952 in the amplified region of intron 5. The SNPs were tested for association with scrotal circumference, scrotal volume, sperm motility, sperm concentration and sperm pathology. There were significant SNPs (p < 0.05) for all the traits. SNP 6690 was significant for scrotal circumference, sperm concentration, sperm motility and sperm pathology, and SNP 6737 for scrotal volume. Genotype AA of SNP 6690 presented the highest averages for scrotal circumference, sperm concentration and motility and the lowest total number of sperm pathologies. For scrotal volume, the largest volumes were associated with genotype GG of SNP 6737. These results indicate that the osteopontin gene is relevant, as it seems to exert a substantial influence on the semen production traits of male buffaloes. © 2013 Blackwell Verlag GmbH.
Abstract:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Abstract:
This dissertation was written as part of a multicentre EU-funded project on the use of single nucleotide polymorphisms (SNPs) for individualising persons, in the context of assigning biological crime-scene traces or identifying unknown deceased persons. The overarching aim of the project was to establish and validate high-resolution genotyping methods that can analyse SNPs simultaneously in multiplex format with high accuracy but little effort. First, 29 Y-chromosomal and 52 autosomal SNPs were selected under the requirement that, as a multiplex, they offer the highest possible chance of individualisation. Both multiplex systems and the SNaPshot™ minisequencing method were then validated in systematic studies involving all working groups of the project. The validated reference method based on minisequencing served, on the one hand, for controlled collaboration between different laboratories and, on the other, as the basis for developing in this work an assay for SNP genotyping using electronic microarray technology. The independent main part of this dissertation describes, using the previously validated autosomal SNPs, the new development and validation of a hybridisation assay for the electronic microarray platform of the company Nanogen. For this purpose, three different assays that differ in their functional principle on the microarray were established beforehand. Of these, the Capture down assay was selected for further development on the basis of performance. After numerous optimisation measures concerning PCR product treatment, instrument-specific procedures and analysis-specific oligonucleotide designs, the Capture down assay was ready for the simultaneous typing of three individuals with 32 SNPs each on one microarray.
This procedure was then validated using 40 DNA samples with known genotypes for the 32 SNPs, and its accuracy was determined by parallel SNaPshot™ typing. The result not only demonstrates the suitability of the validated assay and of electronic microarray technology for certain applications, but also shows their advantages in terms of speed, flexibility and efficiency. The automation, which allows the spatial arrangement of the fragments under investigation immediately before analysis, reduces unnecessary working steps and thus the frequency of errors and the risk of contamination, while improving time efficiency. With a maximum accuracy of 94%, however, the reliability of the STR systems currently used in forensic genetics cannot yet be reached. The role of the new procedure will therefore lie not in replacing the established methods but in complementing them to solve special problems such as the analysis of highly degraded DNA traces.
Abstract:
The aim of this work was to identify markers associated with production traits in the pig genome using different approaches. We focused on the Italian Large White pig breed, using Genome Wide Association Studies (GWAS) and applying a selective genotyping approach to increase the power of the analyses. Furthermore, we searched the pig genome using Next Generation Sequencing (NGS) Ion Torrent Technology to combine the selective genotyping approach and deep sequencing for SNP discovery. Two other studies were carried out with a different approach. Allele frequency changes for SNPs affecting candidate genes and at the genome-wide level were analysed to identify selection signatures driven by the selection program during the last 20 years. This approach confirmed that a great number of markers may affect production traits and that they are captured by the classical selection programs. GWAS revealed 123 significant or suggestively significant SNPs associated with Back Fat Thickness and 229 associated with Average Daily Gain. Sixteen Copy Number Variant Regions were more frequent in lean or fat pigs and showed that different copies of those regions could have a limited impact on fat. These often appear to be involved in food intake and behavior, besides affecting genes involved in metabolic pathways and their expression. By combining NGS sequencing with the selective genotyping approach, new variants were discovered, and at least 54 are worth analysing in association studies. The study of groups of pigs that underwent stringent selection showed that the allele frequency of some loci can change drastically if they are close to traits that are of interest for selection schemes. These approaches could, in future, be integrated into genomic selection plans.