972 results for Single-nucleotide-polymorphism
Abstract:
OBJECTIVES: To evaluate the influence of genetic polymorphisms on the susceptibility to Candida colonization and intra-abdominal candidiasis, a blood culture-negative life-threatening infection in high-risk surgical ICU patients. DESIGN: Prospective observational cohort study. SETTING: Surgical ICUs from two University hospitals of the Fungal Infection Network of Switzerland. PATIENTS: Eighty-nine patients at high risk for intra-abdominal candidiasis (68 with recurrent gastrointestinal perforation and 21 with acute necrotizing pancreatitis). MEASUREMENTS AND MAIN RESULTS: Eighteen single-nucleotide polymorphisms in 16 genes previously associated with development of fungal infections were analyzed from patients' DNA by using an Illumina Veracode genotyping platform. Candida colonization was defined by recovery of Candida species from at least one nonsterile site by twice weekly monitoring of cultures from oropharynx, stools, urine, skin, and/or respiratory tract. A corrected colonization index greater than or equal to 0.4 defined "heavy" colonization. Intra-abdominal candidiasis was defined by the presence of clinical symptoms and signs of peritonitis or intra-abdominal abscess and isolation of Candida species either in pure or mixed culture from intraoperatively collected abdominal samples. Single-nucleotide polymorphisms in three innate immune genes were associated with development of a Candida corrected colonization index greater than or equal to 0.4 (Toll-like receptor rs4986790, hazard ratio = 3.39; 95% CI, 1.45-7.93; p = 0.005) or occurrence of intra-abdominal candidiasis (tumor necrosis factor-α rs1800629, hazard ratio = 4.31; 95% CI, 1.85-10.1; p = 0.0007; β-defensin 1 rs1800972, hazard ratio = 3.21; 95% CI, 1.36-7.59; p = 0.008).
CONCLUSION: We report a strong association between the promoter rs1800629 single-nucleotide polymorphism in tumor necrosis factor-α and an increased susceptibility to intra-abdominal candidiasis in a homogeneous prospective cohort of high-risk surgical ICU patients. This finding highlights the relevance of the tumor necrosis factor-α functional polymorphism in immune response to fungal pathogens. Immunogenetic profiling in patients at clinical high risk followed by targeted antifungal interventions may improve the prevention or preemptive management of this life-threatening infection.
Abstract:
We present a Bayesian approach for estimating the relative frequencies of multi-single nucleotide polymorphism (SNP) haplotypes in populations of the malaria parasite Plasmodium falciparum by using microarray SNP data from human blood samples. Each sample comes from a malaria patient and contains one or several parasite clones that may genetically differ. Samples containing multiple parasite clones with different genetic markers pose a special challenge. The situation is comparable with a polyploid organism. The data from each blood sample indicates whether the parasites in the blood carry a mutant or a wildtype allele at various selected genomic positions. If both mutant and wildtype alleles are detected at a given position in a multiply infected sample, the data indicates the presence of both alleles, but the ratio is unknown. Thus, the data only partially reveals which specific combinations of genetic markers (i.e. haplotypes across the examined SNPs) occur in distinct parasite clones. In addition, SNP data may contain errors at non-negligible rates. We use a multinomial mixture model with partially missing observations to represent this data and a Markov chain Monte Carlo method to estimate the haplotype frequencies in a population. Our approach addresses both challenges, multiple infections and data errors.
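The ambiguity created by multiply infected samples can be illustrated with a small enumeration. The sketch below is not the multinomial mixture model or the Markov chain Monte Carlo sampler described in the abstract; it only lists which sets of haplotypes are consistent with a single, error-free sample. The per-SNP encoding ("W" for wildtype only, "M" for mutant only, "B" for both alleles detected), the function names, and the `max_clones` cap are assumptions made for illustration.

```python
from itertools import combinations, product

# Hypothetical encoding of one sample's per-SNP observation:
#   "W" = only wildtype detected, "M" = only mutant detected, "B" = both detected.
# Alleles are written as "0" (wildtype) and "1" (mutant).
ALLOWED = {"W": "0", "M": "1", "B": "01"}


def candidate_haplotypes(observation):
    """Return every haplotype whose allele at each SNP was actually observed."""
    return ["".join(h) for h in product(*(ALLOWED[s] for s in observation))]


def compatible_sets(observation, max_clones=2):
    """Return the sets of distinct haplotypes that reproduce the observation exactly.

    A set is compatible when, at every SNP, the alleles carried by its members
    are precisely the alleles that were detected (genotyping errors are ignored).
    """
    candidates = candidate_haplotypes(observation)
    result = []
    for k in range(1, max_clones + 1):
        for subset in combinations(candidates, k):
            if all(
                {h[i] for h in subset} == set(ALLOWED[s])
                for i, s in enumerate(observation)
            ):
                result.append(subset)
    return result


if __name__ == "__main__":
    # Both alleles detected at both SNPs: the phase cannot be resolved.
    print(compatible_sets("BB"))
    # Unambiguous single-clone observation.
    print(compatible_sets("WM"))
```

With at most two clones, a sample showing both alleles at two SNPs is consistent with either the pair 00 and 11 or the pair 01 and 10, which is why such samples only partially reveal the haplotypes present and why a probabilistic model over many samples is needed.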
Abstract:
The prevalence of hypertension in African Americans (AAs) is higher than in other US groups, yet few genome-wide association studies (GWASs) have been performed in AAs. Among people of European descent, GWASs have identified genetic variants at 13 loci that are associated with blood pressure. It is unknown whether these variants confer susceptibility in people of African ancestry. Here, we examined genome-wide and candidate gene associations with systolic blood pressure (SBP) and diastolic blood pressure (DBP) using the Candidate Gene Association Resource (CARe) consortium consisting of 8591 AAs. Genotypes included genome-wide single-nucleotide polymorphism (SNP) data utilizing the Affymetrix 6.0 array with imputation to 2.5 million HapMap SNPs and candidate gene SNP data utilizing a 50K cardiovascular gene-centric array (ITMAT-Broad-CARe [IBC] array). For Affymetrix data, the strongest signal for DBP was rs10474346 (P = 3.6 × 10(-8)) located near GPR98 and ARRDC3. For SBP, the strongest signal was rs2258119 in C21orf91 (P = 4.7 × 10(-8)). The top IBC association for SBP was rs2012318 (P = 6.4 × 10(-6)) near SLC25A42 and for DBP was rs2523586 (P = 1.3 × 10(-6)) near HLA-B. None of the top variants replicated in additional AA (n = 11 882) or European-American (n = 69 899) cohorts. We replicated previously reported European-American blood pressure SNPs in our AA samples (SH2B3, P = 0.009; TBX3-TBX5, P = 0.03; and CSK-ULK3, P = 0.0004). These genetic loci represent the best evidence of genetic influences on SBP and DBP in AAs to date. More broadly, this work supports the notion that blood pressure among AAs is a trait with genetic underpinnings but also with significant complexity.
Abstract:
OBJECTIVES: Toll-like receptors (TLRs) are innate immune sensors that are integral to resisting chronic and opportunistic infections. Mounting evidence implicates TLR polymorphisms in susceptibilities to various infectious diseases, including HIV-1. We investigated the impact of TLR single nucleotide polymorphisms (SNPs) on clinical outcome in a seroincident cohort of HIV-1-infected volunteers. DESIGN: We analyzed TLR SNPs in 201 antiretroviral treatment-naive HIV-1-infected volunteers from a longitudinal seroincident cohort with regular follow-up intervals (median follow-up 4.2 years, interquartile range 4.4). Participants were stratified into two groups according to either disease progression, defined as peripheral blood CD4(+) T-cell decline over time, or peak and setpoint viral load. METHODS: Haplotype tagging SNPs from TLR2, TLR3, TLR4, and TLR9 were detected by mass array genotyping, and CD4(+) T-cell counts and viral load measurements were determined prior to antiretroviral therapy initiation. The association of TLR haplotypes with viral load and rapid progression was assessed by multivariate regression models using age and sex as covariates. RESULTS: Two TLR4 SNPs in strong linkage disequilibrium [1063 A/G (D299G) and 1363 C/T (T399I)] were more frequent among individuals with high peak viral load compared with low/moderate peak viral load (odds ratio 6.65, 95% confidence interval 2.19-20.46, P < 0.001; adjusted P = 0.002 for 1063 A/G). In addition, a TLR9 SNP previously associated with slow progression was found less frequently among individuals with high viral setpoint compared with low/moderate setpoint (odds ratio 0.29, 95% confidence interval 0.13-0.65, P = 0.003, adjusted P = 0.04). CONCLUSION: This study suggests a potentially new role for TLR4 polymorphisms in HIV-1 peak viral load and confirms a role for TLR9 polymorphisms in disease progression.
Abstract:
PURPOSE. Knowledge of genetic factors predisposing to age-related cataract is very limited. The aim of this study was to identify DNA sequences that either lead to or predispose to this disease. METHODS. The candidate gene SLC16A12, which encodes a solute carrier of the monocarboxylate transporter family, was sequenced in 484 patients with cataract (134 with juvenile cataract, 350 with age-related cataract) and 190 control subjects. Expression studies included luciferase reporter assay and RT-PCR experiments. RESULTS. One patient with age-related cataract showed a novel heterozygous mutation (c.-17A>G) in the 5' untranslated region (5'UTR). This mutation is in cis with the minor G-allele of the single nucleotide polymorphism (SNP) rs3740030 (c.-42T/G), also within the 5'UTR. Using a luciferase reporter assay system, a construct with the patient's haplotype caused a significant upregulation of luciferase activity. In comparison, the SNP G-allele alone promoted less activity, but that amount was still significantly higher than that of the common T-allele. Analysis of SLC16A12 transcripts in surrogate tissue demonstrated striking allele-specific differences causing 5'UTR heterogeneity with respect to sequence and quantity. These differences in gene expression were mirrored in an allele-specific predisposition to age-related cataract, as determined in a Swiss population (odds ratio approximately 2.2; confidence interval, 1.23-4.3). CONCLUSIONS. The monocarboxylate transporter SLC16A12 may contribute to age-related cataract. Sequences within the 5'UTR modulate translational efficiency with pathogenic consequences.
Abstract:
There is evidence across several species for genetic control of phenotypic variation of complex traits, such that the variance among phenotypes is genotype dependent. Understanding genetic control of variability is important in evolutionary biology, agricultural selection programmes and human medicine, yet for complex traits, no individual genetic variants associated with variance, as opposed to the mean, have been identified. Here we perform a meta-analysis of genome-wide association studies of phenotypic variation using ∼170,000 samples on height and body mass index (BMI) in human populations. We report evidence that the single nucleotide polymorphism (SNP) rs7202116 at the FTO gene locus, which is known to be associated with obesity (as measured by mean BMI for each rs7202116 genotype), is also associated with phenotypic variability. We show that the results are not due to scale effects or other artefacts, and find no other experiment-wise significant evidence for effects on variability, either at loci other than FTO for BMI or at any locus for height. The difference in variance for BMI among individuals with opposite homozygous genotypes at the FTO locus is approximately 7%, corresponding to a difference of ∼0.5 kilograms in the standard deviation of weight. Our results indicate that genetic variants can be discovered that are associated with variability, and that between-person variability in obesity can partly be explained by the genotype at the FTO locus. The results are consistent with reported FTO by environment interactions for BMI, possibly mediated by DNA methylation. Our BMI results for other SNPs and our height results for all SNPs suggest that most genetic variants, including those that influence mean height or mean BMI, are not associated with phenotypic variance, or that their effects on variability are too small to detect even with sample sizes greater than 100,000.
Abstract:
Summary: The CD4 molecule plays a key role in AIDS pathogenesis: it is required for entry of the virus into permissive cells, and its subsequent down-modulation from the cell surface is a hallmark of HIV-1-infected cells. The virus encodes no fewer than three proteins that participate in this process: Nef, Vpu and Env. The Vpu protein interacts with CD4 within the endoplasmic reticulum of infected cells, where it targets CD4 for degradation through interaction with a cellular protein named ß-TrCP1. This F-box protein functions as the substrate recognition subunit of the SCF(ß-TrCP) E3 ubiquitin ligase, which normally induces the ubiquitination and subsequent degradation of various proteins such as ß-catenin and IκBα. Mammals possess a homologue of ß-TrCP1, HOS, also named ß-TrCP2, which has a cytoplasmic subcellular distribution. Structural analysis of the ligand-binding domain of both homologues shows striking surface similarities. Both F-box proteins have a redundant role in a number of cellular processes; however, the potential role of ß-TrCP2 in HIV-1-infected cells has not been evaluated. In the present study, we assessed the existence of genetic variants of BTRC, encoding ß-TrCP1, and evaluated whether these variants would affect CD4 down-modulation. Additionally, we determined whether ß-TrCP2 shares with its homologue structural and functional properties that would allow it to bind Vpu, modulate CD4 expression, and thus participate in HIV-1 pathogenesis. We identified a single nucleotide polymorphism present in the human population with an allelic frequency of 0.03 that leads to the substitution of alanine 507 by a serine. However, we showed by transient transfection in HeLa CD4+ cells that this variant behaves as ß-TrCP1 with respect to CD4 down-modulation. We established transient expression systems in HeLa CD4+ cells to test whether ß-TrCP2 is implicated in Vpu-mediated CD4 down-modulation.
We show by coimmunoprecipitation experiments that ß-TrCP2 binds Vpu and is able to induce CD4 down-modulation as efficiently as ß-TrCP1. In two different cell lines, HeLa CD4+ and Jurkat, Vpu-mediated CD4 down-modulation could not be completely reversed through the silencing of endogenous ß-TrCP1 or ß-TrCP2 individually, but required both genes to be silenced simultaneously. We evaluated the role of ß-TrCP1 and ß-TrCP2 in the HIV-1 life cycle using silencing prior to actual viral infection. Both ß-TrCP1 and ß-TrCP2 contributed to CD4 down-modulation during a one-cycle viral infection in Ghost cells. In addition, the combined silencing of both homologues in the absence of env and nef reversed CD4 down-modulation, showing that ß-TrCP1 and ß-TrCP2 represent the main and additive effectors of HIV-1-encoded Vpu. In addition, we showed that silencing of ß-TrCP1 but not ß-TrCP2 induced a decrease of HIV-1 LTR-driven expression. In a transient transfection system with Tat and an LTR luciferase reporter, both homologues modulated LTR-driven expression. The present study revealed that ß-TrCP2 represents a novel protein participating in the HIV-1 life cycle, and a complete understanding of the complex interplay between the two F-box proteins will improve our understanding of HIV-1 infection. Résumé (translated from the French): The CD4 molecule plays a key role in the pathogenesis of AIDS; it is required for entry of the virus into permissive cells, and the decrease of its concentration at the cell surface is an important characteristic of HIV-1-infected cells. The virus encodes no fewer than three proteins that participate in this process: Nef, Vpu and Env. The Vpu protein binds CD4 in the endoplasmic reticulum and induces its degradation by interacting with a cellular protein named ß-TrCP1. This F-box protein is a subunit of the SCF(ß-TrCP) E3 ubiquitin ligase complex.
It enables substrate recognition by the complex, which induces the ubiquitination and subsequent degradation of various cellular proteins such as ß-catenin or IκBα. Mammals possess a homologue of ß-TrCP1 called ß-TrCP2 (or HOS). Comparative analysis of the substrate-recognition domain of the two homologues shows striking similarities. The role of ß-TrCP2 in the HIV-1 viral cycle has not yet been evaluated. In this study, we searched for genetic variants of BTRC (encoding ß-TrCP1) and evaluated whether these variants could affect the virus-induced degradation of CD4 molecules. We thus identified a polymorphism present in the human population with an allelic frequency of 0.03 that consists of a substitution of alanine 507 by a serine. However, we showed by transfection in HeLa CD4+ cells that this variant behaves like ß-TrCP1 with respect to CD4 modulation. In addition, we determined whether ß-TrCP2 shares with its homologue structural and functional properties that would allow it to bind Vpu, modulate the concentration of CD4 and thus take part in AIDS pathogenesis. To this end, we established a transient expression system in HeLa CD4+ cells. By co-immunoprecipitation, we showed that ß-TrCP2 binds Vpu and is able to induce CD4 degradation as efficiently as ß-TrCP1. In two different cell lines, HeLa CD4+ and Jurkat, CD4 degradation could not be completely inhibited by individual silencing of ß-TrCP1 or ß-TrCP2, but required simultaneous silencing of both genes. We evaluated the role of the two homologues in the HIV-1 viral cycle by infecting Ghost cells with the virus after silencing the two proteins.
We thus showed that ß-TrCP1 and ß-TrCP2 contribute additively to the CD4 degradation induced by HIV-1 infection. Combined silencing of the two homologues completely inhibited this degradation in the absence of env and nef, proving that no other pathway participates in this process. Furthermore, we showed that silencing of ß-TrCP1 but not of ß-TrCP2 induced a decrease in viral expression under the control of the LTR. We were, however, unable to reconstitute this effect by expressing Tat and a reporter gene under the control of the LTR in HeLa CD4+ cells. The present work reveals that ß-TrCP2 is a novel protein participating in the HIV-1 viral cycle. A complete understanding of the effect of each of the two homologues on the viral cycle will improve our understanding of HIV-1 infection.
Abstract:
Hypertension is an important determinant of cardiovascular morbidity and mortality and has a substantial heritability, which is likely of polygenic origin. The aim of this study was to assess to what extent multiple common genetic variants contribute to blood pressure regulation in both adults and children and to assess overlap in variants between different age groups, using genome-wide profiling. Single nucleotide polymorphism sets were defined based on a meta-analysis of genome-wide association studies on systolic blood pressure and diastolic blood pressure performed by the Cohorts for Heart and Aging Research in Genomic Epidemiology (n=29 136), using different P value thresholds for selecting single nucleotide polymorphisms. Subsequently, genetic risk scores for systolic blood pressure and diastolic blood pressure were calculated in an independent adult population (n=2072) and a child population (n=1034). The explained variance of the genetic risk scores was evaluated using linear regression models, including sex, age, and body mass index. Genetic risk scores that also included many single nucleotide polymorphisms below genome-wide significance explained more of the variance than scores based only on highly significant single nucleotide polymorphisms, in both adults and children. Genetic risk scores significantly explained up to 1.2% (P=9.6*10(-8)) of the variance in adult systolic blood pressure and 0.8% (P=0.004) in children. For diastolic blood pressure, the variance explained was similar in adults and children (1.7% [P=8.9*10(-10)] and 1.4% [P=3.3*10(-5)], respectively). These findings suggest the presence of many genetic loci with small effects on blood pressure regulation both in adults and children, indicating also a (partly) common polygenic regulation of blood pressure throughout different periods of life.
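The genetic risk score procedure summarised above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the study's pipeline: the SNP identifiers, effect sizes, and P values below are invented, the score is a plain weighted sum of allele dosages, and the explained variance is the R² of a one-predictor regression, whereas the abstract describes models that also include sex, age, and body mass index.

```python
def select_snps(summary_stats, p_threshold):
    """Keep SNPs whose discovery P value is at or below the chosen threshold.

    summary_stats maps a SNP id to a (effect_size, p_value) pair.
    """
    return {snp: beta for snp, (beta, p) in summary_stats.items() if p <= p_threshold}


def genetic_risk_score(dosages, weights):
    """Weighted sum of effect-allele dosages (0, 1 or 2) over the selected SNPs."""
    return sum(weights[snp] * dosages.get(snp, 0) for snp in weights)


def r_squared(x, y):
    """Share of the variance in y explained by a simple linear regression on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return (sxy * sxy) / (sxx * syy)


if __name__ == "__main__":
    # Hypothetical discovery results: SNP id -> (effect per allele, P value).
    stats = {"snpA": (0.5, 1e-9), "snpB": (0.25, 0.003), "snpC": (1.0, 0.4)}
    # A lenient threshold keeps snpA and snpB; a strict one keeps only snpA.
    lenient = select_snps(stats, p_threshold=0.01)
    strict = select_snps(stats, p_threshold=5e-8)
    person = {"snpA": 2, "snpB": 1, "snpC": 0}
    print(genetic_risk_score(person, lenient), genetic_risk_score(person, strict))
```

Repeating the score calculation for several thresholds and comparing the resulting `r_squared` values in an independent sample mirrors the comparison the abstract reports between lenient and strict SNP sets.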
Abstract:
Motivation: Genome-wide association studies have become widely used tools to study effects of genetic variants on complex diseases. While it is of great interest to extend existing analysis methods by considering interaction effects between pairs of loci, the large number of possible tests presents a significant computational challenge. The number of computations is further multiplied in the study of gene expression quantitative trait mapping, in which tests are performed for thousands of gene phenotypes simultaneously. Results: We present FastEpistasis, an efficient parallel solution extending the PLINK epistasis module, designed to test for epistasis effects when analyzing continuous phenotypes. Our results show that the algorithm scales with the number of processors and offers a reduction in computation time when several phenotypes are analyzed simultaneously. FastEpistasis is capable of testing the association of a continuous trait with all single nucleotide polymorphism (SNP) pairs from 500 000 SNPs, totaling 125 billion tests, in a population of 5000 individuals in 29, 4 or 0.5 days using 8, 64 or 512 processors, respectively.
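The figures quoted in this abstract can be checked with simple arithmetic. The Python sketch below only reproduces the pair count and compares the reported run times; it does not call FastEpistasis or PLINK, and the function names are made up for illustration.

```python
def n_snp_pairs(n_snps):
    """Number of unordered SNP pairs, n * (n - 1) / 2."""
    return n_snps * (n_snps - 1) // 2


def relative_efficiency(base_days, base_procs, days, procs):
    """Processor-days of the baseline run divided by processor-days of the larger run.

    A value of 1.0 would mean perfectly linear scaling from the baseline.
    """
    return (base_days * base_procs) / (days * procs)


if __name__ == "__main__":
    # 500 000 SNPs give just under 125 billion pairwise tests.
    print(n_snp_pairs(500_000))
    # Reported timings: 29 days on 8, 4 days on 64, 0.5 days on 512 processors.
    print(relative_efficiency(29, 8, 4, 64))
    print(relative_efficiency(29, 8, 0.5, 512))
```

The pair count is 124,999,750,000, which rounds to the 125 billion tests stated above. Taken at face value, the rounded timings correspond to about 91% of linear scaling when going from 8 to 64 or 512 processors.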
Abstract:
Urotensin-II controls ion/water homeostasis in fish and vascular tone in rodents. We hypothesised that common genetic variants in urotensin-II pathway genes are associated with human blood pressure or renal function. We performed family-based analysis of association between blood pressure, glomerular filtration and genes of the urotensin-II pathway (urotensin-II, urotensin-II related peptide, urotensin-II receptor) saturated with 28 tagging single nucleotide polymorphisms in 2024 individuals from 520 families; followed by an independent replication in 420 families and 7545 unrelated subjects. The expression studies of the urotensin-II pathway were carried out in 97 human kidneys. Phylogenetic evolutionary analysis was conducted in 17 vertebrate species. One single nucleotide polymorphism (rs531485 in urotensin-II gene) was associated with adjusted estimated glomerular filtration rate in the discovery cohort (p = 0.0005). It showed no association with estimated glomerular filtration rate in the combined replication resource of 8724 subjects from 6 populations. Expression of urotensin-II and its receptor showed strong linear correlation (r = 0.86, p<0.0001). There was no difference in renal expression of urotensin-II system between hypertensive and normotensive subjects. Evolutionary analysis revealed accumulation of mutations in urotensin-II since the divergence of primates and weaker conservation of urotensin-II receptor in primates than in lower vertebrates. Our data suggest that urotensin-II system genes are unlikely to play a major role in genetic control of human blood pressure or renal function. The signatures of evolutionary forces acting on urotensin-II system indicate that it may have evolved towards loss of function since the divergence of primates.
Abstract:
BACKGROUND & AIMS: Recent studies have described a major impact of genetic variations near the IL28B gene on the natural course and outcome of antiviral therapy in chronic hepatitis C. We therefore aimed to explore the impact of donor and recipient genotypes of these polymorphisms on hepatitis C virus (HCV) liver graft reinfection. METHODS: Donor and recipient genotypes of the IL28B rs12979860C>T single nucleotide polymorphism were determined in 91 patients with HCV liver graft reinfection, 47 of whom were treated with pegylated interferon-α (PEG-IFN-α) and ribavirin. IL28B genetic polymorphisms were correlated with the natural course and treatment outcome of recurrent hepatitis C. RESULTS: Patients requiring liver transplantation due to end-stage chronic hepatitis C appeared to be selected toward the adverse genotypes rs12979860 CT/TT compared to non-transplanted HCV-infected patients (p=0.046). Patients with the donor genotype rs12979860 CC had higher peak ALT and HCV RNA serum concentrations than those with CT/TT (p=0.04 and 0.06, respectively). No association was observed between ALT/HCV RNA serum concentrations and recipient genotypes (p>0.3). More importantly, donor IL28B rs12979860 CC vs. CT/TT genotypes were associated with rapid, complete early, and sustained virologic response (RVR, cEVR, SVR) to treatment with PEG-IFN-α and ribavirin (p=0.003, 0.0012, 0.008, respectively), but weaker associations of recipient genotypes with RVR, cEVR, and SVR were observed as well (p=0.0046, 0.115, 0.118, respectively). CONCLUSIONS: We provide evidence for a dominant, but not exclusive, impact of the donor rather than the recipient IL28B genetic background on the natural course and treatment outcome of HCV liver graft reinfection.
Abstract:
BACKGROUND: There is an ever-increasing volume of data on host genes that are modulated during HIV infection, influence disease susceptibility or carry genetic variants that impact HIV infection. We created GuavaH (Genomic Utility for Association and Viral Analyses in HIV, http://www.GuavaH.org), a public resource that supports multipurpose analysis of genome-wide genetic variation and gene expression profiles across multiple phenotypes relevant to HIV biology. FINDINGS: We included original data from 8 genome and transcriptome studies addressing viral and host responses in vivo and ex vivo. These studies cover phenotypes such as HIV acquisition, plasma viral load, disease progression, viral replication cycle, latency and viral-host genome interaction. This represents genome-wide association data from more than 4,000 individuals, exome sequencing data from 392 individuals, in vivo transcriptome microarray data from 127 patients/conditions, and 60 sets of RNA-seq data. Additionally, GuavaH allows visualization of protein variation in ~8,000 individuals from the general population. The publicly available GuavaH framework supports queries on (i) unique single nucleotide polymorphisms across different HIV-related phenotypes, (ii) gene structure and variation, (iii) in vivo gene expression in the setting of human infection (CD4+ T cells), and (iv) in vitro gene expression data in models of permissive infection, latency and reactivation. CONCLUSIONS: The complexity of the analysis of host genetic influences on HIV biology and pathogenesis calls for comprehensive search engines built on curated data. The tool developed here allows queries and supports validation of the rapidly growing body of host genomic information pertinent to HIV research.
Abstract:
Gene-lifestyle interactions have been suggested to contribute to the development of type 2 diabetes. Glucose levels 2 h after a standard 75-g glucose challenge are used to diagnose diabetes and are associated with both genetic and lifestyle factors. However, whether these factors interact to determine 2-h glucose levels is unknown. We meta-analyzed single nucleotide polymorphism (SNP) × BMI and SNP × physical activity (PA) interaction regression models for five SNPs previously associated with 2-h glucose levels from up to 22 studies comprising 54,884 individuals without diabetes. PA levels were dichotomized, with individuals below the first quintile classified as inactive (20%) and the remainder as active (80%). BMI was considered a continuous trait. Inactive individuals had higher 2-h glucose levels than active individuals (β = 0.22 mmol/L [95% CI 0.13-0.31], P = 1.63 × 10(-6)). All SNPs were associated with 2-h glucose (β = 0.06-0.12 mmol/allele, P ≤ 1.53 × 10(-7)), but no significant interactions were found with PA (P > 0.18) or BMI (P ≥ 0.04). In this large study of gene-lifestyle interaction, we observed no interactions between genetic and lifestyle factors, both of which were associated with 2-h glucose. It is perhaps unlikely that top loci from genome-wide association studies will exhibit strong subgroup-specific effects, and may not, therefore, make the best candidates for the study of interactions.
Abstract:
Abstract: Although the genomes from any two human individuals are more than 99.99% identical at the sequence level, some structural variation can be observed. Differences between genomes include single nucleotide polymorphisms (SNPs), inversions and copy number changes (gain or loss of DNA). The latter can range from submicroscopic events (CNVs, at least 1 kb in size) to complete chromosomal aneuploidies. Small copy number variations often have no (lethal) consequences for the cell, but a few have been associated with disease susceptibility and phenotypic variation. Larger rearrangements (e.g. complete chromosome gain) are frequently associated with more severe consequences for health, such as genomic disorders and cancer. High-throughput technologies like DNA microarrays enable the detection of CNVs in a genome-wide fashion. Since the initial catalogue of CNVs in the human genome in 2006, there has been tremendous interest in CNVs in the context of both population and medical genetics. Understanding CNV patterns within and between human populations is essential to elucidate their possible contribution to disease. But genome analysis is a challenging task; the technology evolves rapidly, creating a need for novel, efficient and robust analytical tools that must be compared with existing ones. Also, while the link between CNV and disease has been established, the relative CNV contribution is not fully understood, and the predisposition to disease from CNVs of the general population has not yet been investigated. During my PhD thesis, I worked on several aspects related to CNVs. As I will report in chapter 3, I was interested in computational methods to detect CNVs from the general population. I had access to the CoLaus dataset, a population-based study with more than 6,000 participants from the Lausanne area. All these individuals were analysed on SNP arrays and extensive clinical information was available.
My work explored existing CNV detection methods and I developed a variety of metrics to compare their performance. Since these methods were not producing entirely satisfactory results, I implemented my own method, which outperformed two existing methods. I also devised strategies to combine CNVs from different individuals into CNV regions. I was also interested in the clinical impact of CNVs in common disease (chapter 4). Through an international collaboration led by the Centre Hospitalier Universitaire Vaudois (CHUV) and Imperial College London, I was involved as a main data analyst in the investigation of a rare deletion at chromosome 16p11 detected in obese patients. Specifically, we compared 8,456 obese patients and 11,856 individuals from the general population and found that the deletion accounted for 0.7% of the morbid obesity cases and was absent in healthy non-obese controls. This highlights the importance of rare variants with strong impact and provides new insights into the design of clinical studies to identify the missing heritability in common disease. Furthermore, I was interested in the detection of somatic copy number alterations (SCNA) and their consequences in cancer (chapter 5). This project was a collaboration initiated by the Ludwig Institute for Cancer Research and involved other groups from the Swiss Institute of Bioinformatics, the CHUV and the Universities of Lausanne and Geneva. The focus of my work was to identify genes with altered expression levels within SCNAs in seven metastatic melanoma cell lines, using CGH and SNP arrays, RNA-seq, and karyotyping. Very few SCNA genes were shared by even two melanoma samples, making it difficult to draw any conclusions at the individual gene level. To overcome this limitation, I used a network-guided analysis to determine whether any pathways, defined by amplified or deleted genes, were common among the samples.
Six of the melanoma samples were potentially altered in four pathways and five samples harboured copy-number and expression changes in components of six pathways. In total, this approach identified 28 pathways. Validation with two external, large melanoma datasets confirmed all but three of the detected pathways and demonstrated the utility of network-guided approaches for the analysis of both large and small datasets. Résumé (translated from the French): Although the genomes of two individuals are more than 99.99% similar, structural differences can be observed. These differences include single nucleotide polymorphisms, inversions and copy number changes (gain or loss of DNA). The latter range from small, so-called submicroscopic events (at least 1 kb in size), called CNVs (copy number variants), to larger events that can affect entire chromosomes. Small variations are generally without consequence for the cell; however, some have been implicated in predisposition to certain diseases and in phenotypic variation in the general population. Larger rearrangements (for example, an additional copy of a chromosome, commonly called trisomy) have more serious repercussions for health, as for example in certain genomic syndromes and in cancer. High-throughput technologies such as DNA microarrays allow the detection of CNVs at the scale of the human genome. The mapping of CNVs in the human genome in 2006 aroused strong interest in population genetics and medical genetics. Detecting differences within and between populations is a key element in elucidating the possible contribution of CNVs to disease. However, genome analysis remains a difficult task: the technology evolves very rapidly, creating new needs for the development of tools, the improvement of earlier ones, and the comparison of different methods.
Moreover, although the link between CNVs and disease has been established, their precise contribution is not yet understood. Likewise, studies of disease predisposition due to CNVs detected in the general population have not yet been carried out.

During my PhD, I focused on three main areas related to CNVs. In chapter 3, I detail my work on methods for analysing DNA microarrays. I had access to data from the CoLaus project, a study of the population of Lausanne. In this study, the genomes of more than 6,000 individuals were analysed with SNP arrays, and extensive clinical information was collected. During my work, I used and compared several CNV detection methods. Since the results were not completely satisfactory, I implemented my own method, which performs better than two of the three other methods used. I also examined strategies for combining CNVs from different individuals into regions.

I was also interested in the clinical impact of CNVs in common genetic diseases (chapter 4). This project was made possible by a close collaboration with the Centre Hospitalier Universitaire Vaudois (CHUV) and Imperial College London. In this project, I was one of the main analysts and worked on the clinical impact of a rare deletion at chromosome 16p11 present in patients with obesity. In this multidisciplinary collaboration, we compared 8,456 patients with obesity and 11,856 individuals from the general population. We found that the deletion was implicated in 0.7% of morbid obesity cases and was absent in healthy (non-obese) controls. Our study illustrates the importance of rare CNVs, which can have a very strong clinical impact.
Moreover, this suggests an alternative to association studies for improving our understanding of the aetiology of common genetic diseases.

I also worked on the detection of somatic copy number alterations (SCNAs) and their consequences for cancer (chapter 5). This project was a collaboration initiated by the Ludwig Institute for Cancer Research and involving the Swiss Institute of Bioinformatics, the CHUV and the Universities of Lausanne and Geneva. I focused on identifying genes affected by SCNAs and showing over- or under-expression in cell lines derived from metastatic melanomas. The data were generated with DNA microarrays (CGH and SNP) and high-throughput transcriptome sequencing. My research showed that few genes are recurrently affected across melanomas, which makes the results difficult to interpret. To circumvent these limitations, I used a network analysis to determine whether signalling pathways enriched in amplified or lost genes were common to the different samples. Among the 28 pathways detected, four are potentially deregulated in six melanomas, and six additional pathways are affected in five melanomas. Validation of these results with two large public datasets confirmed all but three of these pathways. This demonstrates the usefulness of the approach for analysing both small and large datasets.

Lay summary

The advent of molecular biology, particularly over the last ten years, has revolutionised research in medical genetics. Thanks to the availability of the human reference genome from 2001, new technologies such as DNA microarrays emerged and made it possible to study the genome as a whole at a so-called submicroscopic resolution, which had been impossible with traditional cytogenetic techniques.
One of the most important examples is the study of structural variation in the genome, in particular the study of gene copy number. It had been established as early as 1959, with the identification of trisomy 21 by Professor Jérôme Lejeune, that the gain of an extra chromosome was the cause of a genetic syndrome with serious repercussions for the patient's health. Similar observations were made in oncology on cancer cells, which frequently accumulate copy number aberrations (such as the loss or gain of one or more chromosomes). From 2004, several research groups catalogued copy number changes in individuals from the general population (that is, without visible clinical symptoms). In 2006, Dr. Richard Redon established the first map of copy number variation in the general population. These discoveries demonstrated that variations in the genome are frequent and that most of them are benign, that is, without clinical consequence for the individual's health. This generated great interest in understanding natural variation between individuals and in better understanding genetic predisposition to certain diseases.

During my thesis, I developed new computational tools for analysing DNA microarrays with the aim of mapping these variations at the genomic scale. I used these tools to chart the variations in the Swiss population, and I then devoted myself to studying factors that could explain predisposition to diseases such as obesity. This study, in collaboration with the Centre Hospitalier Universitaire Vaudois, led to the identification of a deletion on chromosome 16 explaining 0.7% of morbid obesity cases. This study has several implications.
First, it makes it possible to perform prenatal diagnosis in order to determine an unborn child's predisposition to obesity. Second, this locus involves about twenty genes. This allows new working hypotheses to be formulated and research to be directed towards improving our understanding of the disease, with the hope of discovering a new treatment. Finally, our study provides an alternative to genetic association studies, which have so far had only mixed success.

In the last part of my thesis, I turned to the analysis of copy number aberrations in cancer. I chose to study melanomas, which are involved in skin cancer. Melanoma is a very aggressive tumour; it is responsible for 80% of skin cancer deaths and is often resistant to the treatments used in oncology (chemotherapy, radiotherapy). As part of a collaboration between the Ludwig Institute for Cancer Research, the Swiss Institute of Bioinformatics, the CHUV and the Universities of Lausanne and Geneva, we sequenced the exome (the genes) and the transcriptome (gene expression) of seven metastatic melanomas and performed copy number analyses with DNA microarrays as well as karyotyping. My work led to the development of new analysis methods adapted to cancer, established the list of cell signalling pathways recurrently affected in melanoma, and identified two potential therapeutic targets previously overlooked in skin cancers.
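The abstract mentions combining CNVs from different individuals into CNV regions but does not describe the strategy used. As a minimal sketch only, assuming the simplest possible rule (calls on the same chromosome whose coordinates overlap are merged into one region), the idea could look like the following. The function name, the tuple layout and the example coordinates are hypothetical and are not taken from the thesis.

```python
from collections import defaultdict


def merge_cnvs_into_regions(calls):
    """Merge overlapping CNV calls into CNV regions.

    calls: iterable of (sample_id, chrom, start, end) tuples (hypothetical layout).
    Returns a list of dicts with keys chrom, start, end, samples.
    """
    # Group calls by chromosome so that only calls on the same chromosome merge.
    by_chrom = defaultdict(list)
    for sample_id, chrom, start, end in calls:
        by_chrom[chrom].append((start, end, sample_id))

    regions = []
    for chrom in sorted(by_chrom):
        current = None
        # Sweep the calls in order of start coordinate.
        for start, end, sample_id in sorted(by_chrom[chrom]):
            if current is not None and start <= current["end"]:
                # Overlaps the open region: extend it and record the carrier.
                current["end"] = max(current["end"], end)
                current["samples"].add(sample_id)
            else:
                # No overlap: open a new region.
                current = {"chrom": chrom, "start": start, "end": end,
                           "samples": {sample_id}}
                regions.append(current)
    return regions


# Illustrative calls only (made-up coordinates).
example_calls = [
    ("A", "chr16", 100, 200),
    ("B", "chr16", 150, 300),
    ("C", "chr16", 400, 500),
    ("A", "chr1", 10, 20),
]
for region in merge_cnvs_into_regions(example_calls):
    print(region["chrom"], region["start"], region["end"], sorted(region["samples"]))
```

A plain union like this can chain many partially overlapping calls into one very long region, so stricter criteria (for example a minimum reciprocal overlap) are often preferred in practice; which criterion the author used is not stated here.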
Resumo:
The objective of this work was to validate the association of single nucleotide polymorphism (SNP) molecular markers for the FAD3A, FAD3B and FAD3C genes with linolenic acid (18:3) content in soybean seeds and to analyse the influence of the genetic parameters of these markers on this trait. A total of 185 F2 progenies derived from the cross between A29 (mutant for the three FAD3 genes, 1% 18:3) and Tucunaré (wild-type genotype, 11% 18:3) were genotyped. The molecular markers for the FAD3A, FAD3B and FAD3C genes explained the variation in 18:3 content in the F2 and F2:3 segregating populations. In addition, allelic substitutions at the FAD3A locus produce greater variation in 18:3 content than substitutions at the other two loci.
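To make the notion of an allelic substitution effect concrete, here is a minimal sketch assuming a purely additive single-locus model: the trait is regressed on the number of mutant alleles carried (0, 1 or 2), the slope estimates the average change in 18:3 content per allele substituted, and R² gives the share of trait variance explained by that marker. This is not necessarily the analysis the authors performed, and the function name and the toy values are hypothetical.

```python
def allele_substitution_effect(allele_counts, phenotypes):
    """Least-squares slope and R^2 of phenotype on allele count (0, 1 or 2).

    Assumes an additive single-locus model; both arguments are equal-length
    sequences of numbers.
    """
    n = len(allele_counts)
    if n != len(phenotypes) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")

    mean_x = sum(allele_counts) / n
    mean_y = sum(phenotypes) / n
    # Sums of squares and cross-products around the means.
    sxx = sum((x - mean_x) ** 2 for x in allele_counts)
    syy = sum((y - mean_y) ** 2 for y in phenotypes)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(allele_counts, phenotypes))
    if sxx == 0 or syy == 0:
        raise ValueError("allele counts and phenotypes must both vary")

    slope = sxy / sxx                  # change in trait per mutant allele
    r_squared = sxy ** 2 / (sxx * syy)  # variance explained by the marker
    return slope, r_squared


# Made-up values for illustration: mutant-allele counts and 18:3 content (%).
toy_counts = [0, 1, 2, 0, 1, 2]
toy_content = [11.0, 9.0, 7.0, 10.0, 8.0, 6.0]
effect, explained = allele_substitution_effect(toy_counts, toy_content)
print(f"effect per allele: {effect:.2f}, variance explained: {explained:.2f}")
```

Running the same calculation separately for each of the three loci would give one way to compare their effects, although a joint model of all three markers would be needed to account for them simultaneously.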