965 resultados para SNP arrays
Resumo:
This thesis presents a highly sensitive genome wide search method for recessive mutations. The method is suitable for distantly related samples that are divided into phenotype positives and negatives. High throughput genotype arrays are used to identify and compare homozygous regions between the cohorts. The method is demonstrated by comparing colorectal cancer patients against unaffected references. The objective is to find homozygous regions and alleles that are more common in cancer patients. We have designed and implemented software tools to automate the data analysis from genotypes to lists of candidate genes and to their properties. The programs have been designed in respect to a pipeline architecture that allows their integration to other programs such as biological databases and copy number analysis tools. The integration of the tools is crucial as the genome wide analysis of the cohort differences produces many candidate regions not related to the studied phenotype. CohortComparator is a genotype comparison tool that detects homozygous regions and compares their loci and allele constitutions between two sets of samples. The data is visualised in chromosome specific graphs illustrating the homozygous regions and alleles of each sample. The genomic regions that may harbour recessive mutations are emphasised with different colours and a scoring scheme is given for these regions. The detection of homozygous regions, cohort comparisons and result annotations are all subjected to presumptions many of which have been parameterized in our programs. The effect of these parameters and the suitable scope of the methods have been evaluated. Samples with different resolutions can be balanced with the genotype estimates of their haplotypes and they can be used within the same study.
Resumo:
Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of basepairs across the genome. Genome-wide association studies (GWAS) may simultaneously screen for copy number-phenotype and SNP-phenotype associations as part of the analytic strategy. However, genome-wide array analyses are particularly susceptible to batch effects as the logistics of preparing DNA and processing thousands of arrays often involves multiple laboratories and technicians, or changes over calendar time to the reagents and laboratory equipment. Failure to adjust for batch effects can lead to incorrect inference and requires inefficient post-hoc quality control procedures that exclude regions that are associated with batch. Our work extends previous model-based approaches for copy number estimation by explicitly modeling batch effects and using shrinkage to improve locus-specific estimates of copy number uncertainty. Key features of this approach include the use of diallelic genotype calls from experimental data to estimate batch- and locus-specific parameters of background and signal without the requirement of training data. We illustrate these ideas using a study of bipolar disease and a study of chromosome 21 trisomy. The former has batch effects that dominate much of the observed variation in quantile-normalized intensities, while the latter illustrates the robustness of our approach to datasets where as many as 25% of the samples have altered copy number. Locus-specific estimates of copy number can be plotted on the copy-number scale to investigate mosaicism and guide the choice of appropriate downstream approaches for smoothing the copy number as a function of physical position. The software is open source and implemented in the R package CRLMM available at Bioconductor (http:www.bioconductor.org).
Resumo:
Amplifications and deletions of chromosomal DNA, as well as copy-neutral loss of heterozygosity have been associated with diseases processes. High-throughput single nucleotide polymorphism (SNP) arrays are useful for making genome-wide estimates of copy number and genotype calls. Because neighboring SNPs in high throughput SNP arrays are likely to have dependent copy number and genotype due to the underlying haplotype structure and linkage disequilibrium, hidden Markov models (HMM) may be useful for improving genotype calls and copy number estimates that do not incorporate information from nearby SNPs. We improve previous approaches that utilize a HMM framework for inference in high throughput SNP arrays by integrating copy number, genotype calls, and the corresponding confidence scores when available. Using simulated data, we demonstrate how confidence scores control smoothing in a probabilistic framework. Software for fitting HMMs to SNP array data is available in the R package ICE.
Resumo:
High density SNP arrays can be used to identify DNA copy number changes in tumors such as homozygous deletions of tumor suppressor genes and focal amplifications of oncogenes. Illumina Human CNV370 Bead chip arrays were used to assess the genome for unbalanced chromosomal events occurring in 39 cell lines derived from stage III metastatic melanomas. A number of genes previously recognized to have an important role in the development and progression of melanoma were identified including homozygous deletions of CDKN2A (13 of 39 samples), CDKN2B (10 of 39), PTEN (3 of 39), PTPRD (3 of 39), TP53 (1 of 39), and amplifications of CCND1 (2 of 39), MITF (2 of 39), MDM2 (1 of 39), and NRAS (1 of 39). In addition, a number of focal homozygous deletions potentially targeting novel melanoma tumor suppressor genes were identified. Because of their likely functional significance for melanoma progression, FAS, CH25H, BMPR1A, ACTA2, and TFG were investigated in a larger cohort of melanomas through sequencing. Nonsynonymous mutations were identified in BMPR1A (1 of 43), ACTA2 (3 of 43), and TFG (5 of 103). A number of potentially important mutation events occurred in TFG including the identification of a mini mutation ‘‘hotspot’’ at amino acid residue 380 (P380S and P380L) and the presence of multiple mutations in two melanomas. Mutations in TFG may have important clinical relevance for current therapeutic strategies to treat metastatic melanoma.
Resumo:
Background: Sorghum genome mapping based on DNA markers began in the early 1990s and numerous genetic linkage maps of sorghum have been published in the last decade, based initially on RFLP markers with more recent maps including AFLPs and SSRs and very recently, Diversity Array Technology (DArT) markers. It is essential to integrate the rapidly growing body of genetic linkage data produced through DArT with the multiple genetic linkage maps for sorghum generated through other marker technologies. Here, we report on the colinearity of six independent sorghum component maps and on the integration of these component maps into a single reference resource that contains commonly utilized SSRs, AFLPs, and high-throughput DArT markers. Results: The six component maps were constructed using the MultiPoint software. The lengths of the resulting maps varied between 910 and 1528 cM. The order of the 498 markers that segregated in more than one population was highly consistent between the six individual mapping data sets. The framework consensus map was constructed using a "Neighbours" approach and contained 251 integrated bridge markers on the 10 sorghum chromosomes spanning 1355.4 cM with an average density of one marker every 5.4 cM, and were used for the projection of the remaining markers. In total, the sorghum consensus map consisted of a total of 1997 markers mapped to 2029 unique loci ( 1190 DArT loci and 839 other loci) spanning 1603.5 cM and with an average marker density of 1 marker/0.79 cM. In addition, 35 multicopy markers were identified. On average, each chromosome on the consensus map contained 203 markers of which 58.6% were DArT markers. Non-random patterns of DNA marker distribution were observed, with some clear marker-dense regions and some marker-rare regions. Conclusion: The final consensus map has allowed us to map a larger number of markers than possible in any individual map, to obtain a more complete coverage of the sorghum genome and to fill a number of gaps on individual maps. In addition to overall general consistency of marker order across individual component maps, good agreement in overall distances between common marker pairs across the component maps used in this study was determined, using a difference ratio calculation. The obtained consensus map can be used as a reference resource for genetic studies in different genetic backgrounds, in addition to providing a framework for transferring genetic information between different marker technologies and for integrating DArT markers with other genomic resources. DArT markers represent an affordable, high throughput marker system with great utility in molecular breeding programs, especially in crops such as sorghum where SNP arrays are not publicly available.
Resumo:
Chromosomal alterations in leukemia have been shown to have prognostic and predictive significance and are also important minimal residual disease (MRD) markers in the follow-up of leukemia patients. Although specific oncogenes and tumor suppressors have been discovered in some of the chromosomal alterations, the role and target genes of many alterations in leukemia remain unknown. In addition, a number of leukemia patients have a normal karyotype by standard cytogenetics, but have variability in clinical course and are often molecularly heterogeneous. Cytogenetic methods traditionally used in leukemia analysis and diagnostics; G-banding, various fluorescence in situ hybridization (FISH) techniques, and chromosomal comparative genomic hybridization (cCGH), have enormously increased knowledge about the leukemia genome, but have limitations in resolution or in genomic coverage. In the last decade, the development of microarray comparative genomic hybridization (array-CGH, aCGH) for DNA copy number analysis and the SNP microarray (SNP-array) method for simultaneous copy number and loss of heterozygosity (LOH) analysis has enabled investigation of chromosomal and gene alterations genome-wide with high resolution and high throughput. In these studies, genetic alterations were analyzed in acute myeloid leukemia (AML) and chronic lymphocytic leukemia (CLL). The aim was to screen and characterize genomic alterations that could play role in leukemia pathogenesis by using aCGH and SNP-arrays. One of the most important goals was to screen cryptic alterations in karyotypically normal leukemia patients. In addition, chromosomal changes were evaluated to narrow the target regions, to find new markers, and to obtain tumor suppressor and oncogene candidates. The work presented here shows the capability of aCGH to detect submicroscopic copy number alterations in leukemia, with information about breakpoints and genes involved in the alterations, and that genome-wide microarray analyses with aCGH and SNP-array are advantageous methods in the research and diagnosis of leukemia. The most important findings were the cryptic changes detected with aCGH in karyotypically normal AML and CLL, characterization of amplified genes in 11q marker chromosomes, detection of deletion-based mechanisms of MLL-ARHGEF12 fusion gene formation, and detection of LOH without copy number alteration in karyotypically normal AML. These alterations harbor candidate oncogenes and tumor suppressors for further studies.
Resumo:
Marginal zone B-cell lymphomas (MZLs) have been divided into 3 distinct subtypes (extranodal MZLs of mucosa-associated lymphoid tissue [MALT] type, nodal MZLs, and splenic MZLs). Nevertheless, the relationship between the subtypes is still unclear. We performed a comprehensive analysis of genomic DNA copy number changes in a very large series of MZL cases with the aim of addressing this question. Samples from 218 MZL patients (25 nodal, 57 MALT, 134 splenic, and 2 not better specified MZLs) were analyzed with the Affymetrix Human Mapping 250K SNP arrays, and the data combined with matched gene expression in 33 of 218 cases. MALT lymphoma presented significantly more frequently gains at 3p, 6p, 18p, and del(6q23) (TNFAIP3/A20), whereas splenic MZLs was associated with del(7q31), del(8p). Nodal MZLs did not show statistically significant differences compared with MALT lymphoma while lacking the splenic MZLs-related 7q losses. Gains of 3q and 18q were common to all 3 subtypes. del(8p) was often present together with del(17p) (TP53). Although del(17p) did not determine a worse outcome and del(8p) was only of borderline significance, the presence of both deletions had a highly significant negative impact on the outcome of splenic MZLs.
Resumo:
Le caryotype moléculaire permet d’identifier un CNV chez 10-14% des individus atteints de déficience intellectuelle et/ou de malformations congénitales. C’est pourquoi il s’agit maintenant de l’analyse de première intention chez ces patients. Toutefois, le rendement diagnostique n’est pas aussi bien défini en contexte prénatal et l’identification de CNVs de signification clinique incertaine y est particulièrement problématique à cause du risque d’interruption de grossesse. Nous avons donc testé 49 fœtus avec malformations majeures et un caryotype conventionnel normal avec une micropuce CGH pangénomique, et obtenu un diagnostic dans 8,2% des cas. Par ailleurs, des micropuces à très haute résolution combinant le caryotype moléculaire et le génotypage de SNPs ont récemment été introduites sur le marché. En plus d’identifier les CNVs, ces plateformes détectent les LOHs, qui peuvent indiquer la présence d’une mutation homozygote ou de disomie uniparentale. Ces anomalies pouvant être associées à la déficience intellectuelle ou à des malformations, leur détection est particulièrement intéressante pour les patients dont le phénotype reste inexpliqué. Cependant, le rendement diagnostique de ces plateformes n’est pas confirmé, et l’utilité clinique réelle des LOHs n’est toujours pas établie. Nous avons donc testé 21 enfants atteints de déficience intellectuelle pour qui les méthodes standards d’analyse génétique n’avaient pas résulté en un diagnostic, et avons pu faire passer le rendement diagnostique de 14,3% à 28,6% grâce à l’information fournie par les LOHs. Cette étude démontre l’utilité clinique d’une micropuce CGH pangénomique chez des fœtus avec malformations, de même que celle d’une micropuce SNP chez des enfants avec déficience intellectuelle.
Resumo:
L’analyse des anomalies génomiques récurrentes est importante pour établir le diagnostic, le pronostic et pour orienter la thérapie des leucémies aiguës pédiatriques. L’objectif de notre étude est d’élaborer une stratégie optimale pour détecter les anomalies chromosomiques dans les leucémies aiguës lymphoblastiques (LAL) et myéloïdes (LAM) des enfants. Pour ce faire, nous avons caractérisé au caryotype, avec des panels d’hybridation in situ en fluorescence (FISH), par RT-PCR et par l’index d’ADN 253 leucémies de novo reçues au CHU Sainte-Justine entre 2005 et 2011 (186 LAL-B, 27 LAL-T et 40 LAM). Nous avons réussi à optimiser la détection des anomalies chromosomiques dans les trois types de leucémies, avec des fréquences de 93,5% dans les LAL-B (174/186), 66,7% dans les LAL-T (18/27) et 90% dans les LAM (36/40). Nos résultats suggèrent d’utiliser plusieurs tests génétiques concomitants afin d’optimiser la détection des anomalies génomiques dans les LAL et les LAM de novo pédiatriques.
Resumo:
Phenotypically discordant monozygotic twins offer the possibility of gene discovery through delineation of molecular abnormalities in one member of the twin pair. One proposed mechanism of discordance is postzygotically occurring genomic alterations resulting from mitotic recombination and other somatic changes. Detection of altered genomic fragments can reveal candidate gene loci that can be verified through additional analyses. We investigated this hypothesis using array comparative genomic hybridization; the 50K and 250K Affymetrix GeneChip (R) SNP arrays and an Illumina custom array consisting of 1,536 SNPs, to scan for genomic alterations in a sample of monozygotic twin pairs with discordant cleft lip and/or palate phenotypes. Paired analysis for deletions, amplifications and loss of heterozygosity, along with sequence verification of SNPs with discordant genotype calls did not reveal any genomic discordance between twin pairs in lymphocyte DNA samples. Our results demonstrate that postzygotic genomic alterations are not a common cause of monozygotic twin discordance for isolated cleft lip and/or palate. However, rare or balanced genomic alterations, tissue-specific events and small aberrations beyond the detection level of our experimental approach cannot be ruled out. The stability of genomes we observed in our study samples also suggests that detection of discordant events in other monozygotic twin pairs would be remarkable and of potential disease significance.
Resumo:
Marginal zone B-cell lymphomas (MZLs) have been divided into 3 distinct subtypes (extranodal MZLs of mucosa-associated lymphoid tissue [MALT] type, nodal MZLs, and splenic MZLs). Nevertheless, the relationship between the subtypes is still unclear. We performed a comprehensive analysis of genomic DNA copy number changes in a very large series of MZL cases with the aim of addressing this question. Samples from 218 MZL patients (25 nodal, 57 MALT, 134 splenic, and 2 not better specified MZLs) were analyzed with the Affymetrix Human Mapping 250K SNP arrays, and the data combined with matched gene expression in 33 of 218 cases. MALT lymphoma presented significantly more frequently gains at 3p, 6p, 18p, and del(6q23) (TNFAIP3/A20), whereas splenic MZLs was associated with del(7q31), del(8p). Nodal MZLs did not show statistically significant differences compared with MALT lymphoma while lacking the splenic MZLs-related 7q losses. Gains of 3q and 18q were common to all 3 subtypes. del(8p) was often present together with del(17p) (TP53). Although del(17p) did not determine a worse outcome and del(8p) was only of borderline significance, the presence of both deletions had a highly significant negative impact on the outcome of splenic MZLs.
Resumo:
High-throughput SNP arrays provide estimates of genotypes for up to one million loci, often used in genome-wide association studies. While these estimates are typically very accurate, genotyping errors do occur, which can influence in particular the most extreme test statistics and p-values. Estimates for the genotype uncertainties are also available, although typically ignored. In this manuscript, we develop a framework to incorporate these genotype uncertainties in case-control studies for any genetic model. We verify that using the assumption of a “local alternative” in the score test is very reasonable for effect sizes typically seen in SNP association studies, and show that the power of the score test is simply a function of the correlation of the genotype probabilities with the true genotypes. We demonstrate that the power to detect a true association can be substantially increased for difficult to call genotypes, resulting in improved inference in association studies.
Resumo:
Unique and shared cytogenetic abnormalities have been documented for marginal zone lymphomas (MZLs) arising at different sites. Recently, homozygous deletions of the chromosomal band 6q23, involving the tumor necrosis factor alpha-induced protein 3 (TNFAIP3, A20) gene, a negative regulator of NF-kappaB, were described in ocular adnexal MZL, suggesting a role for A20 as a tumor suppressor in this disease. Here, we investigated inactivation of A20 by DNA mutations or deletions in a panel of extranodal MZL (EMZL), nodal MZL (NMZL), and splenic MZL (SMZL). Inactivating mutations encoding truncated A20 proteins were identified in 6 (19%) of 32 MZLs, including 2 (18%) of 11 EMZLs, 3 (33%) of 9 NMZLs, and 1 (8%) of 12 SMZLs. Two additional unmutated nonsplenic MZLs also showed monoallelic or biallelic A20 deletions by fluorescent in situ hybridization (FISH) and/or SNP-arrays. Thus, A20 inactivation by either somatic mutation and/or deletion represents a common genetic aberration across all MZL subtypes, which may contribute to lymphomagenesis by inducing constitutive NF-kappaB activation.
Resumo:
Background. The impact of human genetic background on low-trauma fracture (LTF) risk has not been evaluated in the context of human immunodeficiency virus (HIV) and clinical LTF risk factors. Methods. In the general population, 6 common single-nucleotide polymorphisms (SNPs) associate with LTF through genome-wide association study. Using genome-wide SNP arrays and imputation, we genotyped these SNPs in HIV-positive, white Swiss HIV Cohort Study participants. We included 103 individuals with a first, physician-validated LTF and 206 controls matched on gender, whose duration of observation and whose antiretroviral therapy start dates were similar using incidence density sampling. Analyses of nongenetic LTF risk factors were based on 158 cases and 788 controls. Results. A genetic risk score built from the 6 LTF-associated SNPs did not associate with LTF risk, in both models including and not including parental hip fracture history. The contribution of clinical LTF risk factors was limited in our dataset. Conclusions. Genetic LTF markers with a modest effect size in the general population do not improve fracture prediction in persons with HIV, in whom clinical LTF risk factors are prevalent in both cases and controls.
Resumo:
Background: Esophageal adenocarcinoma (EA) is one of the fastest rising cancers in western countries. Barrett’s Esophagus (BE) is the premalignant precursor of EA. However, only a subset of BE patients develop EA, which complicates the clinical management in the absence of valid predictors. Genetic risk factors for BE and EA are incompletely understood. This study aimed to identify novel genetic risk factors for BE and EA.Methods: Within an international consortium of groups involved in the genetics of BE/EA, we performed the first meta-analysis of all genome-wide association studies (GWAS) available, involving 6,167 BE patients, 4,112 EA patients, and 17,159 representative controls, all of European ancestry, genotyped on Illumina high-density SNP-arrays, collected from four separate studies within North America, Europe, and Australia. Meta-analysis was conducted using the fixed-effects inverse variance-weighting approach. We used the standard genome-wide significant threshold of 5×10-8 for this study. We also conducted an association analysis following reweighting of loci using an approach that investigates annotation enrichment among the genome-wide significant loci. The entire GWAS-data set was also analyzed using bioinformatics approaches including functional annotation databases as well as gene-based and pathway-based methods in order to identify pathophysiologically relevant cellular pathways.Findings: We identified eight new associated risk loci for BE and EA, within or near the CFTR (rs17451754, P=4·8×10-10), MSRA (rs17749155, P=5·2×10-10), BLK (rs10108511, P=2·1×10-9), KHDRBS2 (rs62423175, P=3·0×10-9), TPPP/CEP72 (rs9918259, P=3·2×10-9), TMOD1 (rs7852462, P=1·5×10-8), SATB2 (rs139606545, P=2·0×10-8), and HTR3C/ABCC5 genes (rs9823696, P=1·6×10-8). A further novel risk locus at LPA (rs12207195, posteriori probability=0·925) was identified after re-weighting using significantly enriched annotations. This study thereby doubled the number of known risk loci. The strongest disease pathways identified (P<10-6) belong to muscle cell differentiation and to mesenchyme development/differentiation, which fit with current pathophysiological BE/EA concepts. To our knowledge, this study identified for the first time an EA-specific association (rs9823696, P=1·6×10-8) near HTR3C/ABCC5 which is independent of BE development (P=0·45).Interpretation: The identified disease loci and pathways reveal new insights into the etiology of BE and EA. Furthermore, the EA-specific association at HTR3C/ABCC5 may constitute a novel genetic marker for the prediction of transition from BE to EA. Mutations in CFTR, one of the new risk loci identified in this study, cause cystic fibrosis (CF), the most common recessive disorder in Europeans. Gastroesophageal reflux (GER) belongs to the phenotypic CF-spectrum and represents the main risk factor for BE/EA. Thus, the CFTR locus may trigger a common GER-mediated pathophysiology.