969 resultados para SNP microarray
Resumo:
There are many known examples of multiple semi-independent associations at individual loci; such associations might arise either because of true allelic heterogeneity or because of imperfect tagging of an unobserved causal variant. This phenomenon is of great importance in monogenic traits but has not yet been systematically investigated and quantified in complex-trait genome-wide association studies (GWASs). Here, we describe a multi-SNP association method that estimates the effect of loci harboring multiple association signals by using GWAS summary statistics. Applying the method to a large anthropometric GWAS meta-analysis (from the Genetic Investigation of Anthropometric Traits consortium study), we show that for height, body mass index (BMI), and waist-to-hip ratio (WHR), 3%, 2%, and 1%, respectively, of additional phenotypic variance can be explained on top of the previously reported 10% (height), 1.5% (BMI), and 1% (WHR). The method also permitted a substantial increase (by up to 50%) in the number of loci that replicate in a discovery-validation design. Specifically, we identified 74 loci at which the multi-SNP, a linear combination of SNPs, explains significantly more variance than does the best individual SNP. A detailed analysis of multi-SNPs shows that most of the additional variability explained is derived from SNPs that are not in linkage disequilibrium with the lead SNP, suggesting a major contribution of allelic heterogeneity to the missing heritability.
Resumo:
The relationship between inflammation and cancer is well established in several tumor types, including bladder cancer. We performed an association study between 886 inflammatory-gene variants and bladder cancer risk in 1,047 cases and 988 controls from the Spanish Bladder Cancer (SBC)/EPICURO Study. A preliminary exploration with the widely used univariate logistic regression approach did not identify any significant SNP after correcting for multiple testing. We further applied two more comprehensive methods to capture the complexity of bladder cancer genetic susceptibility: Bayesian Threshold LASSO (BTL), a regularized regression method, and AUC-Random Forest, a machine-learning algorithm. Both approaches explore the joint effect of markers. BTL analysis identified a signature of 37 SNPs in 34 genes showing an association with bladder cancer. AUC-RF detected an optimal predictive subset of 56 SNPs. 13 SNPs were identified by both methods in the total population. Using resources from the Texas Bladder Cancer study we were able to replicate 30% of the SNPs assessed. The associations between inflammatory SNPs and bladder cancer were reexamined among non-smokers to eliminate the effect of tobacco, one of the strongest and most prevalent environmental risk factor for this tumor. A 9 SNP-signature was detected by BTL. Here we report, for the first time, a set of SNP in inflammatory genes jointly associated with bladder cancer risk. These results highlight the importance of the complex structure of genetic susceptibility associated with cancer risk.
Resumo:
O objetivo deste trabalho foi validar a associação de marcadores moleculares do tipo "single nucleotide polymorphism" (SNP) para os genes FAD3A, FAD3B e FAD3C com o conteúdo de ácido linolênico (18:3) em sementes de soja e analisar a influência dos parâmetros genéticos destes marcadores nesta característica. Foram genotipadas 185 progênies F2 derivadas do cruzamento entre A29 (mutante para os três genes FAD3, 1% de 18:3) e Tucunaré (genótipo selvagem, 11% de 18:3). Os marcadores moleculares para os genes FAD3A, FAD3B e FAD3C explicaram a variação do conteúdo de 18:3 nas populações segregantes F2 e F2:3. Além disso, as substituições alélicas no loco FAD3A proporcionam maiores variações no conteúdo de 18:3 que as substituições nos outros dois locos.
Resumo:
Congenital heart defect (CHD) occurs in 40% of Down syndrome (DS) cases. While carrying three copies of chromosome 21 increases the risk for CHD, trisomy 21 itself is not sufficient to cause CHD. Thus, additional genetic variation and/or environmental factors could contribute to the CHD risk. Here we report genomic variations that in concert with trisomy 21, determine the risk for CHD in DS. This case-control GWAS includes 187 DS with CHD (AVSD = 69, ASD = 53, VSD = 65) as cases, and 151 DS without CHD as controls. Chromosome 21-specific association studies revealed rs2832616 and rs1943950 as CHD risk alleles (adjusted genotypic P-values <0.05). These signals were confirmed in a replication cohort of 92 DS-CHD cases and 80 DS-without CHD (nominal P-value 0.0022). Furthermore, CNV analyses using a customized chromosome 21 aCGH of 135K probes in 55 DS-AVSD and 53 DS-without CHD revealed three CNV regions associated with AVSD risk (FDR ≤ 0.05). Two of these regions that are located within the previously identified CHD region on chromosome 21 were further confirmed in a replication study of 49 DS-AVSD and 45 DS- without CHD (FDR ≤ 0.05). One of these CNVs maps near the RIPK4 gene, and the second includes the ZBTB21 (previously ZNF295) gene, highlighting the potential role of these genes in the pathogenesis of CHD in DS. We propose that the genetic architecture of the CHD risk of DS is complex and includes trisomy 21, and SNP and CNV variations in chromosome 21. In addition, a yet-unidentified genetic variation in the rest of the genome may contribute to this complex genetic architecture.
Resumo:
La industria de la producción de camarón es una de las industrias acuícolas que se encuentra en más crecimiento en la actualidad. Los estudios para encontrar marcadores genéticos son muy efectivos para la mejora de sus propiedades y de gran interés para los productores de camarón. En este trabajo se utilizaron seis individuos de una población de Litopenaeus vannamei, donde se encontraron cuatro polimorfismos de nucleótido único (SNPs) en el gen 5HT1R (5-hidroxitriptamina receptor1) y un SNP en el gen STAT (transductor de señal y activador de la transcripción). Sin embargo, el polimorfismo en el gen STAT resultó ser homocigoto en una población diferente utilizada para análisis de asociación. Los presentes análisis revelaron que el alelo C, en dos polimorfismos SNP (C109T y C395G) del gen 5HT1R, tiende a estar asociado con el aumento del peso corporal. Consideramos que hay necesidad de hacer nuevos estudios utilizando una muestra más amplia y diversa de la población en cuestión.
Resumo:
Background: Recent studies in pigs have detected copy number variants (CNVs) using the Comparative Genomic Hybridization technique in arrays designed to cover specific porcine chromosomes. The goal of this study was to identify CNV regions (CNVRs) in swine species based on whole genome SNP genotyping chips. Results: We used predictions from three different programs (cnvPartition, PennCNV and GADA) to analyze data from the Porcine SNP60 BeadChip. A total of 49 CNVRs were identified in 55 animals from an Iberian x Landrace cross (IBMAP) according to three criteria: detected in at least two animals, contained three or more consecutive SNPs and recalled by at least two programs. Mendelian inheritance of CNVRs was confirmed in animals belonging to several generations of the IBMAP cross. Subsequently, a segregation analysis of these CNVRs was performed in 372 additional animals from the IBMAP cross and its distribution was studied in 133 unrelated pig samples from different geographical origins. Five out of seven analyzed CNVRs were validated by real time quantitative PCR, some of which coincide with well known examples of CNVs conserved across mammalian species. Conclusions: Our results illustrate the usefulness of Porcine SNP60 BeadChip to detect CNVRs and show that structural variants can not be neglected when studying the genetic variability in this species.
Resumo:
In this work, we propose a copula-based method to generate synthetic gene expression data that account for marginal and joint probability distributions features captured from real data. Our method allows us to implant significant genes in the synthetic dataset in a controlled manner, giving the possibility of testing new detection algorithms under more realistic environments.
Resumo:
Background: We use an approach based on Factor Analysis to analyze datasets generated for transcriptional profiling. The method groups samples into biologically relevant categories, and enables the identification of genes and pathways most significantly associated to each phenotypic group, while allowing for the participation of a given gene in more than one cluster. Genes assigned to each cluster are used for the detection of pathways predominantly activated in that cluster by finding statistically significant associated GO terms. We tested the approach with a published dataset of microarray experiments in yeast. Upon validation with the yeast dataset, we applied the technique to a prostate cancer dataset. Results: Two major pathways are shown to be activated in organ-confined, non-metastatic prostate cancer: those regulated by the androgen receptor and by receptor tyrosine kinases. A number of gene markers (HER3, IQGAP2 and POR1) highlighted by the software and related to the later pathway have been validated experimentally a posteriori on independent samples. Conclusion: Using a new microarray analysis tool followed by a posteriori experimental validation of the results, we have confirmed several putative markers of malignancy associated with peptide growth factor signalling in prostate cancer and revealed others, most notably ERRB3 (HER3). Our study suggest that, in primary prostate cancer, HER3, together or not with HER4, rather than in receptor complexes involving HER2, could play an important role in the biology of these tumors. These results provide new evidence for the role of receptor tyrosine kinases in the establishment and progression of prostate cancer.
Resumo:
Integrating single nucleotide polymorphism (SNP) p-values from genome-wide association studies (GWAS) across genes and pathways is a strategy to improve statistical power and gain biological insight. Here, we present Pascal (Pathway scoring algorithm), a powerful tool for computing gene and pathway scores from SNP-phenotype association summary statistics. For gene score computation, we implemented analytic and efficient numerical solutions to calculate test statistics. We examined in particular the sum and the maximum of chi-squared statistics, which measure the strongest and the average association signals per gene, respectively. For pathway scoring, we use a modified Fisher method, which offers not only significant power improvement over more traditional enrichment strategies, but also eliminates the problem of arbitrary threshold selection inherent in any binary membership based pathway enrichment approach. We demonstrate the marked increase in power by analyzing summary statistics from dozens of large meta-studies for various traits. Our extensive testing indicates that our method not only excels in rigorous type I error control, but also results in more biologically meaningful discoveries.
Resumo:
BackgroundBipolar disorder is a highly heritable polygenic disorder. Recent enrichment analyses suggest that there may be true risk variants for bipolar disorder in the expression quantitative trait loci (eQTL) in the brain.AimsWe sought to assess the impact of eQTL variants on bipolar disorder risk by combining data from both bipolar disorder genome-wide association studies (GWAS) and brain eQTL.MethodTo detect single nucleotide polymorphisms (SNPs) that influence expression levels of genes associated with bipolar disorder, we jointly analysed data from a bipolar disorder GWAS (7481 cases and 9250 controls) and a genome-wide brain (cortical) eQTL (193 healthy controls) using a Bayesian statistical method, with independent follow-up replications. The identified risk SNP was then further tested for association with hippocampal volume (n = 5775) and cognitive performance (n = 342) among healthy individuals.ResultsIntegrative analysis revealed a significant association between a brain eQTL rs6088662 on chromosome 20q11.22 and bipolar disorder (log Bayes factor = 5.48; bipolar disorder P = 5.85×10(-5)). Follow-up studies across multiple independent samples confirmed the association of the risk SNP (rs6088662) with gene expression and bipolar disorder susceptibility (P = 3.54×10(-8)). Further exploratory analysis revealed that rs6088662 is also associated with hippocampal volume and cognitive performance in healthy individuals.ConclusionsOur findings suggest that 20q11.22 is likely a risk region for bipolar disorder; they also highlight the informative value of integrating functional annotation of genetic variants for gene expression in advancing our understanding of the biological basis underlying complex disorders, such as bipolar disorder.
Resumo:
Currently, numerous high-throughput technologies are available for the study of human carcinomas. In literature, many variations of these techniques have been described. The common denominator for these methodologies is the high amount of data obtained in a single experiment, in a short time period, and at a fairly low cost. However, these methods have also been described with several problems and limitations. The purpose of this study was to test the applicability of two selected high-throughput methods, cDNA and tissue microarrays (TMA), in cancer research. Two common human malignancies, breast and colorectal cancer, were used as examples. This thesis aims to present some practical considerations that need to be addressed when applying these techniques. cDNA microarrays were applied to screen aberrant gene expression in breast and colon cancers. Immunohistochemistry was used to validate the results and to evaluate the association of selected novel tumour markers with the outcome of the patients. The type of histological material used in immunohistochemistry was evaluated especially considering the applicability of whole tissue sections and different types of TMAs. Special attention was put on the methodological details in the cDNA microarray and TMA experiments. In conclusion, many potential tumour markers were identified in the cDNA microarray analyses. Immunohistochemistry could be applied to validate the observed gene expression changes of selected markers and to associate their expression change with patient outcome. In the current experiments, both TMAs and whole tissue sections could be used for this purpose. This study showed for the first time that securin and p120 catenin protein expression predict breast cancer outcome and the immunopositivity of carbonic anhydrase IX associates with the outcome of rectal cancer. The predictive value of these proteins was statistically evident also in multivariate analyses with up to a 13.1- fold risk for cancer specific death in a specific subgroup of patients.
Resumo:
High-throughput screening of cellular effects of RNA interference (RNAi) libraries is now being increasingly applied to explore the role of genes in specific cell biological processes and disease states. However, the technology is still limited to specialty laboratories, due to the requirements for robotic infrastructure, access to expensive reagent libraries, expertise in high-throughput screening assay development, standardization, data analysis and applications. In the future, alternative screening platforms will be required to expand functional large-scale experiments to include more RNAi constructs, allow combinatorial loss-of-function analyses (e.g. genegene or gene-drug interaction), gain-of-function screens, multi-parametric phenotypic readouts or comparative analysis of many different cell types. Such comprehensive perturbation of gene networks in cells will require a major increase in the flexibility of the screening platforms, throughput and reduction of costs. As an alternative for the conventional multi-well based high-throughput screening -platforms, here the development of a novel cell spot microarray method for production of high density siRNA reverse transfection arrays is described. The cell spot microarray platform is distinguished from the majority of other transfection cell microarray techniques by the spatially confined array layout that allow highly parallel screening of large-scale RNAi reagent libraries with assays otherwise difficult or not applicable to high-throughput screening. This study depicts the development of the cell spot microarray method along with biological application examples of high-content immunofluorescence and phenotype based cancer cell biological analyses focusing on the regulation of prostate cancer cell growth, maintenance of genomic integrity in breast cancer cells, and functional analysis of integrin protein-protein interactions in situ.
Resumo:
Resumo:O colapso induzido pelo exercício (EIC) é considerado uma síndrome autossômica recessiva que afeta principalmente cães da raça Labrador Retriever. A doença é caracterizada por fraqueza muscular e colapso após exercício intenso. Usualmente, ocorre recuperação clínica após o episódio, mas alguns animais podem vir a óbito. Os sinais clínicos são decorrentes do polimorfismo de base única (SNP) c.767G>T no gene Dynamin 1 (DNM1). O objetivo deste trabalho foi determinar a ocorrência deste SNP em 321 cães da raça Labrador Retriever do Estado de São Paulo. Primers específicos para a amplificação de todo o exon 6 do gene DNM1 foram usados nas PCRs utilizando DNA a partir de amostras de sangue ou swab bucal, a avaliação final foi realizada com sequenciamento direto dos produtos da PCR. Dentre os 321 animais estudados, 3,4 % (11/321) eram homozigotos para o SNP c.767G>T no gene DNM1 e 24,6% (79/321) eram heterozigotos. Somente um dos 11 animais homozigotos apresentavam sinais clínicos compatíveis com a EIC. Este é o primeiro estudo sobre a ocorrência deste SNP no Brasil e considerando que quase 25% dos animais estudados eram heterozigotos, a genotipagem dos animais para este SNP pode ser importante antes dos acasalamentos para cães desta raça. A EIC deve ser considerada nos diagnósticos diferenciais de enfermidades neuromusculares em cães da raça Labrador Retriever.
Resumo:
Abstract: Dermatosparaxis is an autosomal recessive disorder of connective tissue; the disorder is clinically characterized by skin fragility and hyperextensibility. Dermatosparaxis in White Dorper sheep is caused by a single nucleotide polymorphism (SNP) (c.421G>T) in the ADAM metalloproteinase with thrombospondin type 1 motif, 2 (ADAMTS2) gene. The aim of this study was to investigate the prevalence of this SNP in a White Dorper herd in São Paulo state, Brazil. In this study, we collected blood DNA samples from 303 White Dorper sheep and performed polymerase chain reaction to amplify the SNP region. The samples were sequenced to determine the presence of the SNP in the ADAMTS2 gene. The SNP prevalence in the studied population was 15.5%; this finding indicates that more effective control measures should be used to prevent the inheritance of SNP c.421G>T in the ADAMTS2 gene in Brazilian White Dorper herds.