372 resultados para Genome-wide Search


Relevância:

80.00% 80.00%

Publicador:

Resumo:

Genome-wide association studies (GWAS) have identified many risk loci for complex diseases, but effect sizes are typically small and information on the underlying biological processes is often lacking. Associations with metabolic traits as functional intermediates can overcome these problems and potentially inform individualized therapy. Here we report a comprehensive analysis of genotype-dependent metabolic phenotypes using a GWAS with non-targeted metabolomics. We identified 37 genetic loci associated with blood metabolite concentrations, of which 25 show effect sizes that are unusually high for GWAS and account for 10-60% differences in metabolite levels per allele copy. Our associations provide new functional insights for many disease-related associations that have been reported in previous studies, including those for cardiovascular and kidney disorders, type 2 diabetes, cancer, gout, venous thromboembolism and Crohn's disease. The study advances our knowledge of the genetic basis of metabolic individuality in humans and generates many new hypotheses for biomedical and pharmaceutical research.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Sequencing of pools of individuals (Pool-Seq) represents a reliable and cost-effective approach for estimating genome-wide SNP and transposable element insertion frequencies. However, Pool-Seq does not provide direct information on haplotypes so that, for example, obtaining inversion frequencies has not been possible until now. Here, we have developed a new set of diagnostic marker SNPs for seven cosmopolitan inversions in Drosophila melanogaster that can be used to infer inversion frequencies from Pool-Seq data. We applied our novel marker set to Pool-Seq data from an experimental evolution study and from North American and Australian latitudinal clines. In the experimental evolution data, we find evidence that positive selection has driven the frequencies of In(3R)C and In(3R)Mo to increase over time. In the clinal data, we confirm the existence of frequency clines for In(2L)t, In(3L)P and In(3R)Payne in both North America and Australia and detect a previously unknown latitudinal cline for In(3R)Mo in North America. The inversion markers developed here provide a versatile and robust tool for characterizing inversion frequencies and their dynamics in Pool-Seq data from diverse D. melanogaster populations.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Genome-wide association studies have been instrumental in identifying genetic variants associated with complex traits such as human disease or gene expression phenotypes. It has been proposed that extending existing analysis methods by considering interactions between pairs of loci may uncover additional genetic effects. However, the large number of possible two-marker tests presents significant computational and statistical challenges. Although several strategies to detect epistasis effects have been proposed and tested for specific phenotypes, so far there has been no systematic attempt to compare their performance using real data. We made use of thousands of gene expression traits from linkage and eQTL studies, to compare the performance of different strategies. We found that using information from marginal associations between markers and phenotypes to detect epistatic effects yielded a lower false discovery rate (FDR) than a strategy solely using biological annotation in yeast, whereas results from human data were inconclusive. For future studies whose aim is to discover epistatic effects, we recommend incorporating information about marginal associations between SNPs and phenotypes instead of relying solely on biological annotation. Improved methods to discover epistatic effects will result in a more complete understanding of complex genetic effects.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence the phenotype. Genome-wide association (GWA) studies have identified more than 600 variants associated with human traits, but these typically explain small fractions of phenotypic variation, raising questions about the use of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P = 0.016) and that underlie skeletal growth defects (P < 0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented among variants that alter amino-acid structure of proteins and expression levels of nearby genes. Our data explain approximately 10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to approximately 16% of phenotypic variation (approximately 20% of heritable variation). Although additional approaches are needed to dissect the genetic architecture of polygenic human traits fully, our findings indicate that GWA studies can identify large numbers of loci that implicate biologically relevant genes and pathways.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Recent genome-wide association studies (GWAS) have identified genetic variations near the IL28B gene which are strongly associated with spontaneous and treatment-induced clearance of hepatitis C virus (HCV) infection. Protective IL28B variations are strongly associated with on-treatment viral kinetics and approximately 2-fold increased sustained virologic response (SVR) rates in HCV genotype 1 and 4 patients. In HCV genotype 1 patients, IL28B variations were shown to be the strongest pre-treatment predictor of virologic response. In the treatment of HCV genotype 2 and 3 infected patients, IL28B variations play only a minor role. Preliminary data indicate that IL28B variations are also associated with treatment outcome of regimens, including directly acting antiviral (DAA) agents, though their impact seems to be attenuated compared to standard treatment. Here, we review these important findings and discuss possible implications for clinical decision making in the treatment of HCV infection.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Adiponutrin (PNPLA3) is a predominantly liver-expressed transmembrane protein with phospholipase activity that is regulated by fasting and feeding. Recent genome-wide association studies identified PNPLA3 to be associated with hepatic fat content and liver function, thus pointing to a possible involvement in the hepatic lipoprotein metabolism. The aim of this study was to examine the association between two common variants in the adiponutrin gene and parameters of lipoprotein metabolism in 23,274 participants from eight independent West-Eurasian study populations including six population-based studies [Bruneck (n = 800), KORA S3/F3 (n = 1644), KORA S4/F4 (n = 1814), CoLaus (n = 5435), SHIP (n = 4012), Rotterdam (n = 5967)], the SAPHIR Study as a healthy working population (n = 1738) and the Utah Obesity Case-Control Study including a group of 1037 severely obese individuals (average BMI 46 kg/m2) and 827 controls from the same geographical region of Utah. We observed a strong additive association of a common non-synonymous variant within adiponutrin (rs738409) with age-, gender-, and alanine-aminotransferase-adjusted lipoprotein concentrations: each copy of the minor allele decreased levels of total cholesterol on average by 2.43 mg/dl (P = 8.87 x 10(-7)), non-HDL cholesterol levels by 2.35 mg/dl (P = 2.27 x 10(-6)) and LDL cholesterol levels by 1.48 mg/dl (P = 7.99 x 10(-4)). These associations remained significant after correction for multiple testing. We did not observe clear evidence for associations with HDL cholesterol or triglyceride concentrations. In conclusion, our study suggests that adiponutrin is involved in the metabolism of apoB-containing lipoproteins.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The prevalence of hypertension in African Americans (AAs) is higher than in other US groups; yet, few have performed genome-wide association studies (GWASs) in AA. Among people of European descent, GWASs have identified genetic variants at 13 loci that are associated with blood pressure. It is unknown if these variants confer susceptibility in people of African ancestry. Here, we examined genome-wide and candidate gene associations with systolic blood pressure (SBP) and diastolic blood pressure (DBP) using the Candidate Gene Association Resource (CARe) consortium consisting of 8591 AAs. Genotypes included genome-wide single-nucleotide polymorphism (SNP) data utilizing the Affymetrix 6.0 array with imputation to 2.5 million HapMap SNPs and candidate gene SNP data utilizing a 50K cardiovascular gene-centric array (ITMAT-Broad-CARe [IBC] array). For Affymetrix data, the strongest signal for DBP was rs10474346 (P= 3.6 × 10(-8)) located near GPR98 and ARRDC3. For SBP, the strongest signal was rs2258119 in C21orf91 (P= 4.7 × 10(-8)). The top IBC association for SBP was rs2012318 (P= 6.4 × 10(-6)) near SLC25A42 and for DBP was rs2523586 (P= 1.3 × 10(-6)) near HLA-B. None of the top variants replicated in additional AA (n = 11 882) or European-American (n = 69 899) cohorts. We replicated previously reported European-American blood pressure SNPs in our AA samples (SH2B3, P= 0.009; TBX3-TBX5, P= 0.03; and CSK-ULK3, P= 0.0004). These genetic loci represent the best evidence of genetic influences on SBP and DBP in AAs to date. More broadly, this work supports that notion that blood pressure among AAs is a trait with genetic underpinnings but also with significant complexity.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Abstract : The human body is composed of a huge number of cells acting together in a concerted manner. The current understanding is that proteins perform most of the necessary activities in keeping a cell alive. The DNA, on the other hand, stores the information on how to produce the different proteins in the genome. Regulating gene transcription is the first important step that can thus affect the life of a cell, modify its functions and its responses to the environment. Regulation is a complex operation that involves specialized proteins, the transcription factors. Transcription factors (TFs) can bind to DNA and activate the processes leading to the expression of genes into new proteins. Errors in this process may lead to diseases. In particular, some transcription factors have been associated with a lethal pathological state, commonly known as cancer, associated with uncontrolled cellular proliferation, invasiveness of healthy tissues and abnormal responses to stimuli. Understanding cancer-related regulatory programs is a difficult task, often involving several TFs interacting together and influencing each other's activity. This Thesis presents new computational methodologies to study gene regulation. In addition we present applications of our methods to the understanding of cancer-related regulatory programs. The understanding of transcriptional regulation is a major challenge. We address this difficult question combining computational approaches with large collections of heterogeneous experimental data. In detail, we design signal processing tools to recover transcription factors binding sites on the DNA from genome-wide surveys like chromatin immunoprecipitation assays on tiling arrays (ChIP-chip). We then use the localization about the binding of TFs to explain expression levels of regulated genes. In this way we identify a regulatory synergy between two TFs, the oncogene C-MYC and SP1. C-MYC and SP1 bind preferentially at promoters and when SP1 binds next to C-NIYC on the DNA, the nearby gene is strongly expressed. The association between the two TFs at promoters is reflected by the binding sites conservation across mammals, by the permissive underlying chromatin states 'it represents an important control mechanism involved in cellular proliferation, thereby involved in cancer. Secondly, we identify the characteristics of TF estrogen receptor alpha (hERa) target genes and we study the influence of hERa in regulating transcription. hERa, upon hormone estrogen signaling, binds to DNA to regulate transcription of its targets in concert with its co-factors. To overcome the scarce experimental data about the binding sites of other TFs that may interact with hERa, we conduct in silico analysis of the sequences underlying the ChIP sites using the collection of position weight matrices (PWMs) of hERa partners, TFs FOXA1 and SP1. We combine ChIP-chip and ChIP-paired-end-diTags (ChIP-pet) data about hERa binding on DNA with the sequence information to explain gene expression levels in a large collection of cancer tissue samples and also on studies about the response of cells to estrogen. We confirm that hERa binding sites are distributed anywhere on the genome. However, we distinguish between binding sites near promoters and binding sites along the transcripts. The first group shows weak binding of hERa and high occurrence of SP1 motifs, in particular near estrogen responsive genes. The second group shows strong binding of hERa and significant correlation between the number of binding sites along a gene and the strength of gene induction in presence of estrogen. Some binding sites of the second group also show presence of FOXA1, but the role of this TF still needs to be investigated. Different mechanisms have been proposed to explain hERa-mediated induction of gene expression. Our work supports the model of hERa activating gene expression from distal binding sites by interacting with promoter bound TFs, like SP1. hERa has been associated with survival rates of breast cancer patients, though explanatory models are still incomplete: this result is important to better understand how hERa can control gene expression. Thirdly, we address the difficult question of regulatory network inference. We tackle this problem analyzing time-series of biological measurements such as quantification of mRNA levels or protein concentrations. Our approach uses the well-established penalized linear regression models where we impose sparseness on the connectivity of the regulatory network. We extend this method enforcing the coherence of the regulatory dependencies: a TF must coherently behave as an activator, or a repressor on all its targets. This requirement is implemented as constraints on the signs of the regressed coefficients in the penalized linear regression model. Our approach is better at reconstructing meaningful biological networks than previous methods based on penalized regression. The method is tested on the DREAM2 challenge of reconstructing a five-genes/TFs regulatory network obtaining the best performance in the "undirected signed excitatory" category. Thus, these bioinformatics methods, which are reliable, interpretable and fast enough to cover large biological dataset, have enabled us to better understand gene regulation in humans.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

We conducted a genome-wide association study for androgenic alopecia in 1,125 men and identified a newly associated locus at chromosome 20p11.22, confirmed in three independent cohorts (n = 1,650; OR = 1.60, P = 1.1 x 10(-14) for rs1160312). The one man in seven who harbors risk alleles at both 20p11.22 and AR (encoding the androgen receptor) has a sevenfold-increased odds of androgenic alopecia (OR = 7.12, P = 3.7 x 10(-15)).

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The genomic loci occupied by RNA polymerase (RNAP) III have been characterized in human culture cells by genome-wide chromatin immunoprecipitations, followed by deep sequencing (ChIP-seq). These studies have shown that only ∼40% of the annotated 622 human tRNA genes and pseudogenes are occupied by RNAP-III, and that these genes are often in open chromatin regions rich in active RNAP-II transcription units. We have used ChIP-seq to characterize RNAP-III-occupied loci in a differentiated tissue, the mouse liver. Our studies define the mouse liver RNAP-III-occupied loci including a conserved mammalian interspersed repeat (MIR) as a potential regulator of an RNAP-III subunit-encoding gene. They reveal that synteny relationships can be established between a number of human and mouse RNAP-III genes, and that the expression levels of these genes are significantly linked. They establish that variations within the A and B promoter boxes, as well as the strength of the terminator sequence, can strongly affect RNAP-III occupancy of tRNA genes. They reveal correlations with various genomic features that explain the observed variation of 81% of tRNA scores. In mouse liver, loci represented in the NCBI37/mm9 genome assembly that are clearly occupied by RNAP-III comprise 50 Rn5s (5S RNA) genes, 14 known non-tRNA RNAP-III genes, nine Rn4.5s (4.5S RNA) genes, and 29 SINEs. Moreover, out of the 433 annotated tRNA genes, half are occupied by RNAP-III. Transfer RNA gene expression levels reflect both an underlying genomic organization conserved in dividing human culture cells and resting mouse liver cells, and the particular promoter and terminator strengths of individual genes.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Huntington's disease (HD) pathology is well understood at a histological level but a comprehensive molecular analysis of the effect of the disease in the human brain has not previously been available. To elucidate the molecular phenotype of HD on a genome-wide scale, we compared mRNA profiles from 44 human HD brains with those from 36 unaffected controls using microarray analysis. Four brain regions were analyzed: caudate nucleus, cerebellum, prefrontal association cortex [Brodmann's area 9 (BA9)] and motor cortex [Brodmann's area 4 (BA4)]. The greatest number and magnitude of differentially expressed mRNAs were detected in the caudate nucleus, followed by motor cortex, then cerebellum. Thus, the molecular phenotype of HD generally parallels established neuropathology. Surprisingly, no mRNA changes were detected in prefrontal association cortex, thereby revealing subtleties of pathology not previously disclosed by histological methods. To establish that the observed changes were not simply the result of cell loss, we examined mRNA levels in laser-capture microdissected neurons from Grade 1 HD caudate compared to control. These analyses confirmed changes in expression seen in tissue homogenates; we thus conclude that mRNA changes are not attributable to cell loss alone. These data from bona fide HD brains comprise an important reference for hypotheses related to HD and other neurodegenerative diseases.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

BACKGROUND: Genotypes obtained with commercial SNP arrays have been extensively used in many large case-control or population-based cohorts for SNP-based genome-wide association studies for a multitude of traits. Yet, these genotypes capture only a small fraction of the variance of the studied traits. Genomic structural variants (GSV) such as Copy Number Variation (CNV) may account for part of the missing heritability, but their comprehensive detection requires either next-generation arrays or sequencing. Sophisticated algorithms that infer CNVs by combining the intensities from SNP-probes for the two alleles can already be used to extract a partial view of such GSV from existing data sets. RESULTS: Here we present several advances to facilitate the latter approach. First, we introduce a novel CNV detection method based on a Gaussian Mixture Model. Second, we propose a new algorithm, PCA merge, for combining copy-number profiles from many individuals into consensus regions. We applied both our new methods as well as existing ones to data from 5612 individuals from the CoLaus study who were genotyped on Affymetrix 500K arrays. We developed a number of procedures in order to evaluate the performance of the different methods. This includes comparison with previously published CNVs as well as using a replication sample of 239 individuals, genotyped with Illumina 550K arrays. We also established a new evaluation procedure that employs the fact that related individuals are expected to share their CNVs more frequently than randomly selected individuals. The ability to detect both rare and common CNVs provides a valuable resource that will facilitate association studies exploring potential phenotypic associations with CNVs. CONCLUSION: Our new methodologies for CNV detection and their evaluation will help in extracting additional information from the large amount of SNP-genotyping data on various cohorts and use this to explore structural variants and their impact on complex traits.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

SNAP(c) is one of a few basal transcription factors used by both RNA polymerase (pol) II and pol III. To define the set of active SNAP(c)-dependent promoters in human cells, we have localized genome-wide four SNAP(c) subunits, GTF2B (TFIIB), BRF2, pol II, and pol III. Among some seventy loci occupied by SNAP(c) and other factors, including pol II snRNA genes, pol III genes with type 3 promoters, and a few un-annotated loci, most are primarily occupied by either pol II and GTF2B, or pol III and BRF2. A notable exception is the RPPH1 gene, which is occupied by significant amounts of both polymerases. We show that the large majority of SNAP(c)-dependent promoters recruit POU2F1 and/or ZNF143 on their enhancer region, and a subset also recruits GABP, a factor newly implicated in SNAP(c)-dependent transcription. These activators associate with pol II and III promoters in G1 slightly before the polymerase, and ZNF143 is required for efficient transcription initiation complex assembly. The results characterize a set of genes with unique properties and establish that polymerase specificity is not absolute in vivo.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Ten years ago, the first cellular receptor for the prototypic arenavirus lymphocytic choriomeningitis virus (LCMV) and the highly pathogenic Lassa virus (LASV) was identified as alpha-dystroglycan (alpha-DG), a versatile receptor for proteins of the extracellular matrix (ECM). Biochemical analysis of the interaction of alpha-DG with arenaviruses and ECM proteins revealed a strikingly similar mechanism of receptor recognition that critically depends on specific sugar modification on alpha-DG involving a novel class of putative glycosyltransferase, the LARGE proteins. Interestingly, recent genome-wide detection and characterization of positive selection in human populations revealed evidence for positive selection of a locus within the LARGE gene in populations from Western Africa, where LASV is endemic. While most enveloped viruses that enter the host cell in a pH-dependent manner use clathrin-mediated endocytosis, recent studies revealed that the Old World arenaviruses LCMV and LASV enter the host cell predominantly via a novel and unusual endocytotic pathway independent of clathrin, caveolin, dynamin, and actin. Upon internalization, the virus is rapidly delivered to endosomes via an unusual route of vesicular trafficking that is largely independent of the small GTPases Rab5 and Rab7. Since infection of cells with LCMV and LASV depends on DG, this unusual endocytotic pathway could be related to normal cellular trafficking of the DG complex. Alternatively, engagement of arenavirus particles may target DG for an endocytotic pathway not normally used in uninfected cells thereby inducing an entry route specifically tailored to the pathogen's needs.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Recently, a locus centred on rs9273349 in the HLA-DQ region emerged from genome-wide association studies of adult-onset asthma. We aimed to further investigate the role of human leukocyte antigen (HLA) class II in adult-onset asthma and a possible interaction with occupational exposures. We imputed classical HLA-II alleles from 7579 single-nucleotide polymorphisms in 6025 subjects (1202 with adult-onset asthma) from European cohorts: ECRHS, SAPALDIA, EGEA and B58C, and from surveys of bakers and agricultural workers. Based on an asthma-specific job-exposure matrix, 2629 subjects had ever been exposed to high molecular weight (HMW) allergens. We explored associations between 23 common HLA-II alleles and adult-onset asthma, and tested for gene-environment interaction with occupational exposure to HMW allergens. Interaction was also tested for rs9273349. Marginal associations of classical HLA-II alleles and adult-onset asthma were not statistically significant. Interaction was detected between the DPB1*03:01 allele and exposure to HMW allergens (p = 0.009), in particular to latex (p = 0.01). In the unexposed group, the DPB1*03:01 allele was associated with adult-onset asthma (OR 0.67, 95%CI 0.53-0.86). HMW allergen exposures did not modify the association of rs9273349 with adult-onset asthma. Common classical HLA-II alleles were not marginally associated with adult-onset asthma. The association of latex exposure and adult-onset asthma may be modified by DPB1*03:01.