939 results for bacteria genome nucleotide usage
Abstract:
Genome-wide association studies followed by replication provide a powerful approach to map genetic risk factors for asthma. We sought to search for new variants associated with asthma and attempt to replicate the association with four loci reported previously (ORMDL3, PDE4D, DENND1B and IL1RL1). Genome-wide association analyses of individual single nucleotide polymorphisms (SNPs), rare copy number variants (CNVs) and overall CNV burden were carried out in 986 asthma cases and 1846 asthma-free controls from Australia. The most-associated locus in the SNP analysis was ORMDL3 (rs6503525, P = 4.8 × 10⁻⁷). Five other loci were associated with P < 10⁻⁵, most notably the chemokine CXC motif ligand 14 (CXCL14) gene (rs31263, P = 7.8 × 10⁻⁶). We found no evidence for association with the specific risk variants reported recently for PDE4D, DENND1B and IL1RL1. However, a variant in IL1RL1 that is in low linkage disequilibrium with that reported previously was associated with asthma risk after accounting for all variants tested (rs10197862, gene-wide P = 0.01). This association replicated convincingly in an independent cohort (P = 2.4 × 10⁻⁴). A 300-kb deletion on chromosome 17q21 was associated with asthma risk, but this did not reach experiment-wide significance. Asthma cases and controls had comparable CNV rates, length and number of genes affected by deletions or duplications. In conclusion, we confirm the association between asthma risk and variants in ORMDL3 and identify a novel risk variant in IL1RL1. Follow-up of the 17q21 deletion in larger cohorts is warranted.
Abstract:
Serum butyrylcholinesterase (BCHE) activity is associated with obesity, blood pressure and biomarkers of cardiovascular and diabetes risk. We have conducted a genome-wide association scan to discover genetic variants affecting BCHE activity, and to clarify whether the associations between BCHE activity and cardiometabolic risk factors are caused by variation in BCHE or whether BCHE variation is secondary to the metabolic abnormalities. We measured serum BCHE in adolescents and adults from three cohorts of Australian twin and family studies. The genotypes from approximately 2.4 million single-nucleotide polymorphisms (SNPs) were available in 8791 participants with BCHE measurements. We detected significant associations with BCHE activity at three independent groups of SNPs at the BCHE locus (P = 5.8 × 10⁻²⁶², 7.8 × 10⁻⁴⁷, 2.9 × 10⁻¹²) and at four other loci: RNPEP (P = 9.4 × 10⁻¹⁶), RAPH1-ABI2 (P = 4.1 × 10⁻¹⁸), UGT1A1 (P = 4.0 × 10⁻⁸) and an intergenic region on chromosome 8 (P = 1.4 × 10⁻⁸). These loci affecting BCHE activity were not associated with metabolic risk factors. On the other hand, SNPs in genes previously associated with metabolic risk had effects on BCHE activity more often than can be explained by chance. In particular, SNPs within FTO and GCKR were associated with BCHE activity, but their effects were partly mediated by body mass index and triglycerides, respectively. We conclude that variation in BCHE activity is due to multiple variants across the spectrum from uncommon/large effect to common/small effect, and partly results from (rather than causes) metabolic abnormalities.
Abstract:
Colorectal cancer (CRC) is one of the most frequent malignancies in Western countries. Inherited factors have been suggested to be involved in 35% of CRCs. The hereditary CRC syndromes explain only ~6% of all CRCs, indicating that a large proportion of the inherited susceptibility is still unexplained. Much of the remaining genetic predisposition for CRC is probably due to undiscovered low-penetrance variations. This study was conducted to identify germline and somatic changes that contribute to CRC predisposition and tumorigenesis. MLH1 and MSH2, which underlie hereditary non-polyposis colorectal cancer (HNPCC), are considered to be tumor suppressor genes; the first hit is inherited in the germline and somatic inactivation of the wild type allele is required for tumor initiation. In a recent study, frequent loss of the mutant allele in HNPCC tumors was detected and a new model, arguing against the two-hit hypothesis, was proposed for somatic HNPCC tumorigenesis. We tested this hypothesis by conducting LOH analysis on 25 colorectal HNPCC tumors with a known germline mutation in the MLH1 or MSH2 genes. LOH was detected in 56% of the tumors. All the losses targeted the wild type allele, supporting the classical two-hit model for HNPCC tumorigenesis. The variants 3020insC, R702W and G908R in NOD2 predispose to Crohn's disease. Contribution of NOD2 to CRC predisposition has been examined in several case-control series, with conflicting results. We have previously shown that 3020insC does not predispose to CRC in Finnish CRC patients. To expand our previous study, the variants R702W and G908R were genotyped in a population-based series of 1042 Finnish CRC patients and 508 healthy controls. Association analyses did not show significant evidence for association of the variants with CRC. Single nucleotide polymorphism (SNP) rs6983267 at chromosome 8q24 was the first CRC susceptibility variant identified through genome-wide association studies.
To characterize the role of rs6983267 in CRC predisposition in the Finnish population, we genotyped the SNP in the case-control material of 1042 cases and 1012 controls and showed that the G allele of rs6983267 is associated with an increased risk of CRC (OR 1.22; P=0.0018). Examination of allelic imbalance in the tumors heterozygous for rs6983267 revealed that copy number increase affected 22% of the tumors and, interestingly, favored the G allele. By utilizing a computer algorithm, Enhancer Element Locator (EEL), an evolutionarily conserved regulatory motif containing rs6983267 was identified. The SNP affected the binding site of TCF4, a transcription factor that mediates Wnt signaling in cells and has proven to be crucial in colorectal neoplasia. The preferential binding of TCF4 to the risk allele G was shown in vitro and in vivo. The element drove lacZ marker gene expression in mouse embryos in a pattern that is consistent with genes regulated by the Wnt signaling pathway. These results suggest that rs6983267 at 8q24 exerts its effect in CRC predisposition by regulating gene expression. The most obvious target gene for the enhancer element is MYC, residing ~335 kb downstream; however, further studies are required to establish the transcriptional target(s) of the predicted enhancer element.
Abstract:
In this thesis, two separate single nucleotide polymorphism (SNP) genotyping techniques were set up at the Finnish Genome Center, pooled genotyping was evaluated as a screening method for large-scale association studies, and finally, the former approaches were used to identify genetic factors predisposing to two distinct complex diseases by utilizing large epidemiological cohorts and also taking environmental factors into account. The first genotyping platform was based on traditional but improved restriction-fragment-length-polymorphism (RFLP) analysis utilizing 384-well microtiter plates, multiplexing, small reaction volumes (5 µl), and automated genotype calling. We participated in the development of the second genotyping method, based on single nucleotide primer extension (SNuPe™ by Amersham Biosciences), by carrying out the alpha and beta tests for the chemistry and the allele-calling software. Both techniques proved to be accurate, reliable, and suitable for projects with thousands of samples and tens of markers. Pooled genotyping (genotyping of pooled instead of individual DNA samples) was evaluated with Sequenom's MassArray MALDI-TOF, in addition to SNuPe™ and PCR-RFLP techniques. We used MassArray mainly as a point of comparison, because it is known to be well suited for pooled genotyping. All three methods were shown to be accurate, the standard deviations between measurements being 0.017 for the MassArray, 0.022 for the PCR-RFLP, and 0.026 for the SNuPe™. The largest source of error in the process of pooled genotyping was shown to be the volumetric error, i.e., the preparation of pools. We also demonstrated that it would have been possible to narrow down the genetic locus underlying congenital chloride diarrhea (CLD), an autosomal recessive disorder, by using the pooling technique instead of genotyping individual samples.
Although the approach seems to be well suited for traditional case-control studies, it is difficult to apply if any kind of stratification based on environmental factors is needed. Therefore we chose to continue with individual genotyping in the following association studies. Samples in the two separate large epidemiological cohorts were genotyped with the PCR-RFLP and SNuPe™ techniques. The first of these association studies concerned various pregnancy complications among 100,000 consecutive pregnancies in Finland, of which we genotyped 2292 patients and controls, in addition to a population sample of 644 blood donors, with 7 polymorphisms in the potentially thrombotic genes. In this thesis, the analysis of a sub-study of pregnancy-related venous thromboses was included. We showed that the impact of factor V Leiden polymorphism on pregnancy-related venous thrombosis, but not the other tested polymorphisms, was fairly large (odds ratio 11.6; 95% CI 3.6-33.6), and increased multiplicatively when combined with other risk factors such as obesity or advanced age. Owing to our study design, we were also able to estimate the risks at the population level. The second epidemiological cohort was the Helsinki Birth Cohort of men and women who were born during 1924-1933 in Helsinki. The aim was to identify genetic factors that might modify the well known link between small birth size and adult metabolic diseases, such as type 2 diabetes and impaired glucose tolerance. Among ~500 individuals with detailed birth measurements and current metabolic profile, we found that an insertion/deletion polymorphism of the angiotensin converting enzyme (ACE) gene was associated with the duration of gestation, and weight and length at birth. Interestingly, the ACE insertion allele was also associated with higher indices of insulin secretion (p=0.0004) in adult life, but only among individuals who were born small (those among the lowest third of birth weight).
Likewise, low birth weight was associated with higher indices of insulin secretion (p=0.003), but only among carriers of the ACE insertion allele. The association with birth measurements was also found with a common haplotype of the glucocorticoid receptor (GR) gene. Furthermore, the association between short length at birth and adult impaired glucose tolerance was confined to carriers of this haplotype (p=0.007). These associations exemplify the interaction between environmental factors and genotype, which, possibly due to altered gene expression, predisposes to complex metabolic diseases. Indeed, we showed that the common GR gene haplotype associated with reduced mRNA expression in thymus of three individuals (p=0.0002).
Abstract:
Chromosomal alterations in leukemia have been shown to have prognostic and predictive significance and are also important minimal residual disease (MRD) markers in the follow-up of leukemia patients. Although specific oncogenes and tumor suppressors have been discovered in some of the chromosomal alterations, the role and target genes of many alterations in leukemia remain unknown. In addition, a number of leukemia patients have a normal karyotype by standard cytogenetics, but have variability in clinical course and are often molecularly heterogeneous. Cytogenetic methods traditionally used in leukemia analysis and diagnostics, namely G-banding, various fluorescence in situ hybridization (FISH) techniques, and chromosomal comparative genomic hybridization (cCGH), have enormously increased knowledge about the leukemia genome, but have limitations in resolution or in genomic coverage. In the last decade, the development of microarray comparative genomic hybridization (array-CGH, aCGH) for DNA copy number analysis and the SNP microarray (SNP-array) method for simultaneous copy number and loss of heterozygosity (LOH) analysis has enabled investigation of chromosomal and gene alterations genome-wide with high resolution and high throughput. In these studies, genetic alterations were analyzed in acute myeloid leukemia (AML) and chronic lymphocytic leukemia (CLL). The aim was to screen and characterize genomic alterations that could play a role in leukemia pathogenesis by using aCGH and SNP-arrays. One of the most important goals was to screen cryptic alterations in karyotypically normal leukemia patients. In addition, chromosomal changes were evaluated to narrow the target regions, to find new markers, and to obtain tumor suppressor and oncogene candidates.
The work presented here shows the capability of aCGH to detect submicroscopic copy number alterations in leukemia, with information about breakpoints and genes involved in the alterations, and that genome-wide microarray analyses with aCGH and SNP-array are advantageous methods in the research and diagnosis of leukemia. The most important findings were the cryptic changes detected with aCGH in karyotypically normal AML and CLL, characterization of amplified genes in 11q marker chromosomes, detection of deletion-based mechanisms of MLL-ARHGEF12 fusion gene formation, and detection of LOH without copy number alteration in karyotypically normal AML. These alterations harbor candidate oncogenes and tumor suppressors for further studies.
Abstract:
Celiac disease, or gluten intolerance, is triggered by dietary glutens in genetically susceptible individuals and it affects approximately 1% of the Caucasian population. The best known genetic risk factors for celiac disease are HLA DQ2 and DQ8 heterodimers, which are necessary for the development of the disease. However, they alone are not sufficient for disease induction; other risk factors are required. This thesis investigated genetic factors for celiac disease, concentrating on susceptibility loci on chromosomes 5q31-q33, 19p13 and 2q12 previously reported in genome-wide linkage and association studies. In addition, a novel genotyping method for the detection of HLA DQ2 and DQ8 coding haplotypes was validated. This study was conducted using Finnish and Hungarian family materials, and Finnish, Hungarian and Italian case-control materials. Genetic linkage and association were analysed in these materials using candidate gene and fine-mapping approaches. The results confirmed linkage to celiac disease on the chromosomal regions 5q31-q33 and 19p13. Fine-mapping on chromosome 5q31-q33 revealed several modest associations in the region, and highlighted the need for further investigations to locate the causal risk variants. The MYO9B gene on chromosome 19p13 showed evidence for linkage and association particularly with dermatitis herpetiformis, the skin manifestation of celiac disease. This implies a potential difference in the genetic background of the intestinal and skin forms of the disease, although studies on larger sample sets are required. The IL18RAP locus on chromosome 2q12, shown to be associated with celiac disease in a previous genome-wide association study and a subsequent follow-up, showed association in the Hungarian population in this study. The expression of IL18RAP was further investigated in small intestinal tissue and in peripheral blood mononuclear cells. The results showed that IL18RAP is expressed in the relevant tissues.
Two putative isoforms of IL18RAP were detected by Western blot analysis, and the results suggested that the ratios and total levels of these isoforms may contribute to the aetiology of celiac disease. A novel genotyping method for celiac disease-associated HLA haplotypes was also validated in this thesis. The method utilises single-nucleotide polymorphisms tagging these HLA haplotypes with high sensitivity and specificity. Our results suggest that this method is transferable between populations, and it is suitable for large-scale analysis. In conclusion, this doctorate study provides an insight into the roles of the 5q31-q33, MYO9B, IL18RAP and HLA loci in the susceptibility to celiac disease in the Finnish, Hungarian and Italian populations, highlighting the need for further studies at these genetic loci and examination of the function of the candidate genes.
Abstract:
SNPs discovered by genome-wide association studies (GWASs) account for only a small fraction of the genetic variation of complex traits in human populations. Where is the remaining heritability? We estimated the proportion of variance for human height explained by 294,831 SNPs genotyped on 3,925 unrelated individuals using a linear model analysis, and validated the estimation method with simulations based on the observed genotype data. We show that 45% of variance can be explained by considering all SNPs simultaneously. Thus, most of the heritability is not missing but has not previously been detected because the individual effects are too small to pass stringent significance tests. We provide evidence that the remaining heritability is due to incomplete linkage disequilibrium between causal variants and genotyped SNPs, exacerbated by causal variants having lower minor allele frequency than the SNPs explored to date.
Abstract:
Genetic influences account for 30-60% of the variation in personality traits. Attempts to unravel these genetic influences at the molecular level have, so far, been inconclusive. We performed the first genome-wide association study of Cloninger's temperament scales in a sample of 5117 individuals, in order to identify common genetic variants underlying variation in personality. Participants' scores on Harm Avoidance, Novelty Seeking, Reward Dependence, and Persistence were tested for association with 1,252,387 genetic markers. We also performed gene-based association tests and biological pathway analyses. No genetic variants that significantly contribute to personality variation were identified, even though our sample provides over 90% power to detect variants that explain only 1% of the trait variance. This indicates that individual common genetic variants of this size or greater do not contribute to personality trait variation, which has important implications regarding the genetic architecture of personality and the evolutionary mechanisms by which heritable variation is maintained.
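The power figure quoted in this abstract can be roughly checked with a standard 1-df calculation. The sketch below is illustrative only: it assumes unrelated individuals, a single additive variant, and a genome-wide threshold of alpha = 5e-8, none of which the abstract states. The function name `power_additive_snp` is hypothetical.

```python
from statistics import NormalDist


def power_additive_snp(n, variance_explained, alpha):
    """Approximate power of a 1-df association test for a quantitative trait.

    Assumes n unrelated individuals and a variant explaining the given
    fraction of trait variance (both assumptions, not taken from the study).
    """
    # Non-centrality parameter of the 1-df chi-square test statistic.
    ncp = n * variance_explained / (1.0 - variance_explained)
    delta = ncp ** 0.5
    std_normal = NormalDist()
    # Two-sided critical value on the z scale for significance level alpha.
    z_crit = std_normal.inv_cdf(1.0 - alpha / 2.0)
    # P(|Z + delta| > z_crit), which equals the non-central chi-square tail.
    return std_normal.cdf(delta - z_crit) + std_normal.cdf(-delta - z_crit)


# alpha = 5e-8 is an assumed genome-wide threshold, not quoted in the abstract.
power = power_additive_snp(n=5117, variance_explained=0.01, alpha=5e-8)
print(f"approximate power: {power:.3f}")
```

Under these assumptions the result lands above 0.9, in line with the abstract's statement; a different threshold or relatedness among participants would change the number.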
Abstract:
Obesity is globally prevalent and highly heritable, but its underlying genetic factors remain largely elusive. To identify genetic loci for obesity susceptibility, we examined associations between body mass index and approximately 2.8 million SNPs in up to 123,865 individuals with targeted follow up of 42 SNPs in up to 125,931 additional individuals. We confirmed 14 known obesity susceptibility loci and identified 18 new loci associated with body mass index (P < 5 × 10⁻⁸), one of which includes a copy number variant near GPRC5B. Some loci (at MC4R, POMC, SH2B1 and BDNF) map near key hypothalamic regulators of energy balance, and one of these loci is near GIPR, an incretin receptor. Furthermore, genes in other newly associated loci may provide new insights into human body weight regulation.
Abstract:
Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence the phenotype. Genome-wide association (GWA) studies have identified more than 600 variants associated with human traits, but these typically explain small fractions of phenotypic variation, raising questions about the use of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P = 0.016) and that underlie skeletal growth defects (P < 0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented among variants that alter amino-acid structure of proteins and expression levels of nearby genes. Our data explain approximately 10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to approximately 16% of phenotypic variation (approximately 20% of heritable variation). 
Although additional approaches are needed to dissect the genetic architecture of polygenic human traits fully, our findings indicate that GWA studies can identify large numbers of loci that implicate biologically relevant genes and pathways.
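The two projected figures in this abstract (about 16% of phenotypic variation, about 20% of heritable variation) imply a particular heritability for height. The arithmetic below is a minimal sketch that assumes "heritable variation" means heritability times phenotypic variance; the variable names are illustrative.

```python
# Figures quoted in the abstract, expressed as fractions.
explained_now_phenotypic = 0.10   # variation explained by identified variants
projected_phenotypic = 0.16       # projected share of phenotypic variation
projected_heritable = 0.20        # same projection as a share of heritable variation

# If heritable variation = h2 * phenotypic variation, the two projections
# imply h2 = phenotypic share / heritable share.
implied_h2 = projected_phenotypic / projected_heritable

# The currently explained 10% re-expressed as a share of heritable variation.
explained_now_heritable = explained_now_phenotypic / implied_h2

print(f"implied heritability: {implied_h2:.2f}")
print(f"currently explained share of heritable variation: {explained_now_heritable:.3f}")
```

The implied heritability of roughly 0.8 matches the abstract's description of height as highly heritable, though the abstract's figures are themselves rounded.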
Abstract:
Migraine is a common episodic neurological disorder, typically presenting with recurrent attacks of severe headache and autonomic dysfunction. Apart from rare monogenic subtypes, no genetic or molecular markers for migraine have been convincingly established. We identified the minor allele of rs1835740 on chromosome 8q22.1 to be associated with migraine (P = 5.38 × 10⁻⁹, odds ratio = 1.23, 95% CI 1.150-1.324) in a genome-wide association study of 2,731 migraine cases ascertained from three European headache clinics and 10,747 population-matched controls. The association was replicated in 3,202 cases and 40,062 controls for an overall meta-analysis P value of 1.69 × 10⁻¹¹ (odds ratio = 1.18, 95% CI 1.127-1.244). rs1835740 is located between MTDH (astrocyte elevated gene 1, also known as AEG-1) and PGCP (encoding plasma glutamate carboxypeptidase). In an expression quantitative trait study in lymphoblastoid cell lines, transcript levels of MTDH were found to have a significant correlation to rs1835740 (P = 3.96 × 10⁻⁵, permuted threshold for genome-wide significance 7.7 × 10⁻⁵). To our knowledge, our data establish rs1835740 as the first genetic risk factor for migraine.
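The odds ratios, confidence intervals and P values reported in this abstract can be checked against one another. The sketch below recovers the standard error of the log odds ratio from the 95% interval and converts the resulting z statistic to a two-sided P value. It assumes a Wald-type interval that is symmetric on the log scale, which the abstract does not state, and the helper name `p_from_or_ci` is hypothetical.

```python
import math


def p_from_or_ci(odds_ratio, ci_low, ci_high, z_ci=1.96):
    """Back out SE, z and a two-sided P value from an odds ratio and its 95% CI.

    Assumes a Wald interval, symmetric on the log-odds scale (an assumption).
    """
    # Width of the interval on the log scale spans 2 * z_ci standard errors.
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_ci)
    z = math.log(odds_ratio) / se
    # Two-sided normal tail probability: P(|Z| > |z|) = erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return se, z, p


# Discovery stage and overall meta-analysis, using the figures in the abstract.
se_disc, z_disc, p_disc = p_from_or_ci(1.23, 1.150, 1.324)
se_meta, z_meta, p_meta = p_from_or_ci(1.18, 1.127, 1.244)

print(f"discovery:     z = {z_disc:.2f}, p = {p_disc:.2e}")
print(f"meta-analysis: z = {z_meta:.2f}, p = {p_meta:.2e}")
```

Because the published odds ratios and interval bounds are rounded, the recovered P values should only be expected to agree with the reported ones in order of magnitude, not digit for digit.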
Abstract:
Hair morphology is highly differentiated between populations and among people of European ancestry. Whereas hair morphology in East Asian populations has been studied extensively, relatively little is known about the genetics of this trait in Europeans. We performed a genome-wide association scan for hair morphology (straight, wavy, curly) in three Australian samples of European descent. All three samples showed evidence of association implicating the Trichohyalin gene (TCHH), which is expressed in the developing inner root sheath of the hair follicle, and explaining approximately 6% of variance (p = 1.5 × 10⁻³¹). These variants are at their highest frequency in Northern Europeans, paralleling the distribution of the straight-hair EDAR variant in Asian populations.
Abstract:
Blood cells participate in vital physiological processes, and their numbers are tightly regulated so that homeostasis is maintained. Disruption of key regulatory mechanisms underlies many blood-related Mendelian diseases but also contributes to more common disorders, including atherosclerosis. We searched for quantitative trait loci (QTL) for hematology traits through a whole-genome association study, because these could provide new insights into both hemopoietic and disease mechanisms. We tested 1.8 million variants for association with 13 hematology traits measured in 6015 individuals from the Australian and Dutch populations. These traits included hemoglobin composition, platelet counts, and red blood cell and white blood cell indices. We identified three regions of strong association that, to our knowledge, have not been previously reported in the literature. The first was located in an intergenic region of chromosome 9q31 near LPAR1, explaining 1.5% of the variation in monocyte counts (best SNP rs7023923, p = 8.9 × 10⁻¹⁴). The second locus was located on chromosome 6p21 and associated with mean cell erythrocyte volume (rs12661667, p = 1.2 × 10⁻⁹, 0.7% variance explained) in a region that spanned five genes, including CCND3, a member of the D-cyclin gene family that is involved in hematopoietic stem cell expansion. The third region was also associated with erythrocyte volume and was located in an intergenic region on chromosome 6q24 (rs592423, p = 5.3 × 10⁻⁹, 0.6% variance explained). All three loci replicated in an independent panel of 1543 individuals (p values = 0.001, 9.9 × 10⁻⁵, and 7 × 10⁻⁵, respectively). The identification of these QTL provides new opportunities for furthering our understanding of the mechanisms regulating hemopoietic cell fate.
Abstract:
OBJECTIVE: To further investigate a common variant (rs9939609) in the fat mass- and obesity-associated gene (FTO), which recent genome-wide association studies have shown to be associated with body mass index (BMI) and obesity. DESIGN: We examined the effect of this FTO variant on BMI in 3353 Australian adult male and female twins. RESULTS: The minor A allele of rs9939609 was associated with an increased BMI (P=0.0007). Each additional copy of the A allele was associated with a mean BMI increase of approximately 1.04 kg/m² (approximately 3.71 kg). Using variance components decomposition, we estimate that this single-nucleotide polymorphism accounts for approximately 3% of the genetic variance in BMI in our sample (approximately 2% of the total variance). By comparing intrapair variances of monozygotic twins of different genotypes we were able to perform a direct test of gene by environment (G x E) interaction in both sexes and gene by parity (G x P) interaction in women, but no evidence was found for either. CONCLUSIONS: In addition to supporting earlier findings that the rs9939609 variant in the FTO gene is associated with an increased BMI, our results indicate that the associated genetic effect does not interact with environment or parity.