35 resultados para Genome wide mapping
Resumo:
BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a powerful tool for genome-wide transcription studies. Unlike microarrays, it has the ability to detect novel forms of RNA such as alternatively spliced and antisense transcripts, without the need for prior knowledge of their existence. One limitation of using SAGE on an organism with a complex genome and lacking detailed sequence information, such as the hexaploid bread wheat Triticum aestivum, is accurate annotation of the tags generated. Without accurate annotation it is impossible to fully understand the dynamic processes involved in such complex polyploid organisms. Hence we have developed and utilised novel procedures to characterise, in detail, SAGE tags generated from the whole grain transcriptome of hexaploid wheat. RESULTS: Examination of 71,930 Long SAGE tags generated from six libraries derived from two wheat genotypes grown under two different conditions suggested that SAGE is a reliable and reproducible technique for use in studying the hexaploid wheat transcriptome. However, our results also showed that in poorly annotated and/or poorly sequenced genomes, such as hexaploid wheat, considerably more information can be extracted from SAGE data by carrying out a systematic analysis of both perfect and "fuzzy" (partially matched) tags. This detailed analysis of the SAGE data shows first that while there is evidence of alternative polyadenylation this appears to occur exclusively within the 3' untranslated regions. Secondly, we found no strong evidence for widespread alternative splicing in the developing wheat grain transcriptome. However, analysis of our SAGE data shows that antisense transcripts are probably widespread within the transcriptome and appear to be derived from numerous locations within the genome. Examination of antisense transcripts showing sequence similarity to the Puroindoline a and Puroindoline b genes suggests that such antisense transcripts might have a role in the regulation of gene expression. CONCLUSION: Our results indicate that the detailed analysis of transcriptome data, such as SAGE tags, is essential to understand fully the factors that regulate gene expression and that such analysis of the wheat grain transcriptome reveals that antisense transcripts maybe widespread and hence probably play a significant role in the regulation of gene expression during grain development.
Resumo:
Mean platelet volume (MPV) and platelet count (PLT) are highly heritable and tightly regulated traits. We performed a genome-wide association study for MPV and identified one SNP, rs342293, as having highly significant and reproducible association with MPV (per-G allele effect 0.016 +/- 0.001 log fL; P < 1.08 x 10(-24)) and PLT (per-G effect -4.55 +/- 0.80 10(9)/L; P < 7.19 x 10(-8)) in 8586 healthy subjects. Whole-genome expression analysis in the 1-MB region showed a significant association with platelet transcript levels for PIK3CG (n = 35; P = .047). The G allele at rs342293 was also associated with decreased binding of annexin V to platelets activated with collagen-related peptide (n = 84; P = .003). The region 7q22.3 identifies the first QTL influencing platelet volume, counts, and function in healthy subjects. Notably, the association signal maps to a chromosome region implicated in myeloid malignancies, indicating this site as an important regulatory site for hematopoiesis. The identification of loci regulating MPV by this and other studies will increase our insight in the processes of megakaryopoiesis and proplatelet formation, and it may aid the identification of genes that are somatically mutated in essential thrombocytosis. (Blood. 2009; 113: 3831-3837)
Resumo:
Supplementation of diets with plant extracts such as ginkgo biloba extract (EGb 761®) (definition see editorial) for health and prevention of degenerative diseases is popular. However, it is often difficult to analyse the biological activities of plant extracts due to their complex nature and the possible synergistic and/or antagonistic effects of their components. Genome-wide expression monitoring with high-density oligonucleotide arrays provides one way to examine the molecular targets of plant extracts and may prove a useful tool in evaluating their therapeutic claims. Here, we will briefly describe some of our work on the effect of EGb 761® on differential gene expression in relation to its potential anti-carcinogenic, photoprotective and neuromodulatory properties.
Resumo:
Objective: SNPs identified from genome wide association studies associate with lipid risk markers of cardiovascular disease. This study investigated whether these SNPs altered the plasma lipid response to diet in the ‘RISCK’ study cohort. Methods: Participants (n = 490) from a dietary intervention to lower saturated fat by replacement with carbohydrate or monounsaturated fat, were genotyped for 39 lipid-associated SNPs. The association of each individual SNP, and of the SNPs combined (using genetic predisposition scores), with plasma lipid concentrations was assessed at baseline, and on change in response to 24 weeks on diets. Results: The associations between SNPs and lipid concentrations were directionally consistent with previous findings. The genetic predisposition scores were associated with higher baseline concentrations of plasma total(P = 0.02) and LDL (P = 0.002) cholesterol, triglycerides (P = 0.001) and apolipoprotein B (P = 0.004), and with lower baseline concentrations of HDL cholesterol (P < 0.001) and apolipoprotein A-I (P < 0.001). None of the SNPs showed significant association with the reduction of plasma lipids in response to the dietary interventions and there was no evidence of diet-gene interactions. Conclusion: Results from this exploratory study have shown that increased genetic predisposition was associated with an unfavourable plasma lipid profile at baseline, but did not influence the improvement in lipid profiles by the low-saturated-fat diets.
Resumo:
A study or experiment can be described as sequential if its design includes one or more interim analyses at which it is possible to stop the study, having reached a definitive conclusion concerning the primary question of interest. The potential of the sequential study to terminate earlier than the equivalent fixed sample size study means that, typically, there are ethical and economic advantages to be gained from using a sequential design. These advantages have secured a place for the methodology in the conduct of many clinical trials of novel therapies. Recently, there has been increasing interest in pharmacogenetics: the study of how DNA variation in the human genome affects the safety and efficacy of drugs. The potential for using sequential methodology in pharmacogenetic studies is considered and the conduct of candidate gene association studies, family-based designs and genome-wide association studies within the sequential setting is explored. The objective is to provide a unified framework for the conduct of these types of studies as sequential designs and hence allow experimenters to consider using sequential methodology in their future pharmacogenetic studies.
Resumo:
Red meat consumption is associated with an increased colorectal cancer (CRC) risk, which may be due to an increased endogenous formation of genotoxic N-nitroso compounds (NOCs). To assess the impact of red meat consumption on potential risk factors of CRC, we investigated the effect of a 7-day dietary red meat intervention in human subjects on endogenous NOC formation and fecal water genotoxicity in relation to genome-wide transcriptomic changes induced in colonic tissue. The intervention showed no effect on fecal NOC excretion but fecal water genotoxicity significantly increased in response to red meat intake. Colonic inflammation caused by inflammatory bowel disease, which has been suggested to stimulate endogenous nitrosation, did not influence fecal NOC excretion or fecal water genotoxicity. Transcriptomic analyses revealed that genes significantly correlating with the increase in fecal water genotoxicity were involved in biological pathways indicative of genotoxic effects, including modifications in DNA damage repair, cell cycle, and apoptosis pathways. Moreover, WNT signaling and nucleosome remodeling pathways were modulated which are implicated in human CRC development. We conclude that the gene expression changes identified in this study corroborate the genotoxic potential of diets high in red meat and point towards a potentially increased CRC risk in humans.
Resumo:
The accurate prediction of the biochemical function of a protein is becoming increasingly important, given the unprecedented growth of both structural and sequence databanks. Consequently, computational methods are required to analyse such data in an automated manner to ensure genomes are annotated accurately. Protein structure prediction methods, for example, are capable of generating approximate structural models on a genome-wide scale. However, the detection of functionally important regions in such crude models, as well as structural genomics targets, remains an extremely important problem. The method described in the current study, MetSite, represents a fully automatic approach for the detection of metal-binding residue clusters applicable to protein models of moderate quality. The method involves using sequence profile information in combination with approximate structural data. Several neural network classifiers are shown to be able to distinguish metal sites from non-sites with a mean accuracy of 94.5%. The method was demonstrated to identify metal-binding sites correctly in LiveBench targets where no obvious metal-binding sequence motifs were detectable using InterPro. Accurate detection of metal sites was shown to be feasible for low-resolution predicted structures generated using mGenTHREADER where no side-chain information was available. High-scoring predictions were observed for a recently solved hypothetical protein from Haemophilus influenzae, indicating a putative metal-binding site.
Resumo:
Genome-wide association studies have identified SNPs reproducibly associated with type 2 diabetes (T2D). We examined the effect of genetic predisposition to T2D on insulin sensitivity and secretion using detailed phenotyping in overweight individuals with no diagnosis of T2D. Furthermore, we investigated whether this genetic predisposition modifies the responses in beta-cell function and insulin sensitivity to a 24-week dietary intervention. We genotyped 25 T2D-associated SNPs in 377 white participants from the RISCK study. Participants underwent an IVGTT prior to and following a dietary intervention that aimed to lower saturated fat intake by replacement with monounsaturated fat or carbohydrate. We composed a genetic predisposition score (T2D-GPS) by summing the T2D risk-increasing alleles of the 25 SNPs and tested for association with insulin secretion and sensitivity at baseline, and with the change in response to the dietary intervention. At baseline, a higher T2D-GPS was associated with lower acute insulin secretion (AIRg 4% lower/risk allele, P = 0.006) and lower insulin secretion for a given level of insulin sensitivity, assessed by the disposition index (DI 5% lower/risk allele, P = 0.002), but not with insulin sensitivity (Si). T2D-GPS did not modify changes in insulin secretion, insulin sensitivity or the disposition index in response to the dietary interventions to lower saturated fat. Participants genetically predisposed to T2D have an impaired ability to compensate for peripheral insulin resistance with insulin secretion at baseline, but this does not modify the response to a reduction in dietary saturated fat through iso-energetic replacement with carbohydrate or monounsaturated fat.
Resumo:
Within the healthy population, there is substantial, heritable, and interindividual variability in the platelet response. We explored whether a proportion of this variability could be accounted for by interindividual variation in gene expression. Through a correlative analysis of genome-wide platelet RNA expression data from 37 subjects representing the normal range of platelet responsiveness within a cohort of 500 subjects, we identified 63 genes in which transcript levels correlated with variation in the platelet response to adenosine diphosphate and/or the collagen-mimetic peptide, cross-linked collagen-related peptide. Many of these encode proteins with no reported function in platelets. An association study of 6 of the 63 genes in 4235 cases and 6379 controls showed a putative association with myocardial infarction for COMMD7 (COMM domain-containing protein 7) and a major deviation from the null hypo thesis for LRRFIP1 [leucine-rich repeat (in FLII) interacting protein 1]. Morpholino-based silencing in Danio rerio identified a modest role for commd7 and a significant effect for lrrfip1 as positive regulators of thrombus formation. Proteomic analysis of human platelet LRRFIP1-interacting proteins indicated that LRRFIP1 functions as a component of the platelet cytoskeleton, where it interacts with the actin-remodeling proteins Flightless-1 and Drebrin. Taken together, these data reveal novel proteins regulating the platelet response.
Resumo:
Gene expression is a quantitative trait that can be mapped genetically in structured populations to identify expression quantitative trait loci (eQTL). Genes and regulatory networks underlying complex traits can subsequently be inferred. Using a recently released genome sequence, we have defined cis- and trans-eQTL and their environmental response to low phosphorus (P) availability within a complex plant genome and found hotspots of trans-eQTL within the genome. Interval mapping, using P supply as a covariate, revealed 18,876 eQTL. trans-eQTL hotspots occurred on chromosomes A06 and A01 within Brassica rapa; these were enriched with P metabolism-related Gene Ontology terms (A06) as well as chloroplast-and photosynthesis-related terms (A01). We have also attributed heritability components to measures of gene expression across environments, allowing the identification of novel gene expression markers and gene expression changes associated with low P availability. Informative gene expression markers were used to map eQTL and P use efficiency-related QTL. Genes responsive to P supply had large environmental and heritable variance components. Regulatory loci and genes associated with P use efficiency identified through eQTL analysis are potential targets for further characterization and may have potential for crop improvement.
Resumo:
The prevalence of obesity and diabetes, which are heritable traits that arise from the interactions of multiple genes and lifestyle factors, continues to rise worldwide, causing serious health problems and imposing a substantial economic burden on societies. For the past 15 years, candidate gene and genome-wide linkage studies have been the main genetic epidemiological approaches to identify genetic loci for obesity and diabetes, yet progress has been slow and success limited. The genome-wide association approach, which has become available in recent years, has dramatically changed the pace of gene discoveries. Genome-wide association is a hypothesis-generating approach that aims to identify new loci associated with the disease or trait of interest. So far, three waves of large-scale genome-wide association studies have identified 19 loci for common obesity and 18 for common type 2 diabetes. Although the combined contribution of these loci to the variation in obesity and diabetes risk is small and their predictive value is typically low, these recently identified loci are set to substantially improve our insights into the pathophysiology of obesity and diabetes. This will require integration of genetic epidemiological methods with functional genomics and proteomics. However, the use of these novel insights for genetic screening and personalised treatment lies some way off in the future.
Resumo:
The INSIG2 rs7566605 polymorphism was identified for obesity (BMI> or =30 kg/m(2)) in one of the first genome-wide association studies, but replications were inconsistent. We collected statistics from 34 studies (n = 74,345), including general population (GP) studies, population-based studies with subjects selected for conditions related to a better health status ('healthy population', HP), and obesity studies (OB). We tested five hypotheses to explore potential sources of heterogeneity. The meta-analysis of 27 studies on Caucasian adults (n = 66,213) combining the different study designs did not support overall association of the CC-genotype with obesity, yielding an odds ratio (OR) of 1.05 (p-value = 0.27). The I(2) measure of 41% (p-value = 0.015) indicated between-study heterogeneity. Restricting to GP studies resulted in a declined I(2) measure of 11% (p-value = 0.33) and an OR of 1.10 (p-value = 0.015). Regarding the five hypotheses, our data showed (a) some difference between GP and HP studies (p-value = 0.012) and (b) an association in extreme comparisons (BMI> or =32.5, 35.0, 37.5, 40.0 kg/m(2) versus BMI<25 kg/m(2)) yielding ORs of 1.16, 1.18, 1.22, or 1.27 (p-values 0.001 to 0.003), which was also underscored by significantly increased CC-genotype frequencies across BMI categories (10.4% to 12.5%, p-value for trend = 0.0002). We did not find evidence for differential ORs (c) among studies with higher than average obesity prevalence compared to lower, (d) among studies with BMI assessment after the year 2000 compared to those before, or (e) among studies from older populations compared to younger. Analysis of non-Caucasian adults (n = 4889) or children (n = 3243) yielded ORs of 1.01 (p-value = 0.94) or 1.15 (p-value = 0.22), respectively. There was no evidence for overall association of the rs7566605 polymorphism with obesity. Our data suggested an association with extreme degrees of obesity, and consequently heterogeneous effects from different study designs may mask an underlying association when unaccounted for. The importance of study design might be under-recognized in gene discovery and association replication so far.
Resumo:
The first genome-wide association study for BMI identified a polymorphism, rs7566605, 10 kb upstream of the insulin-induced gene 2 (INSIG2) transcription start site, as the most significantly associated variant in children and adults. Subsequent studies, however, showed inconsistent association of this polymorphism with obesity traits. This polymorphism has been hypothesized to alter INSIG2 expression leading to inhibition of fatty acid and cholesterol synthesis. Hence, we investigated the association of the INSIG2 rs7566605 polymorphism with obesity- and lipid-related traits in Danish and Estonian children (930 boys and 1,073 girls) from the European Youth Heart Study (EYHS), a school-based, cross-sectional study of pre- and early pubertal children. The association between the polymorphism and obesity traits was tested using additive and recessive models adjusted for age, age-group, gender, maturity and country. Interactions were tested by including the interaction terms in the model. Despite having sufficient power (98%) to detect the previously reported effect size for association with BMI, we did not find significant effects of rs7566605 on BMI (additive, P = 0.68; recessive, P = 0.24). Accordingly, the polymorphism was not associated with overweight (P = 0.87) or obesity (P = 0.34). We also did not find association with waist circumference (WC), sum of four skinfolds, or with total cholesterol, triglycerides, low-density lipoprotein, or high-density lipoprotein. There were no gender-specific (P = 0.55), age-group-specific (P = 0.63) or country-specific (P = 0.56) effects. There was also no evidence of interaction between genotype and physical activity (P = 0.95). Despite an adequately powered study, our findings suggest that rs7566605 is not associated with obesity-related traits and lipids in the EYHS.
Resumo:
Common variants at only two loci, FTO and MC4R, have been reproducibly associated with body mass index (BMI) in humans. To identify additional loci, we conducted meta-analysis of 15 genome-wide association studies for BMI (n > 32,000) and followed up top signals in 14 additional cohorts (n > 59,000). We strongly confirm FTO and MC4R and identify six additional loci (P < 5 x 10(-8)): TMEM18, KCTD15, GNPDA2, SH2B1, MTCH2 and NEGR1 (where a 45-kb deletion polymorphism is a candidate causal variant). Several of the likely causal genes are highly expressed or known to act in the central nervous system (CNS), emphasizing, as in rare monogenic forms of obesity, the role of the CNS in predisposition to obesity.
Resumo:
LRRK2 was identified in 2004 as the causative protein product of the Parkinson’s disease locus designated PARK8. In the decade since then, genetic studies have revealed at least 6 dominant mutations in LRRK2 linked to Parkinson’s disease, alongside one associated with cancer. It is now well established that coding changes in LRRK2 are one of the most common causes of Parkinson’s. Genome-wide association studies (GWAs) have, more recently, reported single nucleotide polymorphisms (SNPs) around the LRRK2 locus to be associated with risk of developing sporadic Parkinson’s disease and inflammatory bowel disorder. The functional research that has followed these genetic breakthroughs has generated an extensive literature regarding LRRK2 pathophysiology; however, there is still no consensus as to the biological function of LRRK2. To provide insight into the aspects of cell biology that are consistently related to LRRK2 activity, we analysed the plethora of candidate LRRK2 interactors available through the BioGRID and IntAct data repositories. We then performed GO terms enrichment for the LRRK2 interactome. We found that, in two different enrichment portals, the LRRK2 interactome was associated with terms referring to transport, cellular organization, vesicles and the cytoskeleton. We also verified that 21 of the LRRK2 interactors are genetically linked to risk for Parkin- son’s disease or inflammatory bowel disorder. The implications of these findings are discussed, with particular regard to potential novel areas of investigation.