12 resultados para Genomic Regions
em BORIS: Bern Open Repository and Information System - Berna - Suiça
Resumo:
Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these risk variants. It has previously been observed that different genes harboring causal mutations for the same Mendelian disease often physically interact. We sought to evaluate the degree to which this is true of genes within strongly associated loci in complex disease. Using sets of loci defined in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more densely connected than chance expectation. To confirm biological relevance, we show that the components of the networks tend to be expressed in similar tissues relevant to the phenotypes in question, suggesting the network indicates common underlying processes perturbed by risk loci. Furthermore, we show that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non-immune traits to assess its applicability to complex traits in general. We find that genes in loci associated to height and lipid levels assemble into significantly connected networks but did not detect excess connectivity among Type 2 Diabetes (T2D) loci beyond chance. Taken together, our results constitute evidence that, for many of the complex diseases studied here, common genetic associations implicate regions encoding proteins that physically interact in a preferential manner, in line with observations in Mendelian disease.
Resumo:
Background Levels of differentiation among populations depend both on demographic and selective factors: genetic drift and local adaptation increase population differentiation, which is eroded by gene flow and balancing selection. We describe here the genomic distribution and the properties of genomic regions with unusually high and low levels of population differentiation in humans to assess the influence of selective and neutral processes on human genetic structure. Methods Individual SNPs of the Human Genome Diversity Panel (HGDP) showing significantly high or low levels of population differentiation were detected under a hierarchical-island model (HIM). A Hidden Markov Model allowed us to detect genomic regions or islands of high or low population differentiation. Results Under the HIM, only 1.5% of all SNPs are significant at the 1% level, but their genomic spatial distribution is significantly non-random. We find evidence that local adaptation shaped high-differentiation islands, as they are enriched for non-synonymous SNPs and overlap with previously identified candidate regions for positive selection. Moreover there is a negative relationship between the size of islands and recombination rate, which is stronger for islands overlapping with genes. Gene ontology analysis supports the role of diet as a major selective pressure in those highly differentiated islands. Low-differentiation islands are also enriched for non-synonymous SNPs, and contain an overly high proportion of genes belonging to the 'Oncogenesis' biological process. Conclusions Even though selection seems to be acting in shaping islands of high population differentiation, neutral demographic processes might have promoted the appearance of some genomic islands since i) as much as 20% of islands are in non-genic regions ii) these non-genic islands are on average two times shorter than genic islands, suggesting a more rapid erosion by recombination, and iii) most loci are strongly differentiated between Africans and non-Africans, a result consistent with known human demographic history.
Resumo:
As part of the European research consortium IBDase, we addressed the role of proteases and protease inhibitors (P/PIs) in inflammatory bowel disease (IBD), characterized by chronic mucosal inflammation of the gastrointestinal tract, which affects 2.2 million people in Europe and 1.4 million people in North America. We systematically reviewed all published genetic studies on populations of European ancestry (67 studies on Crohn's disease [CD] and 37 studies on ulcerative colitis [UC]) to identify critical genomic regions associated with IBD. We developed a computer algorithm to map the 807 P/PI genes with exact genomic locations listed in the MEROPS database of peptidases onto these critical regions and to rank P/PI genes according to the accumulated evidence for their association with CD and UC. 82 P/PI genes (75 coding for proteases and 7 coding for protease inhibitors) were retained for CD based on the accumulated evidence. The cylindromatosis/turban tumor syndrome gene (CYLD) on chromosome 16 ranked highest, followed by acylaminoacyl-peptidase (APEH), dystroglycan (DAG1), macrophage-stimulating protein (MST1) and ubiquitin-specific peptidase 4 (USP4), all located on chromosome 3. For UC, 18 P/PI genes were retained (14 proteases and 4 protease inhibitors), with a considerably lower amount of accumulated evidence. The ranking of P/PI genes as established in this systematic review is currently used to guide validation studies of candidate P/PI genes, and their functional characterization in interdisciplinary mechanistic studies in vitro and in vivo as part of IBDase. The approach used here overcomes some of the problems encountered when subjectively selecting genes for further evaluation and could be applied to any complex disease and gene family.
Resumo:
Arabidopsis thaliana has emerged as a leading model species in plant genetics and functional genomics including research on the genetic causes of heterosis. We applied a triple testcross (TTC) design and a novel biometrical approach to identify and characterize quantitative trait loci (QTL) for heterosis of five biomass-related traits by (i) estimating the number, genomic positions, and genetic effects of heterotic QTL, (ii) characterizing their mode of gene action, and (iii) testing for presence of epistatic effects by a genomewide scan and marker x marker interactions. In total, 234 recombinant inbred lines (RILs) of Arabidopsis hybrid C24 x Col-0 were crossed to both parental lines and their F1 and analyzed with 110 single-nucleotide polymorphism (SNP) markers. QTL analyses were conducted using linear transformations Z1, Z2, and Z3 calculated from the adjusted entry means of TTC progenies. With Z1, we detected 12 QTL displaying augmented additive effects. With Z2, we mapped six QTL for augmented dominance effects. A one-dimensional genome scan with Z3 revealed two genomic regions with significantly negative dominance x additive epistatic effects. Two-way analyses of variance between marker pairs revealed nine digenic epistatic interactions: six reflecting dominance x dominance effects with variable sign and three reflecting additive x additive effects with positive sign. We conclude that heterosis for biomass-related traits in Arabidopsis has a polygenic basis with overdominance and/or epistasis being presumably the main types of gene action.
Resumo:
Heterosis is widely used in breeding, but the genetic basis of this biological phenomenon has not been elucidated. We postulate that additive and dominance genetic effects as well as two-locus interactions estimated in classical QTL analyses are not sufficient for quantifying the contributions of QTL to heterosis. A general theoretical framework for determining the contributions of different types of genetic effects to heterosis was developed. Additive x additive epistatic interactions of individual loci with the entire genetic background were identified as a major component of midparent heterosis. On the basis of these findings we defined a new type of heterotic effect denoted as augmented dominance effect di* that comprises the dominance effect at each QTL minus half the sum of additive x additive interactions with all other QTL. We demonstrate that genotypic expectations of QTL effects obtained from analyses with the design III using testcrosses of recombinant inbred lines and composite-interval mapping precisely equal genotypic expectations of midparent heterosis, thus identifying genomic regions relevant for expression of heterosis. The theory for QTL mapping of multiple traits is extended to the simultaneous mapping of newly defined genetic effects to improve the power of QTL detection and distinguish between dominance and overdominance.
Resumo:
Echinococcus multilocularis is characterised by a wide geographical distribution, encompassing three continents (North America, Asia and Europe) yet very low genetic variability is documented. Recently, this parasite has been detected in red foxes (Vulpes vulpes) circulating in an Alpine region of Italy, close to Austria. This finding raised the question as to whether an autochthonous cycle exists in Italy or whether the infected foxes originated from the neighbouring regions of Austria. Studies have shown that multi-locus microsatellite analysis can identify genomic regions carrying mutations that result in a local adaptation. We used a tandem repeated multi-locus microsatellite (EmsB) to evaluate the genetic differences amongst adult worms of E. multilocularis collected in Italy, worms from neighbouring Austria and from other European and extra-European countries. Fluorescent PCR was performed on a panel of E. multilocularis samples to assess intra-specific polymorphism. The analysis revealed four closed genotypes for Italian samples of E. multilocularis which were unique compared with the other 25 genotypes from Europe and the five genotypes from Alaska. An analysis in the Alpine watershed, comparing Italian adult worms with those from neighbouring areas in Austria, showed a unique cluster for Italian samples. This result supports the hypothesis of the presence of an autochthonous cycle of E. multilocularis in Italy. EmsB can be useful for 'tracking' the source of infection of this zoonotic parasite and developing appropriate measures for preventing or reducing the risk of human alveolar echinococcosis.
Resumo:
Intense selective pressures applied over short evolutionary time have resulted in homogeneity within, but substantial variation among, horse breeds. Utilizing this population structure, 744 individuals from 33 breeds, and a 54,000 SNP genotyping array, breed-specific targets of selection were identified using an F(ST)-based statistic calculated in 500-kb windows across the genome. A 5.5-Mb region of ECA18, in which the myostatin (MSTN) gene was centered, contained the highest signature of selection in both the Paint and Quarter Horse. Gene sequencing and histological analysis of gluteal muscle biopsies showed a promoter variant and intronic SNP of MSTN were each significantly associated with higher Type 2B and lower Type 1 muscle fiber proportions in the Quarter Horse, demonstrating a functional consequence of selection at this locus. Signatures of selection on ECA23 in all gaited breeds in the sample led to the identification of a shared, 186-kb haplotype including two doublesex related mab transcription factor genes (DMRT2 and 3). The recent identification of a DMRT3 mutation within this haplotype, which appears necessary for the ability to perform alternative gaits, provides further evidence for selection at this locus. Finally, putative loci for the determination of size were identified in the draft breeds and the Miniature horse on ECA11, as well as when signatures of selection surrounding candidate genes at other loci were examined. This work provides further evidence of the importance of MSTN in racing breeds, provides strong evidence for selection upon gait and size, and illustrates the potential for population-based techniques to find genomic regions driving important phenotypes in the modern horse.
Resumo:
OBJECTIVES This study sought to identify nonredundant atrial fibrillation (AF) genetic susceptibility signals and examine their cumulative relations with AF risk. BACKGROUND AF-associated loci span broad genomic regions that may contain multiple susceptibility signals. Whether multiple signals exist at AF loci has not been systematically explored. METHODS We performed association testing conditioned on the most significant, independently associated genetic markers at 9 established AF loci using 2 complementary techniques in 64,683 individuals of European ancestry (3,869 incident and 3,302 prevalent AF cases). Genetic risk scores were created and tested for association with AF in Europeans and an independent sample of 11,309 individuals of Japanese ancestry (7,916 prevalent AF cases). RESULTS We observed at least 4 distinct AF susceptibility signals on chromosome 4q25 upstream of PITX2, but not at the remaining 8 AF loci. A multilocus score comprised 12 genetic markers demonstrated an estimated 5-fold gradient in AF risk. We observed a similar spectrum of risk associated with these markers in Japanese. Regions containing AF signals on chromosome 4q25 displayed a greater degree of evolutionary conservation than the remainder of the locus, suggesting that they may tag regulatory elements. CONCLUSIONS The chromosome 4q25 AF locus is architecturally complex and harbors at least 4 AF susceptibility signals in individuals of European ancestry. Similar polygenic AF susceptibility exists between Europeans and Japanese. Future work is necessary to identify causal variants, determine mechanisms by which associated loci predispose to AF, and explore whether AF susceptibility signals classify individuals at risk for AF and related morbidity.
Genome-Wide Analyses Suggest Mechanisms Involving Early B-Cell Development in Canine IgA Deficiency.
Resumo:
Immunoglobulin A deficiency (IgAD) is the most common primary immune deficiency disorder in both humans and dogs, characterized by recurrent mucosal tract infections and a predisposition for allergic and other immune mediated diseases. In several dog breeds, low IgA levels have been observed at a high frequency and with a clinical resemblance to human IgAD. In this study, we used genome-wide association studies (GWAS) to identify genomic regions associated with low IgA levels in dogs as a comparative model for human IgAD. We used a novel percentile groups-approach to establish breed-specific cut-offs and to perform analyses in a close to continuous manner. GWAS performed in four breeds prone to low IgA levels (German shepherd, Golden retriever, Labrador retriever and Shar-Pei) identified 35 genomic loci suggestively associated (p <0.0005) to IgA levels. In German shepherd, three genomic regions (candidate genes include KIRREL3 and SERPINA9) were genome-wide significantly associated (p <0.0002) with IgA levels. A ~20kb long haplotype on CFA28, significantly associated (p = 0.0005) to IgA levels in Shar-Pei, was positioned within the first intron of the gene SLIT1. Both KIRREL3 and SLIT1 are highly expressed in the central nervous system and in bone marrow and are potentially important during B-cell development. SERPINA9 expression is restricted to B-cells and peaks at the time-point when B-cells proliferate into antibody-producing plasma cells. The suggestively associated regions were enriched for genes in Gene Ontology gene sets involving inflammation and early immune cell development.
Resumo:
Hybrid zones are regions where individuals from genetically differentiated populations meet and mate, resulting in at least some offspring of mixed ancestry. Patterns of gene flow (introgression) in hybrid zones vary across the genome, allowing assessment of the role of individual genes or genome regions in reproductive isolation. Here, we document patterns of introgression between two recently diverged species of field crickets. We sampled at a very fine spatial scale and genotyped crickets for 110 highly differentiated single nucleotide polymorphisms (SNPs) identified through transcriptome scans. Using both genomic and geographic cline analysis, we document remarkably abrupt transitions (<100 m) in allele frequencies for 50 loci, despite high levels of gene flow at other loci. These are among the steepest clines documented for any hybridizing taxa. Furthermore, the cricket hybrid zone provides one of the clearest examples of the semi-permeability of species boundaries. Comparisons between data from the fine-scale transect and data (for the same set of markers) from sampling a much larger area in a different region of the cricket hybrid zone reveal consistent patterns of introgression for individual loci. The consistency in patterns of introgression between these two distant and distinct regions of the hybrid zone suggests that strong selection is acting to maintain abrupt discontinuities within the hybrid zone and that genomic regions with restricted introgression likely include genes that contribute to nonecological prezygotic barriers.
Resumo:
Background: Speciation reversal: the erosion of species differentiation via an increase in introgressive hybridization due to the weakening of previously divergent selection regimes, is thought to be an important, yet poorly understood, driver of biodiversity loss. Our study system, the Alpine whitefish (Coregonus spp.) species complex is a classic example of a recent postglacial adaptive radiation: forming an array of endemic lake flocks, with the independent origination of similar ecotypes among flocks. However, many of the lakes of the Alpine radiation have been seriously impacted by anthropogenic nutrient enrichment, resulting in a collapse in neutral genetic and phenotypic differentiation within the most polluted lakes. Here we investigate the effects of eutrophication on the selective forces that have shaped this radiation, using population genomics. We studied eight sympatric species assemblages belonging to five independent parallel adaptive radiations, and one species pair in secondary contact. We used AFLP markers, and applied FST outlier (BAYESCAN, DFDIST) and logistic regression analyses (MATSAM), to identify candidate regions for disruptive selection in the genome and their associations with adaptive traits within each lake flock. The number of outlier and adaptive trait associated loci identified per lake were then regressed against two variables (historical phosphorus concentration and contemporary oxygen concentration) representing the strength of eutrophication. Results: Whilst we identify disruptive selection candidate regions in all lake flocks, we find similar trends, across analysis methods, towards fewer disruptive selection candidate regions and fewer adaptive trait/candidate loci associations in the more polluted lakes. Conclusions: Weakened disruptive selection and a concomitant breakdown in reproductive isolating mechanisms in more polluted lakes has lead to increased gene flow between coexisting Alpine whitefish species. We hypothesize that the resulting higher rates of interspecific recombination reduce either the number or extent of genomic islands of divergence surrounding loci evolving under disruptive natural selection. This produces the negative trend seen in the number of selection candidate loci recovered during genome scans of whitefish species flocks, with increasing levels of anthropogenic eutrophication: as the likelihood decreases that AFLP restriction sites will fall within regions of heightened genomic divergence and therefore be classified as FST outlier loci. This study explores for the first time the potential effects of human-mediated relaxation of disruptive selection on heterogeneous genomic divergence between coexisting species.
Resumo:
Herein we provide a detailed molecular analysis of the spatial heterogeneity of clinically localized, multifocal prostate cancer to delineate new oncogenes or tumor suppressors. We initially determined the copy number aberration (CNA) profiles of 74 patients with index tumors of Gleason score 7. Of these, 5 patients were subjected to whole-genome sequencing using DNA quantities achievable in diagnostic biopsies, with detailed spatial sampling of 23 distinct tumor regions to assess intraprostatic heterogeneity in focal genomics. Multifocal tumors are highly heterogeneous for single-nucleotide variants (SNVs), CNAs and genomic rearrangements. We identified and validated a new recurrent amplification of MYCL, which is associated with TP53 deletion and unique profiles of DNA damage and transcriptional dysregulation. Moreover, we demonstrate divergent tumor evolution in multifocal cancer and, in some cases, tumors of independent clonal origin. These data represent the first systematic relation of intraprostatic genomic heterogeneity to predicted clinical outcome and inform the development of novel biomarkers that reflect individual prognosis.