917 resultados para whole genome duplication
Resumo:
Background: In the spondyloarthropathies, the underlying molecular and cellular pathways driving disease are poorly understood. By undertaking a study in knee synovial biopsies from spondyloarthropathy (SpA) and ankylosing spondylitis (AS) patients we aimed to elucidate dysregulated genes and pathways. Methods RNA was extracted from six SpA, two AS, three osteoarthritis (OA) and four normal control knee synovial biopsies. Whole genome expression profiling was undertaken using the Illumina DASL system, which assays 24000 cDNA probes. Differentially expressed candidate genes were then validated using quantitative PCR and immunohistochemistry. Results: Four hundred and sixteen differentially expressed genes were identified that clearly delineated between AS/SpA and control groups. Pathway analysis showed altered gene-expression in oxidoreductase activity, B-cell associated, matrix catabolic, and metabolic pathways. Altered «myogene» profiling was also identified. The inflammatory mediator, MMP3, was strongly upregulated (5-fold) in AS/SpA samples and the Wnt pathway inhibitors DKK3 (2.7-fold) and Kremen1 (1.5-fold) were downregulated. Conclusions: Altered expression profiling in SpA and AS samples demonstrates that disease pathogenesis is associated with both systemic inflammation as well as local tissue alterations that may underlie tissue damaging modelling and remodelling outcomes. This supports the hypothesis that initial systemic inflammation in spondyloarthropathies transfers to and persists in the local joint environment, and might subsequently mediate changes in genes directly involved in the destructive tissue remodelling.
Resumo:
Objective. To identify genomic regions linked with determinants of age at symptom onset, disease activity, and functional impairment in ankylosing spondylitis (AS). Methods. A whole genome linkage scan was performed in 188 affected sibling pair families with 454 affected individuals. Traits assessed were age at symptom onset, disease activity assessed by the Bath Ankylosing Spondylitis Disease Activity Index (BASDAI), and functional impairment assessed by the Bath Ankylosing Spondylitis Functional Index (BASFI). Parametric and nonparametric quantitative linkage analysis was performed using parameters defined in a previous segregation study. Results. Heritabilities of the traits studied in this data set were as follows: BASDAI 0.49 (P = 0.0001, 95% confidence interval [95% CI] 0.23-0.75), BASFI 0.76 (P = 10-7, 95% CI 0.49-1.0), and age at symptom onset 0.33 (P = 0.005, 95% CI 0.04-0.62). No linkage was observed between the major histocompatibility complex (MHC) and any of the traits studied (logarithm of odds [LOD] score <1.0). "Significant" linkage (LOD score 4.0) was observed between a region on chromosome 18p and the BASDAI. Age at symptom onset showed "suggestive" linkage to chromosome 11p (LOD score 3.3). Maximum linkage with the BASFI was seen at chromosome 2q (LOD score 2.9). Conclusion. In contrast to the genetic determinants of susceptibility to AS, clinical manifestations of the disease measured by the BASDAI, BASFI, and age at symptom onset are largely determined by a small number of genes not encoded within the MHC.
Resumo:
Whole genome sequences are generally accepted as excellent tools for studying evolutionary relationships. Due to the problems caused by the uncertainty in alignment, existing tools for phylogenetic analysis based on multiple alignments could not be directly applied to the whole-genome comparison and phylogenomic studies. There has been a growing interest in alignment-free methods for phylogenetic analysis using complete genome data. The “distances” used in these alignment-free methods are not proper distance metrics in the strict mathematical sense. In this study, we first review them in a more general frame — dissimilarity. Then we propose some new dissimilarities for phylogenetic analysis. Last three genome datasets are employed to evaluate these dissimilarities from a biological point of view.
Resumo:
The successful completion of the Human Genome Project (HGP) was an unprecedented scientific advance that has become an invaluable resource in the search for genes that cause monogenic and common (polygenic) diseases. Prior to the HGP, linkage analysis had successfully mapped many disease genes for monogenic disorders; however, the limitations of this approach were particularly evident for identifying causative genes in rare genetic disorders affecting lifespan and/or reproductive fitness, such as skeletal dysplasias. In this review, we illustrate the challenges of mapping disease genes in such conditions through the ultra-rare disorder fibrodysplasia ossificans progressiva (FOP) and we discuss the advances that are being made through current massively parallel (“next generation”) sequencing (MPS) technologies.
Resumo:
Phenotypic convergence is thought to be driven by parallel substitutions coupled with natural selection at the sequence level. Multiple independent evolutionary transitions of mammals to an aquatic environment offer an opportunity to test this thesis. Here, whole genome alignment of coding sequences identified widespread parallel amino acid substitutions in marine mammals; however, the majority of these changes were not unique to these animals. Conversely, we report that candidate aquatic adaptation genes, identified by signatures of likelihood convergence and/or elevated ratio of nonsynonymous to synonymous nucleotide substitution rate, are characterized by very few parallel substitutions and exhibit distinct sequence changes in each group. Moreover, no significant positive correlation was found between likelihood convergence and positive selection in all three marine lineages. These results suggest that convergence in protein coding genes associated with aquatic lifestyle is mainly characterized by independent substitutions and relaxed negative selection.
Resumo:
Epigenetics plays a crucial role in schizophrenia susceptibility. In a previous study, we identified over 4500 differentially methylated sites in prefrontal cortex (PFC) samples from schizophrenia patients. We believe this was the first genome-wide methylation study performed on human brain tissue using the Illumina Infinium HumanMethylation450 Bead Chip. To understand the biological significance of these results, we sought to identify a smaller number of differentially methylated regions (DMRs) of more functional relevance compared with individual differentially methylated sites. Since our schizophrenia whole genome methylation study was performed, another study analysing two separate data sets of post-mortem tissue in the PFC from schizophrenia patients has been published. We analysed all three data sets using the bumphunter function found in the Bioconductor package minfi to identify regions that are consistently differentially methylated across distinct cohorts. We identified seven regions that are consistently differentially methylated in schizophrenia, despite considerable heterogeneity in the methylation profiles of patients with schizophrenia. The regions were near CERS3, DPPA5, PRDM9, DDX43, REC8, LY6G5C and a region on chromosome 10. Of particular interest is PRDM9 which encodes a histone methyltransferase that is essential for meiotic recombination and is known to tag genes for epigenetic transcriptional activation. These seven DMRs are likely to be key epigenetic factors in the aetiology of schizophrenia and normal brain neurodevelopment.
Resumo:
The sequential nature of gel-based marker systems entails low throughput and high costs per assay. Commonly used marker systems such as SSR and SNP are also dependent on sequence information. These limitations result in high cost per data point and significantly limit the capacity of breeding programs to obtain sufficient return on investment to justify the routine use of marker-assisted breeding for many traits and particularly quantitative traits. Diversity Arrays Technology (DArT™) is a cost effective hybridisation-based marker technology that offers a high multiplexing level while being independent of sequence information. This technology offers sorghum breeding programs an alternative approach to whole-genome profiling. We report on the development, application, mapping and utility of DArT™ markers for sorghum germplasm. Results: A genotyping array was developed representing approximately 12,000 genomic clones using PstI+BanII complexity with a subset of clones obtained through the suppression subtractive hybridisation (SSH) method. The genotyping array was used to analyse a diverse set of sorghum genotypes and screening a Recombinant Inbred Lines (RIL) mapping population. Over 500 markers detected variation among 90 accessions used in a diversity analysis. Cluster analysis discriminated well between all 90 genotypes. To confirm that the sorghum DArT markers behave in a Mendelian manner, we constructed a genetic linkage map for a cross between R931945-2-2 and IS 8525 integrating DArT and other marker types. In total, 596 markers could be placed on the integrated linkage map, which spanned 1431.6 cM. The genetic linkage map had an average marker density of 1/2.39 cM, with an average DArT marker density of 1/3.9 cM. Conclusion: We have successfully developed DArT markers for Sorghum bicolor and have demonstrated that DArT provides high quality markers that can be used for diversity analyses and to construct medium-density genetic linkage maps. The high number of DArT markers generated in a single assay not only provides a precise estimate of genetic relationships among genotypes, but also their even distribution over the genome offers real advantages for a range of molecular breeding and genomics applications.
Resumo:
Major effect genes are often used for germplasm identification, for diversity analyses and as selection targets in breeding. To date, only a few morphological characters have been mapped as major effect genes across a range of genetic linkage maps based on different types of molecular markers in sorghum (Sorghum bicolor (L.) Moench). This study aims to integrate all available previously mapped major effect genes onto a complete genome map, linked to the whole genome sequence, allowing sorghum breeders and researchers to link this information to QTL studies and to be aware of the consequences of selection for major genes. This provides new opportunities for breeders to take advantage of readily scorable morphological traits and to develop more effective breeding strategies. We also provide examples of the impact of selection for major effect genes on quantitative traits in sorghum. The concepts described in this paper have particular application to breeding programmes in developing countries where molecular markers are expensive or impossible to access.
Resumo:
Obesity increases the risk for several conditions, including type 2 diabetes mellitus, cardiovascular disease, hypertension, osteoarthirits and certain types of cancer. Twin- and family studies have shown that there is a major genetic component in the determination of body mass. In recent years several technological and scientific advance have been made in obesity research. For instance, novel replicated loci have been revealed by a number of genome wide association studies. This thesis aimed to investigate the association of genetic factors and obesity-related quantitative traits. The first study investigated the role of the lactase gene in anthropometric traits. We genetically defined lactose persistence by genotyping 31 720 individuals of European descent. We found that lactase persistence was significantly correlated with weight and body mass index but not with height. In the second study we performed the largest whole genome linkage scan for body mass index to date. The sample consisted of 4401 twin families and 10 535 individuals from six European countries. We found supporting evidence for two loci (3q29 and 7q36). We observed that the heritability estimate increased substantially when additional family members were removed from the analyses, which suggests reduced environmental variance in the twin sample. In the third study we assessed metabonomic, transcriptomic and genomic variation in a Finnish population cohort of 518 individuals. We formed gene expression networks to portray pathways and showed that a set of highly correlated genes of an inflammatory pathway associated with 80 serum metabolites (of 134 quantified measures). Strong association was found, for example, with several lipoprotein subclasses. We inferred causality by using genetic variation as anchors. The expression of the network genes was found to be dependent on the circulatory metabolite concentrations.
Resumo:
Blood cells participate in vital physiological processes, and their numbers are tightly regulated so that homeostasis is maintained. Disruption of key regulatory mechanisms underlies many blood-related Mendelian diseases but also contributes to more common disorders, including atherosclerosis. We searched for quantitative trait loci (QTL) for hematology traits through a whole-genome association study, because these could provide new insights into both hemopoeitic and disease mechanisms. We tested 1.8 million variants for association with 13 hematology traits measured in 6015 individuals from the Australian and Dutch populations. These traits included hemoglobin composition, platelet counts, and red blood cell and white blood cell indices. We identified three regions of strong association that, to our knowledge, have not been previously reported in the literature. The first was located in an intergenic region of chromosome 9q31 near LPAR1, explaining 1.5% of the variation in monocyte counts (best SNP rs7023923, p=8.9x10(-14)). The second locus was located on chromosome 6p21 and associated with mean cell erythrocyte volume (rs12661667, p=1.2x10(-9), 0.7% variance explained) in a region that spanned five genes, including CCND3, a member of the D-cyclin gene family that is involved in hematopoietic stem cell expansion. The third region was also associated with erythrocyte volume and was located in an intergenic region on chromosome 6q24 (rs592423, p=5.3x10(-9), 0.6% variance explained). All three loci replicated in an independent panel of 1543 individuals (p values=0.001, 9.9x10(-5), and 7x10(-5), respectively). The identification of these QTL provides new opportunities for furthering our understanding of the mechanisms regulating hemopoietic cell fate.
Studies of the genetic epidemiology of cardiovascular disease: focus on inflammatory candidate genes
Resumo:
Cardiovascular disease (CVD) is a complex disease with multifactorial aetiology. Both genetic and environmental factors contribute to the disease risk. The lifetime risk for CVD differs markedly between men and women, men being at increased risk. Inflammatory reaction contributes to the development of the disease by promoting atherosclerosis in artery walls. In the first part of this thesis, we identified several inflammatory related CVD risk factors associating with the amount of DNA from whole blood samples, indicating a potential source of bias if a genetic study selects the participants based on the available amount of DNA. In the following studies, this observation was taken into account by applying whole genome amplification to samples otherwise subjected to exclusion due to very low DNA yield. We continued by investigating the contribution of inflammatory genes to the risk for CVD separately in men and women, and looked for sex-genotype interaction. In the second part, we explored a new candidate gene and its role in the risk for CVD. Selenoprotein S (SEPS1) is a membrane protein residing in the endoplasmic reticulum where it participates in retro-translocation of unfolded proteins to cytosolic protein degradation. Previous studies have indicated that SEPS1 protects cells from oxidative stress and that variations in the gene are associated with circulating levels of inflammatory cytokines. In our study, we identified two variants in the SEPS1 gene, which associated with coronary heart disease and ischemic stroke in women. This is, to our knowledge, the first study suggesting a role of SEPS1 in the risk for CVD after extensively examining the variation within the gene region. In the third part of this thesis, we focused on a set of seven genes (angiotensin converting enzyme, angiotensin II receptor type I, C-reactive protein (CRP), and fibrinogen alpha-, beta-, and gamma-chains (FGA, FGB, FGG)) related to inflammatory cytokine interleukin 6 (IL6) and their association with the risk for CVD. We identified one variant in the IL6 gene conferring risk for CVD in men and a variant pair from IL6 and FGA genes associated with decreased risk. Moreover, we identified and confirmed an association between a rare variant in the CRP gene and lower CRP levels, and found two variants in the FGA and FGG genes associating with fibrinogen. The results from this third study suggest a role for the interleukin 6 pathway genes in the pathogenesis of CVD and warrant further studies in other populations. In addition to the IL6 -related genes, we describe in this thesis several sex-specific associations in other genes included in this study. The majority of the findings were evident only in women encouraging other studies of cardiovascular disease to include and analyse women separately from men.
Resumo:
Cardiovascular diseases (CVD) are major contributors to morbidity and mortality worldwide. Several interacting environmental, biochemical, and genetic risk factors can increase disease susceptibility. While some of the genes involved in the etiology of CVD are known, many are yet to be discovered. During the last few decades, scientists have searched for these genes with genome-wide linkage and association methods, and with more targeted candidate gene studies. This thesis investigates variation within the upstream transcription factor 1 (USF1) gene locus in relation to CVD risk factors, atherosclerosis, and incidence and prevalence of CVD. This candidate gene was first identified in Finnish families ascertained for familial combined hyperlipidemia, a common dyslipidemia predisposing to coronary heart disease. The gene is a ubiquitously expressed transcription factor regulating expression of several genes from lipid and glucose metabolism, inflammation, and endothelial function. First, we examined association between USF1 variants and several CVD risk factors, such as lipid phenotypes, body composition measures, and metabolic syndrome, in two prospective population cohorts. Our data suggested that USF1 contributes to these CVD risk factors at the population level. Notably, the associations with quantitative measurements were mostly detected among study subjects with CVD or metabolic syndrome, suggesting complex interactions between USF1 effects and the pathophysiological state of an individual. Second, we investigated how variation at the USF1 locus contributes to atherosclerotic lesions of the coronary arteries and abdominal aorta. For this, we used two study samples of middle-aged men with detailed measurements of atherosclerosis obtained in autopsy. USF1 variation significantly associated with areas of several types of lesions, especially with calcification of the arteries. Next, we tested what effect the USF1 risk variants have on sudden cardiac death and incidence of CVD. The atherosclerosis-associated risk variant increased the risk of sudden cardiac death of the same study subjects. Furthermore, USF1 alleles associated with incidence of CVD in the Finnish population follow-up cohorts. These associations were especially prominent among women, suggesting a sex specific effect, which has also been detected in subsequent studies. Finally, as some of the low-yield DNA samples of the Finnish follow-up study cohort needed to be whole-genome amplified (WGA) prior to genotyping, we evaluated whether the produced WGA genotypes were of good quality. Although the samples giving genotype discrepancies could not be detected before genotyping with standard laboratory quality control methods, our results suggested that enhanced quality control at the time of the genotyping could identify such samples. In addition, combining two WGA reactions into one pooled DNA sample for genotyping markedly reduced the number of discrepancies and samples showing them. In conclusion, USF1 seems to have a role in the etiology of CVD. Additional studies are warranted to identify functional variants and to study interactions between USF1 and other genetic or environmental factors. This USF1 study, and other studies with low DNA yield of some samples, can benefit from whole genome amplification of the low-yield samples prior to genotyping. Careful quality control procedures are, however, needed in WGA genotyping.
Resumo:
Knowing the chromosomal areas or actual genes affecting the traits under selection would add more information to be used in the selection decisions which would potentially lead to higher genetic response. The first objective of this study was to map quantitative trait loci (QTL) affecting economically important traits in the Finnish Ayrshire population. The second objective was to investigate the effects of using QTL information in marker-assisted selection (MAS) on the genetic response and the linkage disequilibrium between the different parts of the genome. Whole genome scans were carried out on a grand-daughter design with 12 half-sib families and a total of 493 sons. Twelve different traits were studied: milk yield, protein yield, protein content, fat yield, fat content, somatic cell score (SCS), mastitis treatments, other veterinary treatments, days open, fertility treatments, non-return rate, and calf mortality. The average spacing of the typed markers was 20 cM with 2 to 14 markers per chromosome. Associations between markers and traits were analyzed with multiple marker regression. Significance was determined by permutation and genome-wise P-values obtained by Bonferroni correction. The benefits from MAS were investigated by simulation: a conventional progeny testing scheme was compared to a scheme where QTL information was used within families to select among full-sibs in the male path. Two QTL on different chromosomes were modelled. The effects of different starting frequencies of the favourable alleles and different size of the QTL effects were evaluated. A large number of QTL, 48 in total, were detected at 5% or higher chromosome-wise significance. QTL for milk production were found on 8 chromosomes, for SCS on 6, for mastitis treatments on 1, for other veterinary treatments on 5, for days open on 7, for fertility treatments on 7, for calf mortality on 6, and for non-return rate on 2 chromosomes. In the simulation study the total genetic response was faster with MAS than with conventional selection and the advantage of MAS persisted over the studied generations. The rate of response and the difference between the selection schemes reflected clearly the changes in allele frequencies of the favourable QTL. The disequilibrium between the polygenes and QTL was always negative and it was larger with larger QTL size. The disequilibrium between the two QTL was larger with QTL of large effect and it was somewhat larger with MAS for scenarios with starting frequencies below 0.5 for QTL of moderate size and below 0.3 for large QTL. In conclusion, several QTL affecting economically important traits of dairy cattle were detected. Further studies are needed to verify these QTL, check their presence in the present breeding population, look for pleiotropy and fine map the most interesting QTL regions. The results of the simulation studies show that using MAS together with embryo transfer to pre-select young bulls within families is a useful approach to increase the genetic merit of the AI-bulls compared to conventional selection.
Resumo:
Japanese isolates of Candidatus Liberibacter asiaticus have been shown to be clearly differentiated by simple sequence repeat (SSR) profiles at four loci. In this study, 25 SSR loci, including these four loci, were selected from the whole-genome sequence and were used to differentiate non-Japanese samples of Ca. Liberibacter asiaticus (13 Indian, 3 East Timorese, 1 Papuan and 8 Floridian samples). Out of the 25 SSR loci, 13 were polymorphic. Dendrogram analysis using SSR loci showed that the clusters were mostly consistent with the geographical origins of the isolates. When single nucleotide polymorphisms (SNPs) were searched around these 25 loci, only the upstream region of locus 091 exhibited polymorphism. Phylogenetic tree analysis of the SNPs in the upstream region of locus 091 showed that Floridian samples were clustered into one group as shown by dendrogram analysis using SSR loci. The differences in nucleotide sequences were not associated with differences in the citrus hosts (lime, mandarin, lemon and sour orange) from which the isolates were originally derived.
Resumo:
Key message The potential for exploiting heterosis for sorghum hybrid production in Ethiopia with improved local adaptation and farmers preferences has been investigated and populations suitable for initial hybrid development have been identified. Abstract Hybrids in sorghum have demonstrated increased productivity and stability of performance in the developed world. In Ethiopia, the uptake of hybrid sorghum has been limited to date, primarily due to poor adaptation and absence of farmer’s preferred traits in existing hybrids. This study aimed to identify complementary parental pools to develop locally adapted hybrids, through an analysis of whole genome variability of 184 locally adapted genotypes and introduced hybrid parents (R and B). Genetic variability was assessed using genetic distance, model-based STRUCTURE analysis and pair-wise comparison of groups. We observed a high degree of genetic similarity between the Ethiopian improved inbred genotypes and a subset of landraces adapted to lowland agro-ecology with the introduced R lines. This coupled with the genetic differentiation from existing B lines, indicated that these locally adapted genotype groups are expected to have similar patterns of heterotic expression as observed between introduced R and B line pools. Additionally, the hybrids derived from these locally adapted genotypes will have the benefit of containing farmers preferred traits. The groups most divergent from introduced B lines were the Ethiopian landraces adapted to highland and intermediate agro-ecologies and a subset of lowland-adapted genotypes, indicating the potential for increased heterotic response of their hybrids. However, these groups were also differentiated from the R lines, and hence are different from the existing complementary heterotic pools. This suggests that although these groups could provide highly divergent parental pools, further research is required to investigate the extent of heterosis and their hybrid performance.