957 results for human genome variation
Abstract:
We review the study of flower color polymorphisms in the morning glory as a model for the analysis of adaptation. The pathway involved in the determination of flower color phenotype is traced from the molecular and genetic levels to the phenotypic level. Many of the genes that determine the enzymatic components of flavonoid biosynthesis are redundant, but, despite this complexity, it is possible to associate discrete floral phenotypes with individual genes. An important finding is that almost all of the mutations that determine phenotypic differences are the result of transposon insertions. Thus, the flower color diversity seized on by early human domesticators of this plant is a consequence of the rich variety of mobile elements that reside in the morning glory genome. We then consider a long history of research aimed at uncovering the ecological fate of these various flower phenotypes in the southeastern U.S. A large body of work has shown that insect pollinators discriminate against white phenotypes when white flowers are rare in populations. Because the plant is self-compatible, pollinator bias causes an increase in self-fertilization in white maternal plants, which should lead to an increase in the frequency of white genes, according to modifier gene theory. Studies of geographical distributions indicate other, as yet undiscovered, disadvantages associated with the white phenotype. The ultimate goal of connecting ecology to molecular genetics through the medium of phenotype is yet to be attained, but this approach may represent a model for analyzing the translation between these two levels of biological organization.
Abstract:
Current evidence on the long-term evolutionary effect of insertion of sequence elements into gene regions is reviewed, restricted to cases where a sequence derived from a past insertion participates in the regulation of expression of a useful gene. Ten such examples in eukaryotes demonstrate that segments of repetitive DNA or mobile elements have been inserted in the past in gene regions, have been preserved, sometimes modified by selection, and now affect control of transcription of the adjacent gene. Included are only examples in which transcription control was modified by the insert. Several cases in which merely transcription initiation occurred in the insert were set aside. Two of the examples involved the long terminal repeats of mammalian endogenous retroviruses. Another two examples were control of transcription by repeated sequence inserts in sea urchin genomes. There are now six published examples in which Alu sequences were inserted long ago into human gene regions, were modified, and now are central in control/enhancement of transcription. The number of published examples of Alu sequences affecting gene control has grown threefold in the last year and is likely to continue growing. Taken together, all of these examples show that the insertion of sequence elements in the genome has been a significant source of regulatory variation in evolution.
Abstract:
Human gene MAGE-1 encodes tumor-specific antigens that are recognized on melanoma cells by autologous cytolytic T lymphocytes. This gene is expressed in a significant proportion of tumors of various histological types, but not in normal tissues except male germ-line cells. We reported previously that reporter genes driven by the MAGE-1 promoter are active not only in the tumor cell lines that express MAGE-1 but also in those that do not. This suggests that the critical factor causing the activation of MAGE-1 in certain tumors is not the presence of the appropriate transcription factors. The two major MAGE-1 promoter elements have an Ets binding site, which contains a CpG dinucleotide. We report here that these CpG are demethylated in the tumor cell lines that express MAGE-1, and are methylated in those that do not express the gene. Methylation of these CpG inhibits the binding of transcription factors, as seen by mobility shift assay. Treatment with the demethylating agent 5-aza-2'-deoxycytidine activated gene MAGE-1 not only in tumor cell lines but also in primary fibroblasts. Finally, the overall level of CpG methylation was evaluated in 20 different tumor cell lines. It was inversely correlated with the expression of MAGE-1. We conclude that the activation of MAGE-1 in cancer cells is due to the demethylation of the promoter. This appears to be a consequence of a genome-wide demethylation process that occurs in many cancers and is correlated with tumor progression.
Abstract:
We have characterized a family of repetitive DNA elements with homology to the MgPa cellular adhesion operon of Mycoplasma genitalium, a bacterium that has the smallest known genome of any free-living organism. One element, 2272 bp in length and flanked by DNA with no homology to MgPa, was completely sequenced. At least four others were partially sequenced. The complete element is a composite of six regions. Five of these regions show sequence similarity with nonadjacent segments of genes of the MgPa operon. The sixth region, located near the center of the element, is an A+T-rich sequence that has only been found in this repeat family. Open reading frames are present within the five individual regions showing sequence homology to MgPa and the adjacent open reading frame 3 (ORF3) gene. However, termination codons are found between adjacent regions of homology to the MgPa operon and in the A+T-rich sequence. Thus, these repetitive elements do not appear to be directly expressible protein coding sequences. The sequence of one region from five different repetitive elements was compared with the homologous region of the MgPa gene from the type strain G37 and four newly isolated M. genitalium strains. Recombination between repetitive elements of strain G37 and the MgPa operon can explain the majority of polymorphisms within our partial sequences of the MgPa genes of the new isolates. Therefore, we propose that the repetitive elements of M. genitalium provide a reservoir of sequence that contributes to antigenic variation in proteins of the MgPa cellular adhesion operon.
Abstract:
We have rated eye color on a 3-point scale (1=blue/grey, 2=hazel/green, 3=brown) in 502 twin families and carried out a 5-10 cM genome scan (400-757 markers). We analyzed eye color as a threshold trait and performed multipoint sib pair linkage analysis using variance components analysis in Mx. A lod of 19.2 was found at the marker D15S1002, less than 1 cM from OCA2, which has been previously implicated in eye color variation. We estimate that 74% of variance in eye color liability is due to this QTL and a further 18% due to polygenic effects. However, a large shoulder on this peak suggests that other loci affecting eye color may be telomeric of OCA2 and inflating the QTL estimate. No other peaks reached genome-wide significance, although lods >2 were seen on 5p and 14q and lods >1 were additionally seen on chromosomes 2, 3, 6, 7, 8, 9, 17 and 18. Most of these secondary peaks were reduced or eliminated when we repeated the scan as a two locus analysis with the 15q linkage included, although this does not necessarily exclude them as false positives. We also estimated the interaction between the 15q QTL and the other marker locus but there was only minor evidence for additive x additive epistasis. Elaborating the analysis to the full two-locus model including non-additive main effects and interactions did not strengthen the evidence for epistasis. We conclude that most variation in eye color in Europeans is due to polymorphism in OCA2 but that there may be modifiers at several other loci.
Abstract:
Chlamydia pneumoniae is an obligate intracellular respiratory pathogen that causes 10% of community-acquired pneumonia and has been associated with cardiovascular disease. Both whole-genome sequencing and specific gene typing suggest that there is relatively little genetic variation in human isolates of C. pneumoniae. To date, there has been little genomic analysis of strains from human cardiovascular sites. The genotypes of C. pneumoniae present in human atherosclerotic carotid plaque were analysed and several polymorphisms in the variable domain 4 (VD4) region of the outer-membrane protein-A (ompA) gene and the intergenic region between the ygeD and uridine kinase (ygeD-urk) genes were found. While one genotype was identified that was the same as one reported previously in humans (respiratory and cardiovascular), another genotype was found that was identical to a genotype from non-human sources (frog/koala).
Abstract:
Full-length genome sequences of five virulent and five avirulent strains of Newcastle disease virus (NDV) isolated between 1998 and 2002 in Victoria and New South Wales, Australia, were determined. Comparisons between these strains revealed that the coding sequences of the haemagglutinin-neuraminidase (HN), matrix (M) and phosphoprotein (P) genes were more variable than those of the fusion (F), nucleocapsid (N) and RNA-dependent RNA replicase (L) genes. Sequence analysis of a number of other isolates made during the recent virulent NDV outbreaks also identified a number of variants with altered F gene cleavage sites, which resulted in altered biological properties of those viruses. Quasispecies analysis of a number of field isolates indicated the presence of virulent virus in one particular isolate. Gene sequence analysis of the progenitor virus isolated in 1998 showed very little sequence variation when compared to that of a progenitor-like virus isolated in 2001, demonstrating that, in the field, viral genome sequence variation appears to be biologically restricted to that of a consensus sequence. (c) 2005 Elsevier B.V. All rights reserved.
Abstract:
Our sleep timing preference, or chronotype, is a manifestation of our internal biological clock. Variation in chronotype has been linked to sleep disorders, cognitive and physical performance, and chronic disease. Here we perform a genome-wide association study of self-reported chronotype within the UK Biobank cohort (n=100,420). We identify 12 new genetic loci that implicate known components of the circadian clock machinery and point to previously unstudied genetic variants and candidate genes that might modulate core circadian rhythms or light-sensing pathways. Pathway analyses highlight central nervous and ocular systems and fear-response-related processes. Genetic correlation analysis suggests chronotype shares underlying genetic pathways with schizophrenia, educational attainment and possibly BMI. Further, Mendelian randomization suggests that evening chronotype relates to higher educational attainment. These results not only expand our knowledge of the circadian system in humans but also expose the influence of circadian characteristics over human health and life-history variables such as educational attainment.
Abstract:
Acknowledgements This study was funded by a BBSRC studentship (MA Wenzel) and NERC grants NE/H00775X/1 and NE/D000602/1 (SB Piertney). The authors are grateful to Fiona Leckie, Andrew MacColl, Jesús Martínez-Padilla, François Mougeot, Steve Redpath, Pablo Vergara† and Lucy M.I. Webster for samples; Keliya Bai, Daisy Brickhill, Edward Graham, Alyson Little, Daniel Mifsud, Lizzie Molyneux and Mario Röder for fieldwork assistance; Gillian Murray-Dickson and Laura Watt for laboratory assistance; Heather Ritchie for helpful comments on manuscript drafts; and all estate owners, factors and keepers for access to field sites, most particularly Stuart Young and Derek Calder (Edinglassie), Simon Blackett, Jim Davidson and Liam Donald (Invercauld and Glas Choille), Richard Cooke and Fred Taylor† (Invermark) and T. Helps (Catterick).
Abstract:
A large proportion of the variation in traits between individuals can be attributed to variation in the nucleotide sequence of the genome. The most commonly studied traits in human genetics are related to disease and disease susceptibility. Although scientists have identified genetic causes for over 4,000 monogenic diseases, the underlying mechanisms of many highly prevalent multifactorial inheritance disorders such as diabetes, obesity, and cardiovascular disease remain largely unknown. Identifying genetic mechanisms for complex traits has been challenging because most of the variants are located outside of protein-coding regions, and determining the effects of such non-coding variants remains difficult. In this dissertation, I evaluate the hypothesis that such non-coding variants contribute to human traits and diseases by altering the regulation of genes rather than the sequence of those genes. I will specifically focus on studies to determine the functional impacts of genetic variation associated with two related complex traits: gestational hyperglycemia and fetal adiposity. At the genomic locus associated with maternal hyperglycemia, we found that genetic variation in regulatory elements altered the expression of the HKDC1 gene. Furthermore, we demonstrated that HKDC1 phosphorylates glucose in vitro and in vivo, thus demonstrating that HKDC1 is a fifth human hexokinase gene. At the fetal-adiposity associated locus, we identified variants that likely alter VEPH1 expression in preadipocytes during differentiation. To make such studies of regulatory variation high-throughput and routine, we developed POP-STARR, a novel high throughput reporter assay that can empirically measure the effects of regulatory variants directly from patient DNA. By combining targeted genome capture technologies with STARR-seq, we assayed thousands of haplotypes from 760 individuals in a single experiment. 
We subsequently used POP-STARR to identify three key features of regulatory variants: that regulatory variants typically have weak effects on gene expression; that the effects of regulatory variants are often coordinated with respect to disease risk, suggesting a general mechanism by which the weak effects can together have phenotypic impact; and that nucleotide transversions have larger impacts on enhancer activity than transitions. Together, the findings presented here demonstrate successful strategies for determining the regulatory mechanisms underlying genetic associations with human traits and diseases, and the value of doing so for driving novel biological discovery.
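The transition/transversion distinction mentioned above can be illustrated with a minimal Python sketch. It is not part of POP-STARR; the function name `classify_substitution` is hypothetical and the code only encodes the standard definition (a transition swaps purine for purine or pyrimidine for pyrimidine, a transversion swaps between the two classes).

```python
# Standard base classes: purines (A, G) and pyrimidines (C, T).
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref: str, alt: str) -> str:
    """Classify a single-nucleotide substitution (hypothetical helper).

    Returns "transition" when both bases belong to the same chemical
    class, and "transversion" when they belong to different classes.
    """
    ref, alt = ref.upper(), alt.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid:
        raise ValueError("bases must be one of A, C, G, T")
    if ref == alt:
        raise ValueError("reference and alternate bases are identical")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"
```

For example, `classify_substitution("A", "G")` gives a transition, while `classify_substitution("A", "C")` gives a transversion.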
Abstract:
This thesis describes two newly sequenced B. longum subsp. longum genomes and subsequent comparative analysis with publicly available B. longum subsp. longum, B. longum subsp. infantis and B. longum subsp. suis genomes (Chapter 2). The acquired data revealed a closed pan-genome for this bifidobacterial species and furthermore facilitated the definition of the B. longum core genome. The comparative analysis also highlights differences in the potential metabolic abilities of all three sub-species. Interestingly, phylogenetic analysis of the B. longum core genome indicated the existence of a novel B. longum subspecies. Characterisation of restriction-modification systems from two B. longum subsp. longum strains is described in Chapter 3. These defence mechanisms limit the uptake of genetic material, which was successfully demonstrated for some of the identified systems. When these systems were by-passed by methylation of DNA prior to the transformation procedure, the resulting transformation efficiency of both B. longum subsp. longum strains was increased to a level that allowed for the generation of mutants via homologous recombination. Arabinoxylan metabolism by B. longum subsp. longum NCIMB 8809 was investigated in Chapter 4 of this thesis. Transcriptome analysis allowed the identification of a number of genes involved in the degradation, uptake and utilisation of arabinoxylan. Biochemical analysis revealed that three of the identified genes encode arabinofuranosidase activity. Phenotypic assessment of a number of insertion mutants in genes identified by the transcriptome analysis revealed the essential role of two of these enzymes in arabinoxylan metabolism, and a third enzyme in the metabolism of debranched arabinan. Furthermore, this investigation revealed that B. longum subsp. longum NCIMB 8809 does not completely degrade arabinoxylan, but utilises the arabinose substitutions only, while leaving the xylan backbone untouched. Finally, Chapter 5 outlines that B. longum subsp. longum NCIMB 8809 is capable of removing ferulic and p-coumaric acid substitutions that originate from arabinoxylan. Analysis of the genome sequence led to the identification of a candidate gene for this activity, which was subsequently cloned and expressed in E. coli. Biochemical analysis revealed that the enzyme, designated here as FaeA, is indeed capable of releasing both ferulic and p-coumaric acid from arabinoxylan. Furthermore, it is shown that a derivative of B. longum subsp. longum NCIMB 8809 carrying an insertion mutation in faeA had lost the ability to release ferulic and p-coumaric acid from arabinoxylan, and that growth of this mutant strain is negatively affected when cultivated on growth-limiting levels of arabinoxylan.
Abstract:
The use of whole-genome phylogenetic analysis has revolutionized our understanding of the evolution and spread of many important bacterial pathogens due to the high-resolution view it provides. However, the majority of such analyses do not consider the potential role of accessory genes when inferring evolutionary trajectories. Moreover, the recently discovered importance of the switching of gene regulatory elements suggests that an exhaustive analysis, combining information from core and accessory genes with regulatory elements, could provide unparalleled detail of the evolution of a bacterial population. Here we demonstrate this principle by applying it to a worldwide multi-host sample of the important pathogenic E. coli lineage ST131. Our approach reveals the existence of multiple circulating subtypes of the major drug-resistant clade of ST131 and provides the first ever population-level evidence of core genome substitutions in gene regulatory regions associated with the acquisition and maintenance of different accessory genome elements.
Abstract:
1. Genomewide association studies (GWAS) enable detailed dissections of the genetic basis for organisms' ability to adapt to a changing environment. In long-term studies of natural populations, individuals are often marked at one point in their life and then repeatedly recaptured. It is therefore essential that a method for GWAS includes the process of repeated sampling. In a GWAS, the effects of thousands of single-nucleotide polymorphisms (SNPs) need to be fitted and any model development is constrained by the computational requirements. A method is therefore required that can fit a highly hierarchical model and at the same time is computationally fast enough to be useful. 2. Our method fits fixed SNP effects in a linear mixed model that can include both random polygenic effects and permanent environmental effects. In this way, the model can correct for population structure and model repeated measures. The covariance structure of the linear mixed model is first estimated and subsequently used in a generalized least squares setting to fit the SNP effects. The method was evaluated in a simulation study based on observed genotypes from a long-term study of collared flycatchers in Sweden. 3. The method we present here was successful in estimating permanent environmental effects from simulated repeated measures data. Additionally, we found that especially for variable phenotypes having large variation between years, the repeated measurements model has a substantial increase in power compared to a model using average phenotypes as a response. 4. The method is available in the R package RepeatABEL. It increases the power in GWAS having repeated measures, especially for long-term studies of natural populations, and the R implementation is expected to facilitate modelling of longitudinal data for studies of both animal and human populations.
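The two-stage procedure described in this abstract can be written out as a short sketch. The notation below is an assumption made for illustration and is not taken from the RepeatABEL documentation: y is the vector of repeated phenotype records, X holds the fixed effects including the SNP genotype, Z maps records to individuals, G is a genomic relationship matrix, and the sigma terms are the polygenic, permanent environmental and residual variance components.

```latex
% Linear mixed model with polygenic (g) and permanent environmental (p) effects
y = X\beta + Zg + Zp + e, \qquad
g \sim N(0, \sigma_g^2 G), \quad
p \sim N(0, \sigma_p^2 I), \quad
e \sim N(0, \sigma_e^2 I)

% Stage 1: estimate the variance components once, giving the covariance of y
V = \sigma_g^2\, Z G Z^{\top} + \sigma_p^2\, Z Z^{\top} + \sigma_e^2\, I

% Stage 2: with V held fixed, fit each SNP by generalized least squares
\hat{\beta} = \left( X^{\top} V^{-1} X \right)^{-1} X^{\top} V^{-1} y, \qquad
\operatorname{Var}(\hat{\beta}) = \left( X^{\top} V^{-1} X \right)^{-1}
```

Because V is estimated only once and reused for every SNP, the per-SNP cost reduces to a generalized least squares fit, which is presumably what keeps the approach computationally feasible for thousands of markers.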