262 results for human genome variation
Abstract:
Pneumocystis jirovecii is a fungal parasite that specifically colonizes humans and becomes an opportunistic pathogen in immunodeficient individuals. The fungus is able to reproduce extracellularly in host lungs without eliciting massive cellular death. The molecular mechanisms that govern this process are poorly understood, in part because of the lack of an in vitro culture system for Pneumocystis spp. In this study, we explored the origin and evolution of the putative biotrophy of P. jirovecii through comparative genomics and reconstruction of ancestral gene repertoires. We used the maximum parsimony method and genomes of related fungi of the Taphrinomycotina subphylum. Our results suggest that the last common ancestor of Pneumocystis spp. lost 2,324 genes in relation to the acquisition of obligate biotrophy. These losses may result from neutral drift and affect the biosyntheses of amino acids and thiamine, the assimilation of inorganic nitrogen and sulfur, and the catabolism of purines. In addition, P. jirovecii shows a reduced panel of lytic proteases and has lost the RNA interference machinery, which might contribute to its genome plasticity. Together with other characteristics, namely a sexual life cycle within the host, the absence of massive destruction of host cells, difficult culturing, and the lack of virulence factors, these gene losses constitute a unique combination of characteristics that are hallmarks of both obligate biotrophs and animal parasites. These findings suggest that Pneumocystis spp. should be considered the first described obligate biotrophs of animals, whose evolution has been marked by gene losses.
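As an illustration of how a parsimony reconstruction can place a gene loss on an ancestral branch, the following is a minimal sketch. The abstract does not say which parsimony variant was used, so Fitch parsimony on binary presence/absence data is an assumption here, as are the toy tree, the taxon labels (`pneumo_1`, `relative_1`, `outgroup`), the tie-break rule, and the presence pattern. None of these are the study's data.

```python
def fitch_reconstruct(tree, leaf_states):
    """Assign a presence (1) / absence (0) state to every node of a binary tree.

    tree: nested 2-tuples with unique leaf names (strings) at the tips.
    leaf_states: dict mapping each leaf name to 0 or 1.
    """
    sets = {}

    def up(node):
        # Bottom-up pass: intersection of child sets if non-empty, else union.
        if isinstance(node, str):
            sets[node] = {leaf_states[node]}
        else:
            left, right = node
            a, b = up(left), up(right)
            sets[node] = (a & b) or (a | b)
        return sets[node]

    states = {}

    def down(node, parent_state):
        # Top-down pass: keep the parent's state when the node's set allows it.
        if parent_state in sets[node]:
            states[node] = parent_state
        else:
            states[node] = max(sets[node])  # assumed tie-break: prefer presence
        if not isinstance(node, str):
            for child in node:
                down(child, states[node])

    up(tree)
    down(tree, max(sets[tree]))
    return states


def loss_branches(tree, states):
    """Return the child nodes of branches where a gene goes from present to absent."""
    losses = []

    def walk(node):
        if isinstance(node, str):
            return
        for child in node:
            if states[node] == 1 and states[child] == 0:
                losses.append(child)
            walk(child)

    walk(tree)
    return losses


# Hypothetical toy tree and presence pattern for a single gene.
tree = ((("pneumo_1", "pneumo_2"), ("relative_1", "relative_2")), "outgroup")
presence = {"pneumo_1": 0, "pneumo_2": 0,
            "relative_1": 1, "relative_2": 1, "outgroup": 1}

states = fitch_reconstruct(tree, presence)
losses = loss_branches(tree, states)
print(losses)
```

In this toy case the gene is inferred present at the root and lost once, on the branch leading to the common ancestor of the two `pneumo_*` tips. Repeating the procedure over every gene family and counting the losses on that branch would give a figure comparable in kind to the one reported, though the real analysis may differ in method and detail.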
Abstract:
To identify common variants influencing body mass index (BMI), we analyzed genome-wide association data from 16,876 individuals of European descent. After previously reported variants in FTO, the strongest association signal (rs17782313, P = 2.9 x 10(-6)) mapped 188 kb downstream of MC4R (melanocortin-4 receptor), mutations of which are the leading cause of monogenic severe childhood-onset obesity. We confirmed the BMI association in 60,352 adults (per-allele effect = 0.05 Z-score units; P = 2.8 x 10(-15)) and 5,988 children aged 7-11 (0.13 Z-score units; P = 1.5 x 10(-8)). In case-control analyses (n = 10,583), the odds for severe childhood obesity reached 1.30 (P = 8.0 x 10(-11)). Furthermore, we observed overtransmission of the risk allele to obese offspring in 660 families (P (pedigree disequilibrium test average; PDT-avg) = 2.4 x 10(-4)). The SNP location and patterns of phenotypic associations are consistent with effects mediated through altered MC4R function. Our findings establish that common variants near MC4R influence fat mass, weight and obesity risk at the population level and reinforce the need for large-scale data integration to identify variants influencing continuous biomedical traits.
Abstract:
It is important to characterise the amount of variation on the mammalian Y chromosome in order to assess its potential for use in evolutionary studies. We report very low levels of polymorphism on the Y chromosome of Saudi-Arabian hamadryas baboons, Papio hamadryas hamadryas. We found no segregating sites on the Y, despite sequence analysis of 3 kb of noncontiguous intron sequence in 16 males with divergent autosomal microsatellite genotypes, and a further analysis of 1.1 kb of intron sequence in 97 males from four populations by SSCP. In addition, we tested seven human-derived Y-linked microsatellites in baboons. Only four of these loci were male-specific and only one was polymorphic in our set of 97 males. Polymorphism on the Y chromosome of Arabian hamadryas appears to be low compared to other primate species for which data are available (e.g., humans, chimpanzees and bonobos). Low effective population size (Ne) of paternal genes due to polygyny and female-biased adult sex ratio is a potential reason for low Y chromosome variation in this species. However, low Ne for the Y should be counterbalanced to some extent by the species' atypical pattern of male philopatry and female-biased dispersal. Allelic richness averaged over seven loci was not significantly different between an African and an Arabian population, suggesting that loss of variation during the colonisation of Arabia does not explain low Y variation. Finally, in the absence of nucleotide polymorphism, it is unclear to what extent selection could be responsible for low Y variation in this species.
Abstract:
Narcolepsy is a rare sleep disorder with the strongest human leukocyte antigen (HLA) association ever reported. Since the associated HLA-DRB1*1501-DQB1*0602 haplotype is common in the general population (15-25%), it has been suggested that it is almost necessary but not sufficient for developing narcolepsy. To further define the genetic basis of narcolepsy risk, we performed a genome-wide association study (GWAS) in 562 European individuals with narcolepsy (cases) and 702 ethnically matched controls, with independent replication in 370 cases and 495 controls, all heterozygous for DRB1*1501-DQB1*0602. We found association with a protective variant near HLA-DQA2 (rs2858884; P < 3 x 10(-8)). Further analysis revealed that rs2858884 is strongly linked to DRB1*03-DQB1*02 (P < 4 x 10(-43)) and DRB1*1301-DQB1*0603 (P < 3 x 10(-7)). Cases almost never carried a trans DRB1*1301-DQB1*0603 haplotype (odds ratio = 0.02; P < 6 x 10(-14)). This unexpected protective HLA haplotype suggests a virtually causal involvement of the HLA region in narcolepsy susceptibility.
Abstract:
The problem of how cooperation can evolve between individuals or entities with conflicting interests is central to biology, as many of the major evolutionary transitions, from the first replicating molecules to human societies, have required solving this problem. There are many routes to cooperation, but humans seem to be distinct from other species in having more complex and diverse mechanisms, often due to their higher cognitive skills, that allow them to reap the benefits of living in groups. Among those mechanisms, the use of reputation or past experience with others and the use of sanctioning mechanisms both seem to be of major importance. They have often been considered separately, but the interaction between the two might provide new insights into how punishment could have appeared as a means to enforce cooperation in early humans. In this thesis, I first use theoretical approaches from evolutionary game theory to investigate the evolution of punishment and cooperation through a reputation system based on punitive actions, and compare the efficacy of this system, in terms of cooperation achieved, with one based on cooperative actions. Second, I use empirical approaches from economics to test predictions from theoretical models in real life, and to explore further conditions such as environmental variation, constrained memory, and the scale of competition between individuals. Both approaches have contributed to the understanding of how these factors affect the use of reputation and punishment, and ultimately how cooperation is achieved.
Abstract:
In vertebrates, the RAD51 protein is required for genetic recombination, DNA repair, and cellular proliferation. Five paralogs of RAD51, known as RAD51B, RAD51C, RAD51D, XRCC2, and XRCC3, have been identified and also shown to be required for recombination and genome stability. At the present time, however, very little is known about their biochemical properties or precise biological functions. As a first step toward understanding the roles of the RAD51 paralogs in recombination, the human RAD51C and XRCC3 proteins were overexpressed and purified from baculovirus-infected insect cells. The two proteins copurify as a complex, a property that reflects their endogenous association observed in HeLa cells. Purified RAD51C-XRCC3 complex binds single-stranded, but not duplex DNA, to form protein-DNA networks that have been visualized by electron microscopy.
Abstract:
Regulation of viral genome expression is the result of complex cooperation between viral proteins and host cell factors. We report here the characterization of a novel cellular factor sharing homology with the specific cysteine-rich C-terminal domain of the basic helix-loop-helix repressor protein I-mfa. The synthesis of this new factor, called HIC for Human I-mfa domain-Containing protein, is controlled at the translational level by two different codons, an ATG and an upstream non-ATG translational initiator, allowing the production of two protein isoforms, p32 and p40, respectively. We show that the HIC protein isoforms present different subcellular localizations, p32 being mainly distributed throughout the cytoplasm, whereas p40 is targeted to the nucleolus. Moreover, in trying to understand the function of HIC, we have found that both isoforms stimulate in T-cells the expression of a luciferase reporter gene driven by the human T-cell leukemia virus type I-long terminal repeat in the presence of the viral transactivator Tax. We demonstrate by mutagenesis that the I-mfa-like domain of HIC is involved in this regulation. Finally, we also show that HIC is able to down-regulate the luciferase expression from the human immunodeficiency virus type 1-long terminal repeat induced by the viral transactivator Tat. From these results, we propose that HIC and I-mfa represent two members of a new family of proteins regulating gene expression and characterized by a particular cysteine-rich C-terminal domain.
Abstract:
BACKGROUND: HIV-infected individuals have an increased risk of myocardial infarction. Antiretroviral therapy (ART) is regarded as a major determinant of dyslipidemia in HIV-infected individuals. Previous genetic studies have been limited by the validity of the single-nucleotide polymorphisms (SNPs) interrogated and by cross-sectional design. Recent genome-wide association studies have reliably associated common SNPs with dyslipidemia in the general population. METHODS AND RESULTS: We validated the contribution of 42 SNPs (33 identified in genome-wide association studies and 9 previously reported SNPs not included in genome-wide association study chips) and of longitudinally measured key nongenetic variables (ART, underlying conditions, sex, age, ethnicity, and HIV disease parameters) to dyslipidemia in 745 HIV-infected study participants (n = 34,565 lipid measurements; median follow-up, 7.6 years). The relative impact of SNPs and ART on lipid variation in the study population and their cumulative influence on sustained dyslipidemia at the level of the individual were calculated. SNPs were associated with lipid changes consistent with genome-wide association study estimates. SNPs explained up to 7.6% (non-high-density lipoprotein cholesterol), 6.2% (high-density lipoprotein cholesterol), and 6.8% (triglycerides) of lipid variation; ART explained 3.9% (non-high-density lipoprotein cholesterol), 1.5% (high-density lipoprotein cholesterol), and 6.2% (triglycerides). An individual with the most dyslipidemic antiretroviral and genetic background had an approximately 3- to 5-fold increased risk of sustained dyslipidemia compared with an individual with the least dyslipidemic therapy and genetic background. CONCLUSIONS: In the HIV-infected population treated with ART, the weight of the contribution of common SNPs and ART to dyslipidemia was similar. When selecting an ART regimen, genetic information should be considered in addition to the dyslipidemic effects of ART agents.
Abstract:
High systemic levels of IP-10 at onset of combination therapy for chronic hepatitis C mirror intrahepatic mRNA levels and predict a slower first phase decline in HCV RNA as well as poor outcome. Recently, several genome-wide association studies have revealed that single nucleotide polymorphisms (SNPs) on chromosome 19 in proximity to IL28B predict spontaneous clearance of HCV infection as well as therapeutic outcome among patients infected with HCV genotype 1, with three such SNPs being highly predictive: rs12979860, rs12980275, and rs8099917. In the present study, we correlated genetic variations in these SNPs from 253 Caucasian patients with pretreatment plasma levels of IP-10 and HCV RNA throughout therapy within a phase III treatment trial (HCV-DITTO). The favorable genetic variations in all three SNPs (CC, AA, and TT respectively) were significantly associated with lower baseline IP-10 (CC vs. CT/TT at rs12979860: median 189 vs. 258 pg/mL, P=0.02; AA vs. AG/GG at rs12980275: median 189 vs. 258 pg/mL, P=0.01; TT vs. TG/GG at rs8099917: median 224 vs. 288 pg/mL, P=0.04), were significantly less common among HCV genotype 1 infected patients than genotype 2/3 (P<0.0001, P<0.0001, and P=0.01 respectively) and had significantly higher baseline viral load than carriers of the SNP genotypes (6.3 vs. 5.9 log10 IU/mL, P=0.0012; 6.3 vs. 6.0 log10 IU/mL, P=0.026; and 6.3 vs. 5.8 log10 IU/mL, P=0.0003 respectively). Among HCV genotype 1 infected homozygous or heterozygous carriers of the favorable C, A, and T genotypes, lower baseline IP-10 was significantly associated with greater decline in HCV RNA from day 0 to day 4, which translated into increased rates of achieving SVR among homozygous patients with baseline IP-10 below 150 pg/mL (85%, 75%, and 75% respectively). In a multivariate analysis among genotype 1 infected patients, both baseline IP-10 and the SNPs were significant independent predictors of SVR.
Conclusion: Baseline plasma IP-10 is significantly associated with IL28B variations, and augments the predictiveness of the first phase decline in HCV RNA and final treatment outcome.
Abstract:
The use of comparative genomics to infer genome function relies on the understanding of how different components of the genome change over evolutionary time. The aim of such comparative analysis is to identify conserved, functionally transcribed sequences such as protein-coding genes and non-coding RNA genes, and other functional sequences such as regulatory regions, as well as other genomic features. Here, we have compared the entire human chromosome 21 with syntenic regions of the mouse genome, and have identified a large number of conserved blocks of unknown function. Although previous studies have made similar observations, it is unknown whether these conserved sequences are genes or not. Here we present an extensive experimental and computational analysis of human chromosome 21 in an effort to assign function to sequences conserved between human chromosome 21 (ref. 8) and the syntenic mouse regions. Our data support the presence of a large number of potentially functional non-genic sequences, probably regulatory and structural. The integration of the properties of the conserved components of human chromosome 21 to the rapidly accumulating functional data for this chromosome will improve considerably our understanding of the role of sequence conservation in mammalian genomes.
Abstract:
The relative occurrence of genetic variants of human alpha 1-acid glycoprotein (AGP) in relation to changes in glycosylation was studied in sera of patients with burn injury, media of cytokine-treated primary cultures of human hepatocytes and Hep 3B cells, and sera of transgenic mice expressing the human AGP-A gene. It is concluded (i) that the glycosylation of AGP was not dependent on its genetic expression and (ii) that both the variants determined by the AGP-A gene as well as by the AGP-B/B' genes are increased after inflammation or treatment with interleukins 1 and 6.
Abstract:
Copy number variation (CNV) has recently gained considerable interest as a source of genetic variation likely to play a role in phenotypic diversity and evolution. Much effort has been put into the identification and mapping of regions that vary in copy number among seemingly normal individuals in humans and a number of model organisms, using bioinformatics or hybridization-based methods. These efforts have uncovered associations between copy number changes and complex diseases in whole-genome association studies, and have identified new genomic disorders. At the genome-wide scale, however, the functional impact of CNV remains poorly studied. Here we review the current catalogs of CNVs, their association with diseases and how they link genotype and phenotype. We describe initial evidence which revealed that genes in CNV regions are expressed at lower and more variable levels than genes mapping elsewhere, and also that CNV not only affects the expression of genes varying in copy number, but also has a global influence on the transcriptome. Further studies are warranted for complete cataloguing and fine mapping of CNVs, as well as to elucidate the different mechanisms by which they influence gene expression.
Abstract:
BACKGROUND: There is an ever-increasing volume of data on host genes that are modulated during HIV infection, influence disease susceptibility or carry genetic variants that impact HIV infection. We created GuavaH (Genomic Utility for Association and Viral Analyses in HIV, http://www.GuavaH.org), a public resource that supports multipurpose analysis of genome-wide genetic variation and gene expression profiles across multiple phenotypes relevant to HIV biology. FINDINGS: We included original data from 8 genome and transcriptome studies addressing viral and host responses in vivo and ex vivo. These studies cover phenotypes such as HIV acquisition, plasma viral load, disease progression, viral replication cycle, latency and viral-host genome interaction. This represents genome-wide association data from more than 4,000 individuals, exome sequencing data from 392 individuals, in vivo transcriptome microarray data from 127 patients/conditions, and 60 sets of RNA-seq data. Additionally, GuavaH allows visualization of protein variation in ~8,000 individuals from the general population. The publicly available GuavaH framework supports queries on (i) unique single nucleotide polymorphisms across different HIV-related phenotypes, (ii) gene structure and variation, (iii) in vivo gene expression in the setting of human infection (CD4+ T cells), and (iv) in vitro gene expression data in models of permissive infection, latency and reactivation. CONCLUSIONS: The complexity of the analysis of host genetic influences on HIV biology and pathogenesis calls for comprehensive search engines operating on curated data. The tool developed here allows queries and supports validation of the rapidly growing body of host genomic information pertinent to HIV research.
Abstract:
Numerous genetic loci have been associated with systolic blood pressure (SBP) and diastolic blood pressure (DBP) in Europeans. We now report genome-wide association studies of pulse pressure (PP) and mean arterial pressure (MAP). In discovery (N = 74,064) and follow-up studies (N = 48,607), we identified at genome-wide significance (P = 2.7 × 10(-8) to P = 2.3 × 10(-13)) four new PP loci (at 4q12 near CHIC2, 7q22.3 near PIK3CG, 8q24.12 in NOV and 11q24.3 near ADAMTS8), two new MAP loci (3p21.31 in MAP4 and 10q25.3 near ADRB1) and one locus associated with both of these traits (2q24.3 near FIGN) that has also recently been associated with SBP in east Asians. For three of the new PP loci, the estimated effect for SBP was opposite of that for DBP, in contrast to the majority of common SBP- and DBP-associated variants, which show concordant effects on both traits. These findings suggest new genetic pathways underlying blood pressure variation, some of which may differentially influence SBP and DBP.
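To make the relationship between the four traits concrete, here is a small sketch. The abstract does not give formulas, so the conventional definitions are assumed: PP = SBP − DBP, and the common clinical approximation MAP = DBP + PP/3. The per-allele effect sizes are hypothetical numbers chosen only to illustrate the arithmetic.

```python
def pulse_pressure(sbp, dbp):
    """Pulse pressure: systolic minus diastolic (mmHg)."""
    return sbp - dbp


def mean_arterial_pressure(sbp, dbp):
    """Common approximation: diastolic plus one third of pulse pressure (mmHg)."""
    return dbp + (sbp - dbp) / 3.0


def derived_effects(beta_sbp, beta_dbp):
    """Per-allele effects on PP and MAP implied by linear effects on SBP and DBP."""
    beta_pp = beta_sbp - beta_dbp
    beta_map = beta_dbp + (beta_sbp - beta_dbp) / 3.0
    return beta_pp, beta_map


# A reading of 120/80 mmHg.
print(pulse_pressure(120, 80), round(mean_arterial_pressure(120, 80), 1))

# Hypothetical variants (mmHg per allele), not values from the study.
concordant = derived_effects(0.5, 0.3)   # raises both SBP and DBP
opposite = derived_effects(0.3, -0.2)    # raises SBP, lowers DBP
print(concordant, opposite)
```

Under these assumptions, opposite effects on SBP and DBP add up in PP and largely cancel in MAP, while concordant effects do the reverse. This is one way to read why a PP scan can surface loci that SBP and DBP scans miss.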
Abstract:
Searching for matches between large collections of short (14-30 nucleotides) words and sequence databases comprising full genomes or transcriptomes is a common task in biological sequence analysis. We investigated the performance of simple indexing strategies for handling such tasks and developed two programs, fetchGWI and tagger, that index either the database or the query set. Either strategy outperforms megablast for searches with more than 10,000 probes. FetchGWI is shown to be a versatile tool for rapidly searching multiple genomes, whose performance is limited in most cases by the speed of access to the filesystem. We have made publicly available a Web interface for searching the human, mouse, and several other genomes and transcriptomes with oligonucleotide queries.
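The two indexing strategies mentioned above can be sketched as follows. This is an illustrative toy, not the fetchGWI or tagger implementation: it assumes exact matching, a single fixed word length, forward-strand search only, and in-memory Python dictionaries. The function names and the toy sequence are hypothetical.

```python
from collections import defaultdict


def index_database(sequences, word_len):
    """Map every word of length word_len to its (sequence name, offset) hits."""
    index = defaultdict(list)
    for name, seq in sequences.items():
        for pos in range(len(seq) - word_len + 1):
            index[seq[pos:pos + word_len]].append((name, pos))
    return index


def search_with_database_index(index, probes):
    """Strategy 1: look each probe up in a prebuilt index of the database."""
    return {probe: index.get(probe, []) for probe in probes}


def search_with_query_index(sequences, probes, word_len):
    """Strategy 2: index the probe set, then scan the database once."""
    wanted = set(probes)
    hits = {probe: [] for probe in probes}
    for name, seq in sequences.items():
        for pos in range(len(seq) - word_len + 1):
            word = seq[pos:pos + word_len]
            if word in wanted:
                hits[word].append((name, pos))
    return hits


# Toy data; every probe must have length word_len in this sketch.
genome = {"chrToy": "ACGTACGTTAGCACGT"}
probes = ["ACGT", "TAGC", "GGGG"]

by_db = search_with_database_index(index_database(genome, 4), probes)
by_query = search_with_query_index(genome, probes, 4)
print(by_db == by_query)
```

Indexing the database pays a one-time build cost and then answers each probe with a lookup, whereas indexing the queries needs no precomputed index but reads the whole database on every run. Which one is faster in practice depends on the database size, the number of probes, and storage speed.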