119 results for Enriched genomic library
Abstract:
Chromosomal inversion polymorphisms are common in animals and plants, and recent models suggest that alternative arrangements spread by capturing different combinations of alleles acting additively or epistatically to favour local adaptation. It is also thought that inversions typically maintain favoured combinations for a long time by suppressing recombination between alternative chromosomal arrangements. Here, we consider patterns of linkage disequilibrium and genetic divergence in an old inversion polymorphism in Drosophila melanogaster (In(3R)Payne) known to be associated with climate change adaptation and a recent invasion event into Australia. We extracted, karyotyped and sequenced whole chromosomes from two Australian populations, so that changes in the arrangement of the alleles between geographically separated tropical and temperate areas could be compared. Chromosome-wide linkage disequilibrium (LD) analysis revealed strong LD within the region spanned by In(3R)Payne. This genomic region also showed strong differentiation between the tropical and the temperate populations, but no differentiation between different karyotypes from the same population, after controlling for chromosomal arrangement. Patterns of differentiation across the chromosome arm and in gene ontologies were enhanced by the presence of the inversion. These data support the notion that inversions are strongly selected by bringing together combinations of genes, but it is still not clear if such combinations act additively or epistatically. Our data suggest that climatic adaptation through inversions can be dynamic, reflecting changes in the relative abundance of different forms of an inversion and ongoing evolution of allelic content within an inversion.
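As an illustration of the kind of pairwise statistic that underlies a chromosome-wide LD analysis, the following minimal sketch computes r² between two biallelic loci. It assumes phased haplotypes coded 0/1 (plausible here, since whole chromosomes were sequenced); the function name `ld_r2` and the input layout are hypothetical and are not taken from the study.

```python
from typing import Sequence


def ld_r2(locus_a: Sequence[int], locus_b: Sequence[int]) -> float:
    """Return r^2 between two biallelic loci coded 0/1 across phased haplotypes."""
    if len(locus_a) != len(locus_b) or len(locus_a) == 0:
        raise ValueError("loci must be non-empty and of equal length")
    n = len(locus_a)
    p_a = sum(locus_a) / n                      # frequency of allele 1 at locus A
    p_b = sum(locus_b) / n                      # frequency of allele 1 at locus B
    p_ab = sum(a * b for a, b in zip(locus_a, locus_b)) / n  # frequency of the 1-1 haplotype
    d = p_ab - p_a * p_b                        # coefficient of linkage disequilibrium D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    return d * d / denom


if __name__ == "__main__":
    # Two perfectly associated loci, then two independent ones (toy data).
    print(ld_r2([0, 0, 1, 1], [0, 0, 1, 1]))
    print(ld_r2([0, 1, 0, 1], [0, 0, 1, 1]))
```

Applied to every pair of SNPs along a chromosome arm, values of this kind would be expected to stay high inside a region where recombination between arrangements is suppressed.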
Abstract:
Alternative splicing produces multiple isoforms from the same gene, thus increasing the number of transcripts of the species. Alternative splicing is a virtually ubiquitous mechanism in eukaryotes; for example, more than 90% of protein-coding genes in humans are alternatively spliced. Recent evolutionary studies showed that alternative splicing is a fast-evolving and highly species-specific mechanism. The rapid evolution of alternative splicing has been considered a contribution to the phenotypic diversity between species. However, the function of many isoforms produced by alternative splicing remains unclear, and they might be the result of noisy splicing. Thus, the functional relevance of alternative splicing and the evolutionary mechanisms of its rapid divergence among species are still poorly understood. During my thesis, I performed a large-scale analysis of the regulatory mechanisms that drive the rapid evolution of alternative splicing. To study the evolution of alternative splicing regulatory mechanisms, I used an extensive RNA-sequencing dataset comprising 12 tetrapod species (human, chimpanzee, bonobo, gorilla, orangutan, macaque, marmoset, mouse, opossum, platypus, chicken and frog) and 8 tissues (cerebellum, brain, heart, kidney, liver, testis, placenta and ovary). To identify the catalogue of alternative splicing cis-acting regulatory elements in the different tetrapod species, I used a previously defined computational approach. This approach is a statistical analysis of exon/intron and splice-site composition and relies on a principle of compensation between splice-site strength and the presence of additional regulators. With an evolutionary comparative analysis of the exonic cis-acting regulators, I showed that these regulatory elements are generally shared among primates and more conserved than non-regulatory elements. In addition, I showed that the usage of these regulatory elements is also more conserved than expected by chance. 
In addition to the identification of species-specific cis-acting regulators, these results may explain the rapid evolution of alternative splicing. I also developed a new approach based on evolutionary sequence changes and corresponding alternative splicing changes to identify potential splicing cis-acting regulators in primates. The identification of lineage-specific substitutions and corresponding lineage-specific alternative splicing changes allowed me to annotate the genomic sequences that might have played a role in the alternative splicing pattern differences among primates. Finally, I showed that the identified splicing cis-acting regulator datasets are enriched in human disease-causing mutations, thus confirming their biological relevance.
Abstract:
BACKGROUND: Known antiretroviral restriction factors are encoded by genes that are under positive selection pressure, induced during HIV-1 infection, up-regulated by interferons, and/or interact with viral proteins. To identify potential novel restriction factors, we performed genome-wide scans for human genes sharing molecular and evolutionary signatures of known restriction factors and tested the anti-HIV-1 activity of the most promising candidates. RESULTS: Our analyses identified 30 human genes that share characteristics of known restriction factors. Functional analyses of 27 of these candidates showed that over-expression of a strikingly high proportion of them significantly inhibited HIV-1 without causing cytotoxic effects. Five factors (APOL1, APOL6, CD164, TNFRSF10A, TNFRSF10D) suppressed infectious HIV-1 production in transfected 293T cells by >90% and six additional candidates (FCGR3A, CD3E, OAS1, GBP5, SPN, IFI16) achieved this when the virus was lacking intact accessory vpr, vpu and nef genes. Unexpectedly, over-expression of two factors (IL1A, SP110) significantly increased infectious HIV-1 production. Mechanistic studies suggest that the newly identified potential restriction factors act at different steps of the viral replication cycle, including proviral transcription and production of viral proteins. Finally, we confirmed that mRNA expression of most of these candidate restriction factors in primary CD4+ T cells is significantly increased by type I interferons. CONCLUSIONS: A limited number of human genes share multiple characteristics of genes encoding for known restriction factors. Most of them display anti-retroviral activity in transient transfection assays and are expressed in primary CD4+ T cells.
Abstract:
Experimental models demonstrated that therapeutic induction of CD8 T cell responses may offer protection against tumors or infectious diseases providing that T cells have sufficiently high TCR/CD8:pMHC avidity for efficient Ag recognition and consequently strong immune functions. However, comprehensive characterization of TCR/CD8:pMHC avidity in clinically relevant situations has remained elusive. In this study, using the novel NTA-His tag-containing multimer technology, we quantified the TCR:pMHC dissociation rates (koff) of tumor-specific vaccine-induced CD8 T cell clones (n = 139) derived from seven melanoma patients vaccinated with IFA, CpG, and the native/EAA or analog/ELA Melan-A(MART-1)(26-35) peptide, binding with low or high affinity to MHC, respectively. We observed substantial correlations between koff and Ca(2+) mobilization (p = 0.016) and target cell recognition (p < 0.0001), with the latter independently of the T cell differentiation state. Our strategy was successful in demonstrating that the type of peptide impacted on TCR/CD8:pMHC avidity, as tumor-reactive T cell clones derived from patients vaccinated with the low-affinity (native) peptide expressed slower koff rates than those derived from patients vaccinated with the high-affinity (analog) peptide (p < 0.0001). Furthermore, we observed that the low-affinity peptide promoted the selective differentiation of tumor-specific T cells bearing TCRs with high TCR/CD8:pMHC avidity (p < 0.0001). Altogether, TCR:pMHC interaction kinetics correlated strongly with T cell functions. Our study demonstrates the feasibility and usefulness of TCR/CD8:pMHC avidity assessment by NTA-His tag-containing multimers of naturally occurring polyclonal T cell responses, which represents a strong asset for the development of immunotherapy.
Abstract:
CcrM is an orphan DNA methyltransferase nearly universally conserved in a vast group of Alphaproteobacteria. In Caulobacter crescentus, it controls the expression of key genes involved in the regulation of the cell cycle and cell division. Here, we demonstrate, using an experimental evolution approach, that C. crescentus can significantly compensate, through easily accessible genetic changes like point mutations, for the severe loss in fitness due to the absence of CcrM, quickly improving its growth rate and cell morphology in rich medium. By analyzing the compensatory mutations genome-wide in 12 clones sampled from independent ΔccrM populations evolved for ~300 generations, we demonstrated that each of the twelve clones carried at least one mutation that potentially stimulated ftsZ expression, suggesting that the low intracellular levels of FtsZ are the major burden of ΔccrM mutants. In addition, we demonstrate that the phosphoenolpyruvate-carbohydrate phosphotransfer system (PTS) actually modulates ftsZ and mipZ transcription, uncovering a previously unsuspected link between metabolic regulation and cell division in Alphaproteobacteria. We present evidence that point mutations found in genes encoding proteins of the PTS provide the strongest fitness advantage to ΔccrM cells cultivated in rich medium despite being disadvantageous in minimal medium. This environmental sign epistasis might prevent such mutations from getting fixed under changing natural conditions, adding a plausible explanation for the broad conservation of CcrM. IMPORTANCE: In bacteria, DNA methylation has a variety of functions, including the control of DNA replication and/or gene expression. The cell cycle-regulated DNA methyltransferase CcrM modulates the transcription of many genes and is critical for fitness in Caulobacter crescentus. Here, we used an original experimental evolution approach to determine which of its many targets make CcrM so important physiologically. 
We show that populations lacking CcrM evolve quickly, accumulating an excess of mutations affecting, directly or indirectly, the expression of the ftsZ cell division gene. This finding suggests that the most critical function of CcrM in C. crescentus is to promote cell division by enhancing FtsZ intracellular levels. During this work, we also discovered an unexpected link between metabolic regulation and cell division that might extend to other Alphaproteobacteria.
Abstract:
The discovery of long non-coding RNA (lncRNA) has dramatically altered our understanding of cancer. Here, we describe a comprehensive analysis of lncRNA alterations at transcriptional, genomic, and epigenetic levels in 5,037 human tumor specimens across 13 cancer types from The Cancer Genome Atlas. Our results suggest that the expression and dysregulation of lncRNAs are highly cancer type specific compared with protein-coding genes. Using the integrative data generated by this analysis, we present a clinically guided small interfering RNA screening strategy and a co-expression analysis approach to identify cancer driver lncRNAs and predict their functions. This provides a resource for investigating lncRNAs in cancer and lays the groundwork for the development of new diagnostics and treatments.
Abstract:
Owing to recent advances in genomic technologies, personalized oncology is poised to fundamentally alter cancer therapy. In this paradigm, the mutational and transcriptional profiles of tumors are assessed, and personalized treatments are designed based on the specific molecular abnormalities relevant to each patient's cancer. To date, such approaches have yielded impressive clinical responses in some patients. However, a major limitation of this strategy has also been revealed: the vast majority of tumor mutations are not targetable by current pharmacological approaches. Immunotherapy offers a promising alternative to exploit tumor mutations as targets for clinical intervention. Mutated proteins can give rise to novel antigens (called neoantigens) that are recognized with high specificity by patient T cells. Indeed, neoantigen-specific T cells have been shown to underlie clinical responses to many standard treatments and immunotherapeutic interventions. Moreover, studies in mouse models targeting neoantigens, and early results from clinical trials, have established proof of concept for personalized immunotherapies targeting next-generation sequencing identified neoantigens. Here, we review basic immunological principles related to T-cell recognition of neoantigens, and we examine recent studies that use genomic data to design personalized immunotherapies. We discuss the opportunities and challenges that lie ahead on the road to improving patient outcomes by incorporating immunotherapy into the paradigm of personalized oncology.
Abstract:
Menopause timing has a substantial impact on infertility and risk of disease, including breast cancer, but the underlying mechanisms are poorly understood. We report a dual strategy in ∼70,000 women to identify common and low-frequency protein-coding variation associated with age at natural menopause (ANM). We identified 44 regions with common variants, including two regions harboring additional rare missense alleles of large effect. We found enrichment of signals in or near genes involved in delayed puberty, highlighting the first molecular links between the onset and end of reproductive lifespan. Pathway analyses identified major association with DNA damage response (DDR) genes, including the first common coding variant in BRCA1 associated with any complex trait. Mendelian randomization analyses supported a causal effect of later ANM on breast cancer risk (∼6% increase in risk per year; P = 3 × 10^-14), likely mediated by prolonged sex hormone exposure rather than DDR mechanisms.
Abstract:
Ease of worldwide travel provides increased opportunities for organisms not only to colonize new environments but also to encounter related but diverged populations. Such events of reconnection and secondary contact of previously isolated populations are widely observed at different time scales. For example, during the quaternary glaciation, sea water level fluctuations caused temporal isolation of populations, often to be followed by secondary contact. At shorter time scales, population isolation and reconnection of viruses are commonly observed, and such events are often associated with epidemics and pandemics. Here, using coalescent theory and simulations, we describe the temporal impact of population reconnection after isolation on nucleotide differences and the site frequency spectrum, as well as common summary statistics of DNA variation. We identify robust genomic signatures of population reconnection after isolation. We utilize our development to infer the recent evolutionary history of human immunodeficiency virus 1 (HIV-1) in Asia and South America, successfully retrieving the successive HIV subtype colonization events in these regions. Our analysis reveals that divergent HIV-1 subtype populations are currently admixing in these regions, suggesting that HIV-1 may be undergoing a process of homogenization, contrary to popular belief.
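The quantities named above (site frequency spectrum, nucleotide differences, summary statistics) can be made concrete with a short sketch. It assumes that, for each segregating site, the number of sampled sequences carrying the derived allele is already known; the function names and the toy input are hypothetical and do not reproduce the authors' coalescent framework or simulations.

```python
from typing import List


def site_frequency_spectrum(derived_counts: List[int], n: int) -> List[int]:
    """sfs[i] = number of sites where the derived allele occurs in i of n sequences."""
    sfs = [0] * (n + 1)
    for count in derived_counts:
        if not 0 <= count <= n:
            raise ValueError("derived allele count outside 0..n")
        sfs[count] += 1
    return sfs


def pairwise_diversity(sfs: List[int], n: int) -> float:
    """Mean number of nucleotide differences between two sampled sequences."""
    pairs = n * (n - 1) / 2
    # A site with the derived allele in i sequences differs in i * (n - i) pairs.
    return sum(i * (n - i) * sites for i, sites in enumerate(sfs)) / pairs


def watterson_theta(sfs: List[int], n: int) -> float:
    """Watterson's estimator: segregating sites divided by the harmonic number a1."""
    segregating = sum(sfs[1:n])                 # classes 1..n-1 are polymorphic
    a1 = sum(1 / i for i in range(1, n))
    return segregating / a1


if __name__ == "__main__":
    n = 4
    sfs = site_frequency_spectrum([1, 1, 2, 3], n)   # toy data, four sites
    print(sfs, pairwise_diversity(sfs, n), watterson_theta(sfs, n))
```

Comparing estimators such as these two is the basis of common summary statistics of DNA variation; how they change over time after reconnection is what the study characterises.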
Abstract:
Ticks transmit more pathogens to humans and animals than any other arthropod. We describe the 2.1 Gbp nuclear genome of the tick, Ixodes scapularis (Say), which vectors pathogens that cause Lyme disease, human granulocytic anaplasmosis, babesiosis and other diseases. The large genome reflects accumulation of repetitive DNA, new lineages of retro-transposons, and gene architecture patterns resembling ancient metazoans rather than pancrustaceans. Annotation of scaffolds representing ∼57% of the genome reveals 20,486 protein-coding genes and expansions of gene families associated with tick-host interactions. We report insights from genome analyses into parasitic processes unique to ticks, including host 'questing', prolonged feeding, cuticle synthesis, blood meal concentration, novel methods of haemoglobin digestion, haem detoxification, vitellogenesis and prolonged off-host survival. We identify proteins associated with the agent of human granulocytic anaplasmosis, an emerging disease, and the encephalitis-causing Langat virus, and a population structure correlated to life-history traits and transmission of the Lyme disease agent.
Abstract:
The ability of Mycobacterium tuberculosis to establish a latent infection (LTBI) in humans confounds the treatment of tuberculosis. Consequently, there is a need to discover new therapeutic agents that can kill M. tuberculosis both during active disease and LTBI. The streptomycin-dependent strain of M. tuberculosis, 18b, provides a useful tool for this purpose since upon removal of streptomycin (STR) it enters a non-replicating state that mimics latency both in vitro and in animal models. The 4.41 Mb genome sequence of M. tuberculosis 18b was determined and this revealed the strain to belong to clade 3 of the ancient ancestral lineage of the Beijing family. STR-dependence was attributable to insertion of a single cytosine in the 530 loop of the 16S rRNA and to a single amino acid insertion in the N-terminal domain of initiation factor 3. RNA-seq was used to understand the genetic programme activated upon STR-withdrawal and hence to gain insight into LTBI. This revealed reconfiguration of gene expression and metabolic pathways showing strong similarities between non-replicating 18b and M. tuberculosis residing within macrophages, and with the core stationary phase and microaerophilic responses. The findings of this investigation confirm the validity of 18b as a model for LTBI, and provide insight into both the evolution of tubercle bacilli and the functioning of the ribosome.
Abstract:
In recent years, many protocols aimed at reproducibly sequencing reduced-genome subsets in non-model organisms have been published. Among them, RAD-sequencing is one of the most widely used. It relies on digesting DNA with specific restriction enzymes and performing size selection on the resulting fragments. Despite its acknowledged utility, this method is of limited use with degraded DNA samples, such as those isolated from museum specimens, as these samples are less likely to harbor fragments long enough to comprise two restriction sites, which is what makes possible the ligation of the adapter sequences (in the case of double-digest RAD) or the size selection of the resulting fragments (in the case of single-digest RAD). Here, we address these limitations by presenting a novel method called hybridization RAD (hyRAD). In this approach, biotinylated RAD fragments, covering a random fraction of the genome, are used as baits for capturing homologous fragments from genomic shotgun sequencing libraries. This simple and cost-effective approach allows sequencing of orthologous loci even from highly degraded DNA samples, opening new avenues of research in the field of museum genomics. Because it does not rely on the presence of restriction sites, it improves among-sample loci coverage. In a trial study, hyRAD allowed us to obtain a large set of orthologous loci from fresh and museum samples from a non-model butterfly species, with a high proportion of single nucleotide polymorphisms present in all eight analyzed specimens, including 58-year-old museum samples. The utility of the method was further validated using 49 museum and fresh samples of a Palearctic grasshopper species for which the spatial genetic structure was previously assessed using mtDNA amplicons. Finally, the application of the method is discussed in a wider context. 
As it does not rely on the presence of restriction sites, it is not sensitive to among-sample polymorphisms in the restriction sites, which usually cause loci dropout. This should enable the application of hyRAD to analyses at broader evolutionary scales.
Abstract:
LncRNAs are transcripts greater than 200 nucleotides in length with no apparent coding potential. They exert important regulatory functions in the genome. Their role in cardiac fibrosis is, however, unexplored. To identify lncRNAs that could modulate cardiac fibrosis, we profiled the long non-coding transcriptome in the infarcted mouse heart, and identified 1500 novel lncRNAs. These lncRNAs have unique characteristics such as high tissue and cell type specificity. Their expression is highly correlated with parameters of cardiac dimensions and function. The majority of these novel lncRNAs are conserved in humans. Importantly, human lncRNAs appear to be differentially expressed in heart disease. Using a computational pipeline, we identified a super-enhancer-associated lncRNA, which is dynamically expressed after myocardial infarction. We named this particular transcript Wisper for "Wisp2 super-enhancer-derived lncRNA". Interestingly, Wisper is overexpressed in cardiac fibroblasts as compared to cardiomyocytes or to fibroblasts isolated from organs other than the heart. The importance of Wisper in the biology of fibroblasts was demonstrated in knockdown experiments. Differentiation of cardiac fibroblasts into myofibroblasts in vitro is significantly impaired upon Wisper knockdown. Wisper downregulation in cardiac fibroblasts results in a dramatic reduction of fibrotic gene expression, diminished cell proliferation and an increase in apoptotic cell death. In vivo, depletion of Wisper during the acute phase of the response to infarction is detrimental because it increases the risk of cardiac rupture. On the other hand, Wisper knockdown following infarction in a prevention study reduces fibrosis and preserves cardiac function. Since WISPER is detectable in the human heart, where it is associated with severe cardiac fibrosis, these data suggest that Wisper could represent a novel therapeutic target for limiting the extent of the fibrotic response in the heart. 
-- Long non-coding RNAs (lncRNAs) are RNAs of more than 200 nucleotides that do not code for proteins. They exert important functions in the genome. However, their importance in the development of cardiac fibrosis has not been studied. To identify lncRNAs that play a role in this process, the non-coding transcriptome was studied in the mouse heart after myocardial infarction. We discovered 1500 new lncRNAs. These transcripts have unique characteristics. In particular, they are extremely specific to subpopulations of cardiac cells. Moreover, their expression is remarkably well correlated with the parameters that define heart dimensions and cardiac function. The majority of these lncRNAs are conserved in humans. Some are modulated in cardiac pathologies. Using a bioinformatic approach, we identified a lncRNA that is associated with enhancer sequences and that is particularly enriched in cardiac fibroblasts. This transcript was named Wisper for "Wisp2 super-enhancer-derived lncRNA". The importance of Wisper in the biology of cardiac fibroblasts is demonstrated in depletion experiments. In the absence of Wisper, the expression of proteins involved in the development of fibrosis is dramatically reduced in cardiac fibroblasts. These cells show reduced proliferation. The level of apoptosis is greatly increased. In vivo, depletion of Wisper during the acute phase of infarction raises the risk of cardiac rupture. In contrast, reducing Wisper expression during the chronic phase decreases cardiac fibrosis and improves heart function. Since Wisper is expressed in the human heart, this transcript represents a new therapeutic target for limiting the fibrotic response in the heart.
Abstract:
Clines in chromosomal inversion polymorphisms, presumably driven by climatic gradients, are common but there is surprisingly little evidence for selection acting on them. Here we address this long-standing issue in Drosophila melanogaster by using diagnostic single nucleotide polymorphism (SNP) markers to estimate inversion frequencies from 28 whole-genome Pool-seq samples collected from 10 populations along the North American east coast. Inversions In(3L)P, In(3R)Mo, and In(3R)Payne showed clear latitudinal clines, and for In(2L)t, In(2R)NS, and In(3R)Payne the steepness of the clinal slopes changed between summer and fall. Consistent with an effect of seasonality on inversion frequencies, we detected small but stable seasonal fluctuations of In(2R)NS and In(3R)Payne in a temperate Pennsylvanian population over 4 years. In support of spatially varying selection, we observed that the cline in In(3R)Payne has remained stable for >40 years and that the frequencies of In(2L)t and In(3R)Payne are strongly correlated with climatic factors that vary latitudinally, independent of population structure. To test whether these patterns are adaptive, we compared the amount of genetic differentiation of inversions versus neutral SNPs and found that the clines in In(2L)t and In(3R)Payne are maintained nonneutrally and independent of admixture. We also identified numerous clinal inversion-associated SNPs, many of which exhibit parallel differentiation along the Australian cline and reside in genes known to affect fitness-related traits. Together, our results provide strong evidence that inversion clines are maintained by spatially, and perhaps also temporally, varying selection. We interpret our data in light of current hypotheses about how inversions are established and maintained.
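One simple way to turn diagnostic SNP markers into an inversion frequency estimate for a pooled sample is sketched below. It assumes that, at each diagnostic site, the read count supporting the inversion-specific allele and the total read depth are available, and it averages the per-marker allele frequencies. The function name and input format are hypothetical, and the abstract does not state that the authors used exactly this estimator.

```python
from typing import Iterable, Tuple


def inversion_frequency(marker_counts: Iterable[Tuple[int, int]]) -> float:
    """Average inversion-specific allele frequency across diagnostic SNPs.

    Each item is (reads supporting the inversion-specific allele, total reads).
    """
    freqs = []
    for inv_reads, total_reads in marker_counts:
        if total_reads <= 0:
            continue                            # skip markers without coverage
        if not 0 <= inv_reads <= total_reads:
            raise ValueError("inversion read count outside 0..total")
        freqs.append(inv_reads / total_reads)
    if not freqs:
        raise ValueError("no diagnostic marker has coverage")
    return sum(freqs) / len(freqs)


if __name__ == "__main__":
    # Three hypothetical diagnostic SNPs in one Pool-seq sample.
    print(inversion_frequency([(10, 40), (20, 40), (30, 40)]))
```

Averaging over several markers reduces the sampling noise of any single site; a depth-weighted mean would be a reasonable alternative when coverage varies strongly between markers.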