26 results for Human Genomics
Abstract:
MicroRNAs (miRNAs) constitute an important class of gene regulators. While models have been proposed to explain their appearance and expansion, the validation of these models has been difficult due to the lack of comparative studies. Here, we analyze miRNA evolutionary patterns in two mammals, human and mouse, in relation to the age of miRNA families. In this comparative framework, we confirm some predictions of previously advanced models of miRNA evolution, e.g. that miRNAs arise more frequently de novo than by duplication, or that the number of protein-coding genes targeted by miRNAs decreases with evolutionary time. We also corroborate that miRNAs display an increase in expression level with evolutionary time; however, we show that this relation is largely tissue-dependent, and especially low in embryonic or nervous tissues. We identify a bias of tag-sequencing techniques regarding the assessment of breadth of expression, leading us, contrary to predictions, to find more tissue-specific expression of older miRNAs. Together, our results refine the models used so far to depict the evolution of miRNA genes. They underline the role of tissue-specific selective forces on the evolution of miRNAs, as well as the potential co-evolution patterns between miRNAs and the protein-coding genes they target.
Abstract:
BACKGROUND & AIMS: Regulation of gene expression in the follicle-associated epithelium (FAE) over Peyer's patches is largely unknown. CCL20, a chemokine that recruits immature dendritic cells, is one of the few FAE-specific markers described so far. Lymphotoxin beta (LTalpha1beta2) expressed on the membrane of immune cells triggers CCL20 expression in enterocytes. In this study, we measured expression profiles of LTalpha1beta2-treated intestinal epithelial cells and selected CCL20-coregulated genes to identify new FAE markers. METHODS: Genomic profiles of T84 and Caco-2 cell lines treated with either LTalpha1beta2, flagellin, or tumor necrosis factor alpha were measured using the Affymetrix GeneChip U133A. Clustering analysis was used to select CCL20-coregulated genes, and laser dissection microscopy and real-time polymerase chain reaction on human biopsy specimens were used to assess the expression of the selected markers. RESULTS: Applying a 2-way analysis of variance, we identified genes regulated by the different treatments. A subset of genes involved in inflammation and related to the nuclear factor kappaB pathway was coregulated with CCL20. Among these genes, the antiapoptotic factor TNFAIP3 was highly expressed in the FAE. CCL23, which was not coregulated in vitro with CCL20, was also specifically expressed in the FAE. CONCLUSIONS: We have identified 2 novel genes specifically expressed in the human FAE. Most of the CCL20-coregulated genes did not show FAE-specific expression, suggesting that other signaling pathways are critical to modulate FAE-specific gene expression.
Abstract:
Pneumocystis jirovecii is a fungus belonging to a basal lineage of the Ascomycotina, the Taphrinomycotina subphylum. It is a parasite specific to humans that dwells primarily in the lung and can cause severe pneumonia in individuals with a debilitated immune system. Despite its clinical importance, many aspects of its biology remain poorly understood, at least in part because of the lack of a continuous in vitro cultivation system. The present thesis consists of the genome reconstruction and comparative genomics of P. jirovecii. It is made of three parts: (i) the de novo sequencing of the P. jirovecii genome starting from a single bronchoalveolar lavage fluid of a single patient, (ii) the de novo sequencing of the genome of the plant pathogen Taphrina deformans, a fungus closely related to P. jirovecii, and (iii) the genome-scale comparison of P. jirovecii to other Taphrinomycotina members. Enrichment in P. jirovecii cells by immuno-precipitation, whole DNA random amplification, two complementary high-throughput DNA sequencing methods, and in silico sorting and assembly of sequences were used for the de novo reconstruction of the P. jirovecii genome from the microbiota of a single clinical specimen. An iterative ad hoc pipeline as well as numerical simulations were used to recover P. jirovecii sequences while purging contaminants and assembly or amplification chimeras. This strategy produced an 8.1 Mb assembly, which encodes 3,898 genes. Homology searches, mapping on biochemical pathway atlases, and manual validations revealed that this genome lacks (i) most of the enzymes dedicated to amino acid biosynthesis, and (ii) most virulence factors observed in other fungi, e.g. the glyoxylate shunt pathway and specific peptidases involved in the degradation of the host cell membrane.
The same analyses applied to the available genomic sequences from Pneumocystis carinii, the species infecting rats, and Pneumocystis murina, the species infecting mice, revealed the same deficiencies. The genome sequencing of T. deformans yielded a 13 Mb assembly, which encodes 5,735 genes. T. deformans possesses enzymes involved in plant cell wall degradation, secondary metabolism, the glyoxylate cycle, detoxification, sterol biosynthesis, as well as the biosynthesis of plant hormones such as abscisic acid or indole-3-acetic acid. T. deformans also harbors gene subsets that have counterparts in plant saprophytes or pathogens, which is consistent with its alternating saprophytic and pathogenic lifestyles. Mating genes were also identified. The homothallism of this fungus suggests a mating-type switching mechanism. Comparative analyses indicated that 81% of P. jirovecii genes are shared with eight other Taphrinomycotina members, including T. deformans, P. carinii and P. murina. These genes are mostly involved in housekeeping activities. The genes specific to the Pneumocystis genus represent 8%, and are involved in RNA metabolism and signaling. Signaling is known to be crucial for the interaction of Pneumocystis spp. with their environment. Eleven percent are unique to P. jirovecii and encode mostly proteins of unknown function. These genes, in conjunction with others (e.g. the major surface glycoproteins), might govern the interaction of P. jirovecii with its human host cells, and potentially be responsible for the host specificity. P. jirovecii exhibits a genome of reduced size with a low GC content, and most probably scavenges vital compounds such as amino acids and cholesterol from human lungs. Consistently, its genome encodes a large set of transporters (ca. 22% of its genes), which may play a pivotal role in the acquisition of these compounds. All these features are generally observed in obligate parasites of various kingdoms (bacteria, protozoa, fungi).
Moreover, epidemiological studies have failed to find evidence of a free-living form of the fungus, and Pneumocystis spp. have been shown to have co-evolved with their hosts. Given also the lack of virulence factors, our observations strongly suggest that P. jirovecii is an obligate parasite specialized in the colonization of human lungs, which causes disease only in individuals with a compromised immune system. The same conclusion most likely holds for all other Pneumocystis spp. in their respective mammalian hosts.
Abstract:
BACKGROUND: Millions of humans and animals suffer from superficial infections caused by a group of highly specialized filamentous fungi, the dermatophytes, which exclusively infect keratinized host structures. To provide broad insights into the molecular basis of the pathogenicity-associated traits, we report the first genome sequences of two phylogenetically closely related dermatophytes, Arthroderma benhamiae and Trichophyton verrucosum, both of which induce highly inflammatory infections in humans. RESULTS: 97% of the 22.5 megabase genome sequences of A. benhamiae and T. verrucosum are unambiguously alignable and collinear. To unravel dermatophyte-specific virulence-associated traits, we compared sets of potentially pathogenicity-associated proteins, such as secreted proteases and enzymes involved in secondary metabolite production, with those of closely related Onygenales (Coccidioides species) and the mould Aspergillus fumigatus. The comparisons revealed expansion of several gene families in dermatophytes and disclosed the peculiarities of the dermatophyte secondary metabolite gene sets. Secretion of proteases and other hydrolytic enzymes by A. benhamiae was demonstrated experimentally by a global secretome analysis during keratin degradation. Molecular insights into the interaction of A. benhamiae with human keratinocytes were obtained for the first time by global transcriptome profiling. Given that A. benhamiae is able to undergo mating, a detailed comparison of the genomes further unraveled the genetic basis of sexual reproduction in this species. CONCLUSIONS: Our results shed light on the genetic basis of fundamental and putatively virulence-related traits of dermatophytes, advancing future research on these medically important pathogens.
Abstract:
Eukaryotic cells make many types of primary and processed RNAs that are found either in specific subcellular compartments or throughout the cells. A complete catalogue of these RNAs is not yet available and their characteristic subcellular localizations are also poorly understood. Because RNA represents the direct output of the genetic information encoded by genomes and a significant proportion of a cell's regulatory capabilities are focused on its synthesis, processing, transport, modification and translation, the generation of such a catalogue is crucial for understanding genome function. Here we report evidence that three-quarters of the human genome is capable of being transcribed, as well as observations about the range and levels of expression, localization, processing fates, regulatory regions and modifications of almost all currently annotated and thousands of previously unannotated RNAs. These observations, taken together, prompt a redefinition of the concept of a gene.
Abstract:
Progress in genomics, in particular high-throughput next-generation sequencing, is revolutionizing oncology. The impact of these techniques is seen, on the one hand, in the identification of germline mutations that predispose to a given type of cancer, allowing for personalized care of patients or healthy carriers and, on the other hand, in the characterization of all acquired somatic mutations of the tumor cell, opening the door to personalized treatment targeting the driver oncogenes. In both cases, next-generation sequencing techniques allow a global approach whereby the entirety of the genome's mutations is analyzed and correlated with the clinical data. The benefits for the quality of care delivered to our patients are extremely impressive.
Abstract:
BACKGROUND: Cleavage of messenger RNA (mRNA) precursors is an essential step in mRNA maturation. The signal recognized by the cleavage enzyme complex has been characterized as an A-rich region upstream of the cleavage site containing a motif with consensus AAUAAA, followed by a U- or UG-rich region downstream of the cleavage site. RESULTS: We studied these signals using exhaustive databases of cleavage sites obtained by aligning raw expressed sequence tag (EST) sequences to genomic sequences in Homo sapiens and Drosophila melanogaster. These data show that the polyadenylation signal is highly conserved in human and fly. In addition, de novo motif searches generated a refined description of the U-rich downstream sequence element (DSE), which shows more divergence between the two species. These refined motifs are applied, within a Hidden Markov Model (HMM) framework, to predict mRNA cleavage sites. CONCLUSION: We demonstrate that the DSE is a specific motif in both human and Drosophila. These findings shed light on the sequence correlates of a highly conserved biological process, and improve in silico prediction of 3' mRNA cleavage and polyadenylation sites.
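The signal layout described in this abstract (an AAUAAA motif upstream of the cleavage site, then a U-rich region downstream) can be illustrated with a minimal rule-based scan. This is only a sketch and not the HMM used by the authors: the function name, the 10 to 30 nt gap between the motif and the site, the 20 nt downstream window, and the 50% U threshold are all assumptions chosen for illustration.

```python
def find_candidate_cleavage_sites(seq, signal="AAUAAA", gap=(10, 30),
                                  window=20, min_u_fraction=0.5):
    """Return candidate cleavage positions (0-based) in an RNA or DNA string.

    A position qualifies when it lies gap[0]..gap[1] nt after the end of an
    AAUAAA motif and the `window` nt starting at it are at least
    `min_u_fraction` uridine. All thresholds are illustrative assumptions.
    """
    rna = seq.upper().replace("T", "U")  # accept DNA input as well
    sites = []
    start = rna.find(signal)
    while start != -1:
        signal_end = start + len(signal)
        for offset in range(gap[0], gap[1] + 1):
            site = signal_end + offset
            downstream = rna[site:site + window]
            if len(downstream) < window:
                break  # ran off the end of the sequence
            if downstream.count("U") / window >= min_u_fraction:
                sites.append(site)
                break  # keep only the first qualifying site per motif
        start = rna.find(signal, start + 1)
    return sites


if __name__ == "__main__":
    toy = "GC" * 5 + "AAUAAA" + "C" * 15 + "U" * 20 + "G" * 10
    print(find_candidate_cleavage_sites(toy))
```

A fixed-threshold scan like this ignores the position-specific motif statistics that a probabilistic model captures, so it is useful only for showing the expected arrangement of the two signals.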
Abstract:
The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.
Abstract:
The use of comparative genomics to infer genome function relies on the understanding of how different components of the genome change over evolutionary time. The aim of such comparative analysis is to identify conserved, functionally transcribed sequences such as protein-coding genes and non-coding RNA genes, and other functional sequences such as regulatory regions, as well as other genomic features. Here, we have compared the entire human chromosome 21 with syntenic regions of the mouse genome, and have identified a large number of conserved blocks of unknown function. Although previous studies have made similar observations, it is unknown whether these conserved sequences are genes or not. Here we present an extensive experimental and computational analysis of human chromosome 21 in an effort to assign function to sequences conserved between human chromosome 21 (ref. 8) and the syntenic mouse regions. Our data support the presence of a large number of potentially functional non-genic sequences, probably regulatory and structural. The integration of the properties of the conserved components of human chromosome 21 to the rapidly accumulating functional data for this chromosome will improve considerably our understanding of the role of sequence conservation in mammalian genomes.
Abstract:
The study of immunity against infection can be framed in the context of genomics. First, long-term association with pathogens leaves genomic signatures that result from positive selection. Evolutionary pressures tailor species or individual responses to pathogens, which may be associated with skewed patterns of immunity. Second, recent human population expansion carries an increasing burden of genetic mutation that can result in sporadic immunodeficiencies, and more generally, in diversity in susceptibility to infection. This review highlights current concepts and tools for the analysis of genomes and stresses the relevance of these approaches to immunity.
Abstract:
Protein-coding genes evolve at different rates, and the influence of different parameters, from gene size to expression level, has been extensively studied. While in yeast gene expression level is the major causal factor of gene evolutionary rate, the situation is more complex in animals. Here we investigate these relations further, especially taking into account gene expression in different organs as well as indirect correlations between parameters. We used RNA-seq data from two large datasets, covering 22 mouse tissues and 27 human tissues. Over all tissues, evolutionary rate correlates only weakly with levels and breadth of expression. The strongest explanatory factors of purifying selection are GC content, expression in many developmental stages, and expression in brain tissues. While the main component of evolutionary rate is purifying selection, we also find tissue-specific patterns for sites under neutral evolution and for positive selection. We observe fast evolution of genes expressed in testis, but also in other tissues, notably liver, which is explained by weak purifying selection rather than by positive selection.