233 resultados para gene duplication
em Université de Lausanne, Switzerland
Resumo:
RNA polymerase III (Pol III) occurs in two versions, one containing the POLR3G subunit and the other the closely related POLR3GL subunit. It is not clear whether these two Pol III forms have the same function, in particular whether they recognize the same target genes. We show that the POLR3G and POLR3GL genes arose from a DNA-based gene duplication, probably in a common ancestor of vertebrates. POLR3G- as well as POLR3GL-containing Pol III are present in cultured cell lines and in normal mouse liver, although the relative amounts of the two forms vary, with the POLR3G-containing Pol III relatively more abundant in dividing cells. Genome-wide chromatin immunoprecipitations followed by high-throughput sequencing (ChIP-seq) reveal that both forms of Pol III occupy the same target genes, in very constant proportions within one cell line, suggesting that the two forms of Pol III have a similar function with regard to specificity for target genes. In contrast, the POLR3G promoter-not the POLR3GL promoter-binds the transcription factor MYC, as do all other promoters of genes encoding Pol III subunits. Thus, the POLR3G/POLR3GL duplication did not lead to neo-functionalization of the gene product (at least with regard to target gene specificity) but rather to neo-functionalization of the transcription units, which acquired different mechanisms of regulation, thus likely affording greater regulation potential to the cell.
Resumo:
ABSTRACT : Gene duplication is a fundamental source of raw material for the origin of genetic novelty. It has been assumed for a long time that DNA-based gene duplication was the only source of new genes. Recently however, RNA-based gene duplication (retroposition) was shown in multiple organisms to contribute significantly to their genetic diversity. This mechanism produces intronless gene copies (retrocopies) that are inserted in random genomic position, independent of the position of the parental source genes. In human, mouse and fruit fly, it was demonstrated that the X-linked genes spawned an excess of functional retroposed gene copies (retrogenes). In human and mouse, the X chromosome also recruited an excess of retrogenes. Here we further characterized these interesting biases related to the X chromosome in mammals. Firstly, we have confirmed presence of the aforementioned biases in dog and opossum genome. Then based on the expression profile of retrogenes during various spermatogenetic stages, we have provided solid evidence that meiotic sex chromosome inactivation (MSCI) is responsible for an excess of retrogenes stemming from the X chromosome. Moreover, we showed that the X-linked genes started to export an excess of retrogenes just after the split of eutherian and marsupial mammalian lineages. This suggests that MSCI has originated around this time as well. More fundamentally, as MSCI reflects the spread of recombination barrier between the X and Y chromosomes during their evolution, our observation allowed us to re-estimate the age of mammalian sex chromosomes. Previous estimates suggested that they emerged in the common ancestor of all mammals (before the split of monotreme lineage); whereas, here we showed that they originated around the split of marsupial and eutherian lineages, after the divergence of monotremes. Thus, the therian (marsupial and eutherian) sex chromosomes are younger than previously thought. Thereafter, we have characterized the bias related to the recruitment of genes to the X chromosome. Sexually antagonistic forces are most likely driving this pattern. Using our limited retrogenes expression data, it is difficult to determine the exact nature of these forces but some conclusions have been made. Lastly, we looked at the history of this biased recruitment: it commenced around the split of marsupial and eutherian lineages (akin to the biased export of genes out of the X). In fact, the sexually antagonistic forces are predicted to appear just around that time as well. Thereby, the history of the recruitment of genes to the X, provides an indirect evidence that these forces are responsible for this bias.
Resumo:
Gene duplication leads to paralogy, which complicates the de novo assembly of genotyping-by-sequencing (GBS) data. The issue of paralogous genes is exacerbated in plants, because they are particularly prone to gene duplication events. Paralogs are normally filtered from GBS data before undertaking population genomics or phylogenetic analyses. However, gene duplication plays an important role in the functional diversification of genes and it can also lead to the formation of postzygotic barriers. Using populations and closely related species of a tropical mountain shrub, we examine 1) the genomic differentiation produced by putative orthologs, and 2) the distribution of recent gene duplication among lineages and geography. We find high differentiation among populations from isolated mountain peaks and species-level differentiation within what is morphologically described as a single species. The inferred distribution of paralogs among populations is congruent with taxonomy and shows that GBS could be used to examine recent gene duplication as a source of genomic differentiation of nonmodel species.
Resumo:
Abstract : Gene duplication is an essential source of material for the origin of genetic novelty and the evolution of lineage- or species-specific phenotypic traits. The reverse transcription of source gene mRNA followed by the genomic insertion of the resulting cDNA - retroposition - has provided the human genome with a significant number of gene copies during the last ~63 million years (MYA) of primate evolution. We estimated that at least 1 new functional gene (retrogene) per MYA emerged by retroposition in the primate lineage leading to humans. Using a combination of comparative sequencing and evolutionary simulations, we obtained strong evidence of functionality for 7 primate specific retrogenes. Most of these genes are specifically expressed in testis suggesting that retroposition has contributed with genetic raw material necessary for the evolution ofmale-specific functions in primates. We characterized CDC14Bretro (identified in the previous survey) that originated from the retroposition of a cell cycle gene - CDC14B - in the common ancestor of humans and apes. We demonstrate that CDC14Bretro experienced a period of intense positive selection in the African ape ancestor. By virtue of the amino acid substitutions that occurred during this period CDC 14Bretro adapted to a new subcellular compartment in African apes. Further analyses indicate that this subcellular shift reflects the evolution of anew functional role of CDC 14Bretro. Prompted by this result, we used yeast (Saccharomyces cerevisiae) to investigate on a global scale the extent of functional diversification of duplicate genes through the subcellular adaptation of their encoded proteins. We found that duplicate proteins frequently evolved new cellular localization patterns, either by partitioning of ancestral localizations ("sublocalization"), or more frequently by relocalization to previously unoccupied compartments ("neolocalization"). Interestingly, proteins involved in processes with a wider subcellular distribution more frequently evolved new localization patterns suggesting that subcellular localization changes are dependent on progenitor gene functions. Relocated proteins adapted to their new subcellular environments and evolved new functional roles through changes of their physio-chemical properties, expression levels, and interaction partners. Our work suggests an important role of subcellular adaptation for the emergence of new gene functions.
Resumo:
Gene copies that stem from the mRNAs of parental source genes have long been viewed as evolutionary dead-ends with little biological relevance. Here we review a range of recent studies that have unveiled a significant number of functional retroposed gene copies in both mammalian and some non-mammalian genomes. These studies have not only revealed previously unknown mechanisms for the emergence of new genes and their functions but have also provided fascinating general insights into molecular and evolutionary processes that have shaped genomes. For example, analyses of chromosomal gene movement patterns via RNA-based gene duplication have shed fresh light on the evolutionary origin and biology of our sex chromosomes.
Resumo:
Abstract : Gene duplication is an essential source of material for the origin of genetic novelties. The reverse transcription of source gene mRNA followed by the genomic insertion of the resulting cDNA - retroposition - has provided the human genome with at least ~3600 detectable retrocopies. We find that ~30% of these retrocopies are transcribed, generally in testes. Their transcription often relies on preexisting regulatory elements (or open chromatin) close to their insertion site, which is illustrated by mRNA molecules containing retrocopies fused to their neighboring genes. Retrocopies appear to have been profoundly shaped by selection. Consistently, human retrocopies with an intact open reading (ORF) are more often transcribed than retropseudogenes, which leads to a minimal estimate of 120 functional retrogenes present in our genome. We also performed an analysis of Ka/Ks for human retrocopies. This analysis demonstrates that several intact retrocopies evolved under purifying selection and yields an estimated formation rate of ~1 retrogene per million year in the primate lineage. Using DNA sequencing and evolutionary simulations, we have identified 7 such primate-specific retrogenes that emerged on the lineage leading to humans In therian genomes, we found an excess of retrogenes with X-linked parents. Expression analyses support the idea that this "out of X" movement was driven by natural selection to produce autosomal functional counterparts for X-linked genes, which are silenced during male meiosis. Phylogenetic dating of this "out of X" movement suggests that our sex chromosomes arose about 180 MYA ago and are thus much younger than previously thought. Finally, we have also analyzed young gene duplications (and deletions) that arose by non allelic-homologous recombination and are not fixed in species. Using wild-caught and laboratory animals, we detected thousands of DNA segments that are polymorphic in copy number in mice. These copy number variants were found to profoundly alter the transcriptome of several mouse tissues. Strikingly, their influence on gene expression is not limited to the gene they contain but seems to extend to genes located up to 1.5 million bases away.
Resumo:
Teleost fishes provide the first unambiguous support for ancient whole-genome duplication in an animal lineage. Studies in yeast or plants have shown that the effects of such duplications can be mediated by a complex pattern of gene retention and changes in evolutionary pressure. To explore such patterns in fishes, we have determined by phylogenetic analysis the evolutionary origin of 675 Tetraodon duplicated genes assigned to chromosomes, using additional data from other species of actinopterygian fishes. The subset of genes, which was retained in double after the genome duplication, is enriched in development, signaling, behavior, and regulation functional categories. The evolutionary rate of duplicate fish genes appears to be determined by 3 forces: 1) fish proteins evolve faster than mammalian orthologs; 2) the genes kept in double after genome duplication represent the subset under strongest purifying selection; and 3) following duplication, there is an asymmetric acceleration of evolutionary rate in one of the paralogs. These results show that similar mechanisms are at work in fishes as in yeast or plants and provide a framework for future investigation of the consequences of duplication in fishes and other animals.
Resumo:
We analyze here the relation between alternative splicing and gene duplication in light of recent genomic data, with a focus on the human genome. We show that the previously reported negative correlation between level of alternative splicing and family size no longer holds true. We clarify this pattern and show that it is sufficiently explained by two factors. First, genes progressively gain new splice variants with time. The gain is consistent with a selectively relaxed regime, until purifying selection slows it down as aging genes accumulate a large number of variants. Second, we show that duplication does not lead to a loss of splice forms, but rather that genes with low levels of alternative splicing tend to duplicate more frequently. This leads us to reconsider the role of alternative splicing in duplicate retention.
Resumo:
BACKGROUND: The evolutionary lineage leading to the teleost fish underwent a whole genome duplication termed FSGD or 3R in addition to two prior genome duplications that took place earlier during vertebrate evolution (termed 1R and 2R). Resulting from the FSGD, additional copies of genes are present in fish, compared to tetrapods whose lineage did not experience the 3R genome duplication. Interestingly, we find that ParaHox genes do not differ in number in extant teleost fishes despite their additional genome duplication from the genomic situation in mammals, but they are distributed over twice as many paralogous regions in fish genomes. RESULTS: We determined the DNA sequence of the entire ParaHox C1 paralogon in the East African cichlid fish Astatotilapia burtoni, and compared it to orthologous regions in other vertebrate genomes as well as to the paralogous vertebrate ParaHox D paralogons. Evolutionary relationships among genes from these four chromosomal regions were studied with several phylogenetic algorithms. We provide evidence that the genes of the ParaHox C paralogous cluster are duplicated in teleosts, just as it had been shown previously for the D paralogon genes. Overall, however, synteny and cluster integrity seems to be less conserved in ParaHox gene clusters than in Hox gene clusters. Comparative analyses of non-coding sequences uncovered conserved, possibly co-regulatory elements, which are likely to contain promoter motives of the genes belonging to the ParaHox paralogons. CONCLUSION: There seems to be strong stabilizing selection for gene order as well as gene orientation in the ParaHox C paralogon, since with a few exceptions, only the lengths of the introns and intergenic regions differ between the distantly related species examined. The high degree of evolutionary conservation of this gene cluster's architecture in particular - but possibly clusters of genes more generally - might be linked to the presence of promoter, enhancer or inhibitor motifs that serve to regulate more than just one gene. Therefore, deletions, inversions or relocations of individual genes could destroy the regulation of the clustered genes in this region. The existence of such a regulation network might explain the evolutionary conservation of gene order and orientation over the course of hundreds of millions of years of vertebrate evolution. Another possible explanation for the highly conserved gene order might be the existence of a regulator not located immediately next to its corresponding gene but further away since a relocation or inversion would possibly interrupt this interaction. Different ParaHox clusters were found to have experienced differential gene loss in teleosts. Yet the complete set of these homeobox genes was maintained, albeit distributed over almost twice the number of chromosomes. Selection due to dosage effects and/or stoichiometric disturbance might act more strongly to maintain a modal number of homeobox genes (and possibly transcription factors more generally) per genome, yet permit the accumulation of other (non regulatory) genes associated with these homeobox gene clusters.
Resumo:
The enzyme glutamate dehydrogenase (GDH) is important for recycling the chief excitatory neurotransmitter, glutamate, during neurotransmission. Human GDH exists in housekeeping and brain-specific isotypes encoded by the genes GLUD1 and GLUD2, respectively. Here we show that GLUD2 originated by retroposition from GLUD1 in the hominoid ancestor less than 23 million years ago. The amino acid changes responsible for the unique brain-specific properties of the enzyme derived from GLUD2 occurred during a period of positive selection after the duplication event.
Resumo:
Gene duplication and neofunctionalization are known to be important processes in the evolution of phenotypic complexity. They account for important evolutionary novelties that confer ecological adaptation, such as the major histocompatibility complex (MHC), a multigene family crucial to the vertebrate immune system. In birds, two MHC class II β (MHCIIβ) exon 3 lineages have been recently characterized, and two hypotheses for the evolutionary history of MHCIIβ lineages were proposed. These lineages could have arisen either by 1) an ancient duplication and subsequent divergence of one paralog or by 2) recent parallel duplications followed by functional convergence. Here, we compiled a data set consisting of 63 MHCIIβ exon 3 sequences from six avian orders to distinguish between these hypotheses and to understand the role of selection in the divergent evolution of the two avian MHCIIβ lineages. Based on phylogenetic reconstructions and simulations, we show that a unique duplication event preceding the major avian radiations gave rise to two ancestral MHCIIβ lineages that were each likely lost once later during avian evolution. Maximum likelihood estimation shows that following the ancestral duplication, positive selection drove a radical shift from basic to acidic amino acid composition of a protein domain facing the α-chain in the MHCII α β-heterodimer. Structural analyses of the MHCII α β-heterodimer highlight that three of these residues are potentially involved in direct interactions with the α-chain, suggesting that the shift following duplication may have been accompanied by coevolution of the interacting α- and β-chains. These results provide new insights into the long-term evolutionary relationships among avian MHC genes and open interesting perspectives for comparative and population genomic studies of avian MHC evolution.
Resumo:
In natural conditions, basidiomycete ectomycorrhizal fungi such as Laccaria bicolor are typically in the dikaryotic state when forming symbioses with trees, meaning that two genetically different individuals have to fuse or 'mate'. Nevertheless, nothing is known about the molecular mechanisms of mating in these ecologically important fungi. Here, advantage was taken of the first sequenced genome of the ectomycorrhizal fungus, Laccaria bicolor, to determine the genes that govern the establishment of cell-type identity and orchestrate mating. The L. bicolor mating type loci were identified through genomic screening. The evolutionary history of the genomic regions that contained them was determined by genome-wide comparison of L. bicolor sequences with those of known tetrapolar and bipolar basidiomycete species, and by phylogenetic reconstruction of gene family history. It is shown that the genes of the two mating type loci, A and B, are conserved across the Agaricales, but they are contained in regions of the genome with different evolutionary histories. The A locus is in a region where the gene order is under strong selection across the Agaricales. By contrast, the B locus is in a region where the gene order is likely under a low selection pressure but where gene duplication, translocation and transposon insertion are frequent.
Resumo:
The transformer (tra) gene is a key regulator in the signalling hierarchy controlling all aspects of somatic sexual differentiation in Drosophila and other insects. Here, we show that six of the seven sequenced ants have two copies of tra. Surprisingly, the two paralogues are always more similar within species than among species. Comparative sequence analyses indicate that this pattern is owing to the ongoing concerted evolution after an ancestral duplication rather than independent duplications in each of the six species. In particular, there was strong support for inter-locus recombination between the paralogues of the ant Atta cephalotes. In the five species where the location of paralogues is known, they are adjacent to each other in four cases and separated by only few genes in the fifth case. Because there have been extensive genomic rearrangements in these lineages, this suggests selection acting to conserve their synteny. In three species, we also find a signature of positive selection in one of the paralogues. In three bee species where information is available, the tra gene is also duplicated, the copies are adjacent and in at least one species there was recombination between paralogues. These results suggest that concerted evolution plays an adaptive role in the evolution of this gene family.
Resumo:
Secreted proteases constitute potential virulence factors of dermatophytes. A total of seven genes encoding putative serine proteases of the subtilisin family (SUB) were isolated in Trichophyton rubrum. Based on sequence data and intron-exon structure, a phylogenetic analysis of subtilisins from T. rubrum and other fungi revealed a presumed ancestral lineage comprising T. rubrum SUB2 and Aspergillus SUBs. All other SUBs (SUB1, SUB3-7) are dermatophyte-specific and have apparently emerged more recently, through successive gene duplication events. We showed that two subtilisins, Sub3 and Sub4, were detected in culture supernatants of T. rubrum grown in a medium containing soy protein as a sole nitrogen source. Both recombinant enzymes produced in Pichia pastoris are highly active on keratin azure suggesting that these proteases play an important role in invasion of keratinised tissues by the fungus. The set of deduced amino acid sequences of T. rubrum SUB ORFs allowed the identification of orthologous Subs secreted by other dermatophyte species using proteolysis and mass spectrometry.
Resumo:
Evolution of proteins after whole-genome duplicationGene and genome duplication are considered major mechanisms in the creation of newfunctions in genomes, or in the refinement of networks by the division of function amongmore genes. In animals, the best demonstrated whole genome duplication occurred at theorigin of Teleost fishes. This makes fishes an ideal model to study the consequences ofgenome duplication, particularly since we have a good sampling of genome sequences,abundant functional information, and a very well studied outgroup: the tetrapodes (includinghuman). More specifically, I studied the consequences of duplication on proteins usingevolutionary models to infer adaptive events. I analysed the influence of positive selection invertebrate genes, by contrasting singleton genes and duplicated genes. The conclusion of theanalyses was threefold: (i) positive selection affects diverse phylogenetic branches anddiverse gene categories during vertebrate evolution; (ii) it concerns only a small proportion ofsites (1%-5%); and (iii) whole genome duplication had no detectable impact on theprevalence of this positive selection.I also studied evolution at the amino acid level with different methods to detect functionalshifts (covarion process and constant-but-different process). As in my previous research, Ifound similar numbers of functional shifts between duplicates and between orthologs.The accepted framework for studies of molecular evolution is that orthologs share the samefunction, whereas the function of paralogs diverges. This framework gives a special place togene duplication in evolution, as the main mechanism for generating novelty. With myprevious results showing that duplication and speciation are not so different, we investigatedthe literature to question the evidence for similar or divergent evolution of gene function afterduplication relative to speciation genes. This led us to propose a more rigorous design offuture studies of gene duplication.Finally, based on my automated protocol, we built a database of positive selection invertebrates' genes, Selectome. This database is freely available on the web and will helpfuture evolutionary as well as biochemical studies.