618 resultados para no duplication
Resumo:
In addition to differences in protein-coding gene sequences, changes in expression resulting from mutations in regulatory sequences have long been hypothesized to be responsible for phenotypic differences between species. However, unlike comparison of genome sequences, few studies, generally restricted to pairwise comparisons of closely related mammalian species, have assessed between-species differences at the transcriptome level. They reported that gene expression evolves at different rates in various organs and in a pattern that is overall consistent with neutral models of evolution. In the first part of my thesis, I investigated the evolution of gene expression in therian mammals (i.e.7 placental and marsupials), based on microarray data from human, mouse and the gray short-tailed opossum (Monodelphis domestica). In addition to autosomal genes, a special focus was given to the evolution of X-linked genes. The therian X chromosome was recently shown to be younger than previously thought and to harbor a specific gene content (e.g., genes involved in brain or reproductive functions) that is thought to have been shaped by specific sex-related evolutionary forces. Sex chromosomes derive from ordinary autosomes and their differentiation led to the degeneration of the Y chromosome (in mammals) or W chromosome (in birds). Consequently, X- or Z-linked genes differ in gene dose between males and females such that the heterogametic sex has half the X/Z gene dose compared to the ancestral state. To cope with this dosage imbalance, mammals have been reported to have evolved mechanisms of dosage compensation.¦In the first project, I could first show that transcriptomes evolve at different rates in different organs. Out of the five tissues I investigated, the testis is the most rapidly evolving organ at the gene expression level while the brain has the most conserved transcriptome. Second, my analyses revealed that mammalian gene expression evolution is compatible with a neutral model, where the rates of change in gene expression levels is linked to the efficiency of purifying selection in a given lineage, which, in turn, is determined by the long-term effective population size in that lineage. Thus, the rate of DNA sequence evolution, which could be expected to determine the rate of regulatory sequence change, does not seem to be a major determinant of the rate of gene expression evolution. Thus, most gene expression changes seem to be (slightly) deleterious. Finally, X-linked genes seem to have experienced elevated rates of gene expression change during the early stage of X evolution. To further investigate the evolution of mammalian gene expression, we generated an extensive RNA-Seq gene expression dataset for nine mammalian species and a bird. The analyses of this dataset confirmed the patterns previously observed with microarrays and helped to significantly deepen our view on gene expression evolution.¦In a specific project based on these data, I sought to assess in detail patterns of evolution of dosage compensation in amniotes. My analyses revealed the absence of male to female dosage compensation in monotremes and its presence in marsupials and, in addition, confirmed patterns previously described for placental mammals and birds. I then assessed the global level of expression of X/Z chromosomes and contrasted this with its ancestral gene expression levels estimated from orthologous autosomal genes in species with non-homologous sex chromosomes. This analysis revealed a lack of up-regulation for placental mammals, the level of expression of X-linked genes being proportional to gene dose. Interestingly, the ancestral gene expression level was at least partially restored in marsupials as well as in the heterogametic sex of monotremes and birds. Finally, I investigated alternative mechanisms of dosage compensation and found that gene duplication did not seem to be a widespread mechanism to restore the ancestral gene dose. However, I could show that placental mammals have preferentially down-regulated autosomal genes interacting with X-linked genes which underwent gene expression decrease, and thus identified a novel alternative mechanism of dosage compensation.
Resumo:
Vitellogenin is synthesized under estrogen control in the liver, extensively modified, transported to the ovary, and there processed to the yolk proteins lipovitellin and phosvitin. In the frog Xenopus laevis there are at least four distinct but related vitellogenin genes. The two genes A1 and A2 have a 95 percent sequence homology in their messenger RNA coding regions, and contain 33 introns that interrupt the coding region (exons) at homologous positions. Sequences and lengths of analogous introns differ, and many introns contain repetitive DNA elements. The introns in these two genes that have apparently arisen by duplication have diverged extensively by events that include deletions, insertions, and probably duplications. Rapid evolutionary change involving rearrangements and the presence of repeated DNA suggests that the bulk of the sequences within introns may not have any specific function.
Resumo:
Background: The RPS4 gene codifies for ribosomal protein S4, a very well-conserved protein present in all kingdoms. In primates, RPS4 is codified by two functional genes located on both sex chromosomes: the RPS4X and RPS4Y genes. In humans, RPS4Y is duplicated and the Y chromosome therefore carries a third functional paralog: RPS4Y2, which presents a testis-specific expression pattern. Results: DNA sequence analysis of the intronic and cDNA regions of RPS4Y genes from species covering the entire primate phylogeny showed that the duplication event leading to the second Y-linked copy occurred after the divergence of New World monkeys, about 35 million years ago. Maximum likelihood analyses of the synonymous and non-synonymous substitutions revealed that positive selection was acting on RPS4Y2 gene in the human lineage, which represents the first evidence of positive selection on a ribosomal protein gene. Putative positive amino acid replacements affected the three domains of the protein: one of these changes is located in the KOW protein domain and affects the unique invariable position of this motif, and might thus have a dramatic effect on the protein function.Conclusion: Here, we shed new light on the evolutionary history of RPS4Y gene family, especially on that of RPS4Y2. The results point that the RPS4Y1 gene might be maintained to compensate gene dosage between sexes, while RPS4Y2 might have acquired a new function, at least in the lineage leading to humans.
Resumo:
Aphids are important agricultural pests and also biological models for studies of insect-plant interactions, symbiosis, virus vectoring, and the developmental causes of extreme phenotypic plasticity. Here we present the 464 Mb draft genome assembly of the pea aphid Acyrthosiphon pisum. This first published whole genome sequence of a basal hemimetabolous insect provides an outgroup to the multiple published genomes of holometabolous insects. Pea aphids are host-plant specialists, they can reproduce both sexually and asexually, and they have coevolved with an obligate bacterial symbiont. Here we highlight findings from whole genome analysis that may be related to these unusual biological features. These findings include discovery of extensive gene duplication in more than 2000 gene families as well as loss of evolutionarily conserved genes. Gene family expansions relative to other published genomes include genes involved in chromatin modification, miRNA synthesis, and sugar transport. Gene losses include genes central to the IMD immune pathway, selenoprotein utilization, purine salvage, and the entire urea cycle. The pea aphid genome reveals that only a limited number of genes have been acquired from bacteria; thus the reduced gene count of Buchnera does not reflect gene transfer to the host genome. The inventory of metabolic genes in the pea aphid genome suggests that there is extensive metabolite exchange between the aphid and Buchnera, including sharing of amino acid biosynthesis between the aphid and Buchnera. The pea aphid genome provides a foundation for post-genomic studies of fundamental biological questions and applied agricultural problems.
Resumo:
Background: The degree of metal binding specificity in metalloproteins such as metallothioneins (MTs) can be crucial for their functional accuracy. Unlike most other animal species, pulmonate molluscs possess homometallic MT isoforms loaded with Cu+ or Cd2+. They have, so far, been obtained as native metal-MT complexes from snail tissues, where they are involved in the metabolism of the metal ion species bound to the respective isoform. However, it has not as yet been discerned if their specific metal occupation is the result of a rigid control of metal availability, or isoform expression programming in the hosting tissues or of structural differences of the respective peptides determining the coordinative options for the different metal ions. In this study, the Roman snail (Helix pomatia) Cu-loaded and Cd-loaded isoforms (HpCuMT and HpCdMT) were used as model molecules in order t o elucidate the biochemical and evolutionary mechanisms permitting pulmonate MTs to achieve specificity for their cognate metal ion. Results: HpCuMT and HpCdMT were recombinantly synthesized in the presence of Cd2+, Zn2+ or Cu2+ and corresponding metal complexes analysed by electrospray mass spectrometry and circular dichroism (CD) and ultra violet-visible (UV-Vis) spectrophotometry. Both MT isoforms were only able to form unique, homometallic and stable complexes (Cd6-HpCdMT and Cu12-HpCuMT) with their cognate metal ions. Yeast complementation assays demonstrated that the two isoforms assumed metal-specific functions, in agreement with their binding preferences, in heterologous eukaryotic environments. In the snail organism, the functional metal specificity of HpCdMT and HpCuMT was contributed by metal-specific transcription programming and cell-specific expression. Sequence elucidation and phylogenetic analysis of MT isoforms from a number of snail species revealed that they possess an unspecific and two metal-specific MT isoforms, whose metal specificity was achieved exclusively by evolutionary modulation of non-cysteine amino acid positions. Conclusion: The Roman snail HpCdMT and HpCuMT isoforms can thus be regarded as prototypes of isoform families that evolved genuine metal-specificity within pulmonate molluscs. Diversification into these isoforms may have been initiated by gene duplication, followed by speciation and selection towards opposite needs for protecting copper-dominated metabolic pathways from nonessential cadmium. The mechanisms enabling these proteins to be metal-specific could also be relevant for other metalloproteins.
Resumo:
Background: Hox and ParaHox gene clusters are thought to have resulted from the duplication of a ProtoHox gene cluster early in metazoan evolution. However, the origin and evolution of the other genes belonging to the extended Hox group of homeobox-containing genes, that is, Mox and Evx, remains obscure. We constructed phylogenetic trees with mouse, amphioxus and Drosophila extended Hox and other related Antennapedia-type homeobox gene sequences and analyzed the linkage data available for such genes.Results: We claim that neither Mox nor Evx is a Hox or ParaHox gene. We propose a scenariothat reconciles phylogeny with linkage data, in which an Evx/Mox ancestor gene linked to aProtoHox cluster was involved in a segmental tandem duplication event that generated an arrayof all Hox-like genes, referred to as the `coupled¿ cluster. A chromosomal breakage within thiscluster explains the current composition of the extended Hox cluster (with Evx, Hox and Moxgenes) and the ParaHox cluster.Conclusions: Most studies dealing with the origin and evolution of Hox and ParaHox clustershave not included the Hox-related genes Mox and Evx. Our phylogenetic analyses and theavailable linkage data in mammalian genomes support an evolutionary scenario in which anancestor of Evx and Mox was linked to the ProtoHox cluster, and that a tandem duplication of alarge genomic region early in metazoan evolution generated the Hox and ParaHox clusters, plusthe cluster-neighbors Evx and Mox. The large `coupled¿ Hox-like cluster EvxHox/MoxParaHox wassubsequently broken, thus grouping the Mox and Evx genes to the Hox clusters, and isolating theParaHox cluster.
Resumo:
Background: Chemoreception is a widespread mechanism that is involved in critical biologic processes, including individual and social behavior. The insect peripheral olfactory system comprises three major multigene families: the olfactory receptor (Or), the gustatory receptor (Gr), and the odorant-binding protein (OBP) families. Members of the latter family establish the first contact with the odorants, and thus constitute the first step in the chemosensory transduction pathway.Results: Comparative analysis of the OBP family in 12 Drosophila genomes allowed the identification of 595 genes that encode putative functional and nonfunctional members in extant species, with 43 gene gains and 28 gene losses (15 deletions and 13 pseudogenization events). The evolution of this family shows tandem gene duplication events, progressive divergence in DNA and amino acid sequence, and prevalence of pseudogenization events in external branches of the phylogenetic tree. We observed that the OBP arrangement in clusters is maintained across the Drosophila species and that purifying selection governs the evolution of the family; nevertheless, OBP genes differ in their functional constraints levels. Finally, we detect that the OBP repertoire evolves more rapidly in the specialist lineages of the Drosophila melanogaster group (D. sechellia and D. erecta) than in their closest generalists.Conclusion: Overall, the evolution of the OBP multigene family is consistent with the birth-and-death model. We also found that members of this family exhibit different functional constraints, which is indicative of some functional divergence, and that they might be involved in some of the specialization processes that occurred through the diversification of the Drosophila genus.
Resumo:
Developmental constraints have been postulated to limit the space of feasible phenotypes and thus shape animal evolution. These constraints have been suggested to be the strongest during either early or mid-embryogenesis, which corresponds to the early conservation model or the hourglass model, respectively. Conflicting results have been reported, but in recent studies of animal transcriptomes the hourglass model has been favored. Studies usually report descriptive statistics calculated for all genes over all developmental time points. This introduces dependencies between the sets of compared genes and may lead to biased results. Here we overcome this problem using an alternative modular analysis. We used the Iterative Signature Algorithm to identify distinct modules of genes co-expressed specifically in consecutive stages of zebrafish development. We then performed a detailed comparison of several gene properties between modules, allowing for a less biased and more powerful analysis. Notably, our analysis corroborated the hourglass pattern at the regulatory level, with sequences of regulatory regions being most conserved for genes expressed in mid-development but not at the level of gene sequence, age, or expression, in contrast to some previous studies. The early conservation model was supported with gene duplication and birth that were the most rare for genes expressed in early development. Finally, for all gene properties, we observed the least conservation for genes expressed in late development or adult, consistent with both models. Overall, with the modular approach, we showed that different levels of molecular evolution follow different patterns of developmental constraints. Thus both models are valid, but with respect to different genomic features.
Resumo:
During my PhD, my aim was to provide new tools to increase our capacity to analyse gene expression patterns, and to study on a large-scale basis the evolution of gene expression in animals. Gene expression patterns (when and where a gene is expressed) are a key feature in understanding gene function, notably in development. It appears clear now that the evolution of developmental processes and of phenotypes is shaped both by evolution at the coding sequence level, and at the gene expression level.Studying gene expression evolution in animals, with complex expression patterns over tissues and developmental time, is still challenging. No tools are available to routinely compare expression patterns between different species, with precision, and on a large-scale basis. Studies on gene expression evolution are therefore performed only on small genes datasets, or using imprecise descriptions of expression patterns.The aim of my PhD was thus to develop and use novel bioinformatics resources, to study the evolution of gene expression. To this end, I developed the database Bgee (Base for Gene Expression Evolution). The approach of Bgee is to transform heterogeneous expression data (ESTs, microarrays, and in-situ hybridizations) into present/absent calls, and to annotate them to standard representations of anatomy and development of different species (anatomical ontologies). An extensive mapping between anatomies of species is then developed based on hypothesis of homology. These precise annotations to anatomies, and this extensive mapping between species, are the major assets of Bgee, and have required the involvement of many co-workers over the years. My main personal contribution is the development and the management of both the Bgee database and the web-application.Bgee is now on its ninth release, and includes an important gene expression dataset for 5 species (human, mouse, drosophila, zebrafish, Xenopus), with the most data from mouse, human and zebrafish. Using these three species, I have conducted an analysis of gene expression evolution after duplication in vertebrates.Gene duplication is thought to be a major source of novelty in evolution, and to participate to speciation. It has been suggested that the evolution of gene expression patterns might participate in the retention of duplicate genes. I performed a large-scale comparison of expression patterns of hundreds of duplicated genes to their singleton ortholog in an outgroup, including both small and large-scale duplicates, in three vertebrate species (human, mouse and zebrafish), and using highly accurate descriptions of expression patterns. My results showed unexpectedly high rates of de novo acquisition of expression domains after duplication (neofunctionalization), at least as high or higher than rates of partitioning of expression domains (subfunctionalization). I found differences in the evolution of expression of small- and large-scale duplicates, with small-scale duplicates more prone to neofunctionalization. Duplicates with neofunctionalization seemed to evolve under more relaxed selective pressure on the coding sequence. Finally, even with abundant and precise expression data, the majority fate I recovered was neither neo- nor subfunctionalization of expression domains, suggesting a major role for other mechanisms in duplicate gene retention.
Resumo:
The 2012 Iowa Code section 324A.4, subsection 2, states the Iowa Department of Transportation (DOT) “shall biennially prepare a report to be submitted to the general assembly and the governor prior to December 15 of even-numbered years. The report shall recommend methods to increase transportation coordination and improve the efficiency of federal, state, and local government programs used to finance public transit services and may address other topics as appropriate.” Iowa has long been a leader in transportation coordination, from designated public transit agencies covering all 99 counties with little duplication, to requiring any agency receiving public dollars for the provision of transportation to first coordinate with the local public transit agency before providing the transportation on their own, to the creation of the Iowa Transportation Coordination Council. Coordination allows Iowa to provide much needed transportation services to the citizens of Iowa with the most efficient use of public funds. Coordination has been an important topic in Iowa for many years, but during these times of economic constraint and restraint and Iowa’s changing demographics, coordination of transportation services becomes even more critical.
Resumo:
Cospeciation between host-parasite species is generally thought to result in mirror-image congruent phylogenies. Incongruence can be explained by mechanisms such as host switching, duplication, failure to speciate and sorting events. To investigate the level of association in the host-parasite relationship between Spinturnicid mites and their bat hosts, we constructed the phylogenetic tree of the genus Spinturnix (Acari, Mesostigmata) and compared it to the host phylogeny. We sequenced 938bp of the mitochondrial 16S rDNA and Cytochrome Oxydase subunit I (COI) genes among eleven morphospecies of Spinturnix collected on 20 European Vespertilionid and Rhinolophid bat species. Phylogenetic reconstruction of hosts and parasites showed statistical evidence for cospeciation and suggested that their evolutionary history involved also failure to speciate events and host switches. The latter seem to be mainly promoted by similar roosting habits of the host. As currently understood, host associations of Spinturnicid mites likely results from a complex interaction between the phylogenetic history of the host and the behaviour and the ecology of both parasite and host.
Resumo:
Ever since the pre-molecular era, the birth of new genes with novel functions has been considered to be a major contributor to adaptive evolutionary innovation. Here, I review the origin and evolution of new genes and their functions in eukaryotes, an area of research that has made rapid progress in the past decade thanks to the genomics revolution. Indeed, recent work has provided initial whole-genome views of the different types of new genes for a large number of different organisms. The array of mechanisms underlying the origin of new genes is compelling, extending way beyond the traditionally well-studied source of gene duplication. Thus, it was shown that novel genes also regularly arose from messenger RNAs of ancestral genes, protein-coding genes metamorphosed into new RNA genes, genomic parasites were co-opted as new genes, and that both protein and RNA genes were composed from scratch (i.e., from previously nonfunctional sequences). These mechanisms then also contributed to the formation of numerous novel chimeric gene structures. Detailed functional investigations uncovered different evolutionary pathways that led to the emergence of novel functions from these newly minted sequences and, with respect to animals, attributed a potentially important role to one specific tissue--the testis--in the process of gene birth. Remarkably, these studies also demonstrated that novel genes of the various types significantly impacted the evolution of cellular, physiological, morphological, behavioral, and reproductive phenotypic traits. Consequently, it is now firmly established that new genes have indeed been major contributors to the origin of adaptive evolutionary novelties.
Resumo:
Evolution through natural selection suggests unnecessary genes are lost. We observed that the yeast Candida glabrata lost the gene encoding a phosphate-repressible acid phosphatase (PHO5) present in many yeasts including Saccharomyces cerevisiae. However, C. glabrata still had phosphate starvation-inducible phosphatase activity. Screening a C. glabrata genomic library, we identified CgPMU2, a member of a three-gene family that contains a phosphomutase-like domain. This small-scale gene duplication event could allow for sub- or neofunctionalization. On the basis of phylogenetic and biochemical characterizations, CgPMU2 has neofunctionalized to become a broad range, phosphate starvation-regulated acid phosphatase, which functionally replaces PHO5 in this pathogenic yeast. We determined that CgPmu2, unlike ScPho5, is not able to hydrolyze phytic acid (inositol hexakisphosphate). Phytic acid is present in fruits and seeds where S. cerevisiae grows, but is not abundant in mammalian tissues where C. glabrata grows. We demonstrated that C. glabrata is limited from an environment where phytic acid is the only source of phosphate. Our work suggests that during evolutionary time, the selection for the ancestral PHO5 was lost and that C. glabrata neofunctionalized a weak phosphatase to replace PHO5. Convergent evolution of a phosphate starvation-inducible acid phosphatase in C. glabrata relative to most yeast species provides an example of how small changes in signal transduction pathways can mediate genetic isolation and uncovers a potential speciation gene.
Resumo:
Fanconi anemia is a genetically heterogeneous disorder associated with chromosome instability and a highly elevated risk for developing cancer. The mutated genes encode proteins involved in the cellular response to DNA replication stress. Fanconi anemia proteins are extensively connected with DNA caretaker proteins, and appear to function as a hub for the coordination of DNA repair with DNA replication and cell cycle progression. At a molecular level, however, the raison d'être of Fanconi anemia proteins still remains largely elusive. The thirteen Fanconi anemia proteins identified to date have not been embraced into a single and defined biological process. To help put the Fanconi anemia puzzle into perspective, we begin this review with a summary of the strategies employed by prokaryotes and eukaryotes to tolerate obstacles to the progression of replication forks. We then summarize what we know about Fanconi anemia with an emphasis on biochemical aspects, and discuss how the Fanconi anemia network, a late acquisition in evolution, may function to permit the faithful and complete duplication of our very large vertebrate chromosomes.
Resumo:
Medulloblastoma, the most common malignant paediatric brain tumour, is currently treated with nonspecific cytotoxic therapies including surgery, whole-brain radiation, and aggressive chemotherapy. As medulloblastoma exhibits marked intertumoural heterogeneity, with at least four distinct molecular variants, previous attempts to identify targets for therapy have been underpowered because of small samples sizes. Here we report somatic copy number aberrations (SCNAs) in 1,087 unique medulloblastomas. SCNAs are common in medulloblastoma, and are predominantly subgroup-enriched. The most common region of focal copy number gain is a tandem duplication of SNCAIP, a gene associated with Parkinson's disease, which is exquisitely restricted to Group 4α. Recurrent translocations of PVT1, including PVT1-MYC and PVT1-NDRG1, that arise through chromothripsis are restricted to Group 3. Numerous targetable SCNAs, including recurrent events targeting TGF-β signalling in Group 3, and NF-κB signalling in Group 4, suggest future avenues for rational, targeted therapy.