123 results for duplication


Relevance:

10.00% 10.00%

Publisher:

Abstract:

Developmental constraints have been postulated to limit the space of feasible phenotypes and thus shape animal evolution. These constraints have been suggested to be the strongest during either early or mid-embryogenesis, which corresponds to the early conservation model or the hourglass model, respectively. Conflicting results have been reported, but in recent studies of animal transcriptomes the hourglass model has been favored. Studies usually report descriptive statistics calculated for all genes over all developmental time points. This introduces dependencies between the sets of compared genes and may lead to biased results. Here we overcome this problem using an alternative modular analysis. We used the Iterative Signature Algorithm to identify distinct modules of genes co-expressed specifically in consecutive stages of zebrafish development. We then performed a detailed comparison of several gene properties between modules, allowing for a less biased and more powerful analysis. Notably, our analysis corroborated the hourglass pattern at the regulatory level, with sequences of regulatory regions being most conserved for genes expressed in mid-development, but not at the level of gene sequence, age, or expression, in contrast to some previous studies. The early conservation model was supported by gene duplication and gene birth, which were rarest for genes expressed in early development. Finally, for all gene properties, we observed the least conservation for genes expressed in late development or adult, consistent with both models. Overall, with the modular approach, we showed that different levels of molecular evolution follow different patterns of developmental constraints. Thus both models are valid, but with respect to different genomic features.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

During my PhD, my aim was to provide new tools to increase our capacity to analyse gene expression patterns, and to study the evolution of gene expression in animals on a large scale. Gene expression patterns (when and where a gene is expressed) are a key feature in understanding gene function, notably in development. It now appears clear that the evolution of developmental processes and of phenotypes is shaped by evolution both at the coding sequence level and at the gene expression level. Studying gene expression evolution in animals, with complex expression patterns over tissues and developmental time, is still challenging. No tools are available to routinely and precisely compare expression patterns between species on a large scale. Studies on gene expression evolution are therefore performed only on small gene datasets, or using imprecise descriptions of expression patterns. The aim of my PhD was thus to develop and use novel bioinformatics resources to study the evolution of gene expression. To this end, I developed the database Bgee (Base for Gene Expression Evolution). The approach of Bgee is to transform heterogeneous expression data (ESTs, microarrays, and in situ hybridizations) into present/absent calls, and to annotate them to standard representations of the anatomy and development of different species (anatomical ontologies). An extensive mapping between the anatomies of species is then developed based on hypotheses of homology. These precise annotations to anatomies, and this extensive mapping between species, are the major assets of Bgee, and have required the involvement of many co-workers over the years. My main personal contribution is the development and management of both the Bgee database and the web application. Bgee is now on its ninth release, and includes a substantial gene expression dataset for 5 species (human, mouse, Drosophila, zebrafish, Xenopus), with the most data from mouse, human and zebrafish.
Using these three species, I have conducted an analysis of gene expression evolution after duplication in vertebrates. Gene duplication is thought to be a major source of novelty in evolution, and to contribute to speciation. It has been suggested that the evolution of gene expression patterns might contribute to the retention of duplicate genes. I performed a large-scale comparison of the expression patterns of hundreds of duplicated genes to those of their singleton ortholog in an outgroup, including both small- and large-scale duplicates, in three vertebrate species (human, mouse and zebrafish), using highly accurate descriptions of expression patterns. My results showed unexpectedly high rates of de novo acquisition of expression domains after duplication (neofunctionalization), at least as high as the rates of partitioning of expression domains (subfunctionalization). I found differences in the evolution of expression of small- and large-scale duplicates, with small-scale duplicates more prone to neofunctionalization. Duplicates with neofunctionalization seemed to evolve under more relaxed selective pressure on the coding sequence. Finally, even with abundant and precise expression data, the majority fate I recovered was neither neo- nor subfunctionalization of expression domains, suggesting a major role for other mechanisms in duplicate gene retention.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Cospeciation between host and parasite species is generally thought to result in mirror-image congruent phylogenies. Incongruence can be explained by mechanisms such as host switching, duplication, failure to speciate and sorting events. To investigate the level of association in the host-parasite relationship between Spinturnicid mites and their bat hosts, we constructed the phylogenetic tree of the genus Spinturnix (Acari, Mesostigmata) and compared it to the host phylogeny. We sequenced 938 bp of the mitochondrial 16S rDNA and Cytochrome Oxidase subunit I (COI) genes among eleven morphospecies of Spinturnix collected on 20 European Vespertilionid and Rhinolophid bat species. Phylogenetic reconstruction of hosts and parasites showed statistical evidence for cospeciation and suggested that their evolutionary history also involved failure-to-speciate events and host switches. The latter seem to be mainly promoted by similar roosting habits of the hosts. As currently understood, host associations of Spinturnicid mites likely result from a complex interaction between the phylogenetic history of the host and the behaviour and ecology of both parasite and host.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Ever since the pre-molecular era, the birth of new genes with novel functions has been considered to be a major contributor to adaptive evolutionary innovation. Here, I review the origin and evolution of new genes and their functions in eukaryotes, an area of research that has made rapid progress in the past decade thanks to the genomics revolution. Indeed, recent work has provided initial whole-genome views of the different types of new genes for a large number of different organisms. The array of mechanisms underlying the origin of new genes is compelling, extending way beyond the traditionally well-studied source of gene duplication. Thus, it was shown that novel genes also regularly arose from messenger RNAs of ancestral genes, protein-coding genes metamorphosed into new RNA genes, genomic parasites were co-opted as new genes, and that both protein and RNA genes were composed from scratch (i.e., from previously nonfunctional sequences). These mechanisms then also contributed to the formation of numerous novel chimeric gene structures. Detailed functional investigations uncovered different evolutionary pathways that led to the emergence of novel functions from these newly minted sequences and, with respect to animals, attributed a potentially important role to one specific tissue--the testis--in the process of gene birth. Remarkably, these studies also demonstrated that novel genes of the various types significantly impacted the evolution of cellular, physiological, morphological, behavioral, and reproductive phenotypic traits. Consequently, it is now firmly established that new genes have indeed been major contributors to the origin of adaptive evolutionary novelties.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Evolution through natural selection suggests unnecessary genes are lost. We observed that the yeast Candida glabrata lost the gene encoding a phosphate-repressible acid phosphatase (PHO5) present in many yeasts including Saccharomyces cerevisiae. However, C. glabrata still had phosphate starvation-inducible phosphatase activity. Screening a C. glabrata genomic library, we identified CgPMU2, a member of a three-gene family that contains a phosphomutase-like domain. This small-scale gene duplication event could allow for sub- or neofunctionalization. On the basis of phylogenetic and biochemical characterizations, CgPMU2 has neofunctionalized to become a broad range, phosphate starvation-regulated acid phosphatase, which functionally replaces PHO5 in this pathogenic yeast. We determined that CgPmu2, unlike ScPho5, is not able to hydrolyze phytic acid (inositol hexakisphosphate). Phytic acid is present in fruits and seeds where S. cerevisiae grows, but is not abundant in mammalian tissues where C. glabrata grows. We demonstrated that C. glabrata is limited from an environment where phytic acid is the only source of phosphate. Our work suggests that during evolutionary time, the selection for the ancestral PHO5 was lost and that C. glabrata neofunctionalized a weak phosphatase to replace PHO5. Convergent evolution of a phosphate starvation-inducible acid phosphatase in C. glabrata relative to most yeast species provides an example of how small changes in signal transduction pathways can mediate genetic isolation and uncovers a potential speciation gene.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Fanconi anemia is a genetically heterogeneous disorder associated with chromosome instability and a highly elevated risk for developing cancer. The mutated genes encode proteins involved in the cellular response to DNA replication stress. Fanconi anemia proteins are extensively connected with DNA caretaker proteins, and appear to function as a hub for the coordination of DNA repair with DNA replication and cell cycle progression. At a molecular level, however, the raison d'être of Fanconi anemia proteins still remains largely elusive. The thirteen Fanconi anemia proteins identified to date have not been embraced into a single and defined biological process. To help put the Fanconi anemia puzzle into perspective, we begin this review with a summary of the strategies employed by prokaryotes and eukaryotes to tolerate obstacles to the progression of replication forks. We then summarize what we know about Fanconi anemia with an emphasis on biochemical aspects, and discuss how the Fanconi anemia network, a late acquisition in evolution, may function to permit the faithful and complete duplication of our very large vertebrate chromosomes.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Medulloblastoma, the most common malignant paediatric brain tumour, is currently treated with nonspecific cytotoxic therapies including surgery, whole-brain radiation, and aggressive chemotherapy. As medulloblastoma exhibits marked intertumoural heterogeneity, with at least four distinct molecular variants, previous attempts to identify targets for therapy have been underpowered because of small sample sizes. Here we report somatic copy number aberrations (SCNAs) in 1,087 unique medulloblastomas. SCNAs are common in medulloblastoma, and are predominantly subgroup-enriched. The most common region of focal copy number gain is a tandem duplication of SNCAIP, a gene associated with Parkinson's disease, which is exquisitely restricted to Group 4α. Recurrent translocations of PVT1, including PVT1-MYC and PVT1-NDRG1, that arise through chromothripsis are restricted to Group 3. Numerous targetable SCNAs, including recurrent events targeting TGF-β signalling in Group 3, and NF-κB signalling in Group 4, suggest future avenues for rational, targeted therapy.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Arising from either retrotransposition or genomic duplication of functional genes, pseudogenes are "genomic fossils" valuable for exploring the dynamics and evolution of genes and genomes. Pseudogene identification is an important problem in computational genomics, and is also critical for obtaining an accurate picture of a genome's structure and function. However, no consensus computational scheme for defining and detecting pseudogenes has been developed thus far. As part of the ENCyclopedia Of DNA Elements (ENCODE) project, we have compared several distinct pseudogene annotation strategies and found that different approaches and parameters often resulted in rather distinct sets of pseudogenes. We subsequently developed a consensus approach for annotating pseudogenes (derived from protein-coding genes) in the ENCODE regions, resulting in 201 pseudogenes, two-thirds of which originated from retrotransposition. A survey of orthologs for these pseudogenes in 28 vertebrate genomes showed that a significant fraction (approximately 80%) of the processed pseudogenes are primate-specific sequences, highlighting the increasing retrotransposition activity in primates. Analysis of sequence conservation and variation also demonstrated that most pseudogenes evolve neutrally, and processed pseudogenes appear to have lost their coding potential immediately or soon after their emergence. In order to explore the functional implication of pseudogene prevalence, we have extensively examined the transcriptional activity of the ENCODE pseudogenes. We performed a systematic series of pseudogene-specific RACE analyses. These, together with complementary evidence derived from tiling microarrays and high-throughput sequencing, demonstrated that at least a fifth of the 201 pseudogenes are transcribed in one or more cell lines or tissues.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

SUMMARY We report the study of a family of 49 members over 5 generations. Among the 35 members studied, 18 have familial expansive osteolysis (FEO). FEO is a rare, autosomal dominant genetic bone dysplasia whose local and general skeletal alterations have a predominantly peripheral distribution that becomes manifest from the second decade of life. Progressive osteoclastic resorption, accompanied by weak osteoblastic activity, gives rise to medullary expansion of the bone. This expansion is characterized by rarefaction of the bone marrow, which is replaced by fibrous tissue and fat. Thinning of the cortical bone leads to severe, painful, disabling skeletal deformities, with a tendency to spontaneous fractures. The first clinical manifestation of the disease is a very early conductive deafness resulting from lysis of the ossicular chain. Radiologically, there is always marked pneumatization of the mastoid and petrous bone. The teeth show substantial signs of resorption in the apical and/or cervical region, with a characteristic and unique appearance. Serum alkaline phosphatase and urinary hydroxyproline and deoxypyridinoline are elevated to varying degrees. Calcium and parathyroid hormone levels are normal. Treatment with diphosphonates, calcitonin and vitamin D is ineffective. Histologically, FEO resembles Paget's disease, but the age of onset, the distribution of the bone lesions, the dental and middle ear alterations, and the clinical progression are different. These features likewise distinguish FEO from fibrous dysplasia, osteitis fibrosa cystica and osteogenesis imperfecta. The gene responsible for the disease maps to chromosome region 18q21-22. Recently, mutations in TNFRSF11A, the gene encoding RANK, were identified as the cause of FEO. A duplication of 18 base pairs in exon 1 suggests that this corresponds to the site of the anomaly. The surgical technique and the short- and long-term audiometric results of 13 interventions in 8 patients are presented.
ABSTRACT Objectives: Familial expansive osteolysis (FEO) is a rare autosomal dominant bone dysplasia. The disease can show general and focal skeletal alterations, the latter having a predominantly peripheral distribution. Onset occurs after the second decade of life. Patients and methods: We present a 30-year study of a family consisting of 49 members covering five generations. Results: Among the 35 members studied, 18 have FEO. The first clinical sign of the condition is conductive deafness at an early age. The teeth have a unique and characteristic appearance. Thinning of the cortical bone leads to severe, painful, disabling deformities. Serum alkaline phosphatase and urinary hydroxyproline and deoxypyridinoline are elevated. Calcium and parathyroid hormone are normal. Treatment with diphosphonates, calcitonin and vitamin D has been unsuccessful. We present the surgical technique and the short- and long-term results of 13 interventions on 8 patients. Conclusion: Progressive osteoclastic resorption accompanied by weak osteoblastic activity results in medullary expansion characterized by rarefaction of the bone marrow, which is replaced by fibrous tissue and fat. FEO is histologically similar to Paget's disease, but the age of onset, the distribution of the bone lesions, the dental and middle ear alterations, and the clinical progression are different. These features also differentiate FEO from fibrous dysplasia, osteitis fibrosa cystica and osteogenesis imperfecta. The gene responsible for FEO is located in the 18q21-22 chromosome region.
Mutations in TNFRSF11A, the gene encoding receptor activator of nuclear factor-kappa-B (RANK), have recently been identified as the cause of FEO. A duplication of 18 base pairs in exon 1 of the TNFRSF11A gene suggests that this corresponds to the site of the anomaly and can be considered a "hot spot" for mutations.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

The focus of my PhD research was the concept of modularity. In the last 15 years, modularity has become a classic term in different fields of biology. On the conceptual level, a module is a set of interacting elements that remain mostly independent from the elements outside of the module. I used modular analysis techniques to study gene expression evolution in vertebrates. In particular, I identified "natural" modules of gene expression in mouse and human, and I showed that expression of organ-specific and system-specific genes tends to be conserved between vertebrates as distant as mammals and fishes. Also with a modular approach, I studied patterns of developmental constraints on transcriptome evolution. I showed that neither of the two commonly accepted models of the evolution of embryonic development ("evo-devo") is exclusively valid. In particular, I found that the conservation of the sequences of regulatory regions is highest during mid-development of zebrafish, which supports the "hourglass model". In contrast, events of gene duplication and new gene introduction are rarest in early development, which supports the "early conservation model". In addition to the biological insights on transcriptome evolution, I have also discussed in detail the advantages of modular approaches in large-scale data analysis. Moreover, I re-analyzed several studies (published in high-ranking journals), and showed that their conclusions do not hold up under a detailed analysis. This demonstrates that complex analysis of high-throughput data requires cooperation between biologists, bioinformaticians, and statisticians.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Abstract: A preliminary understanding of the phenotypic effect of copy number variation (CNV) of DNA segments is emerging. These rearrangements were shown to influence, in a somewhat dose-dependent manner, the expression of genes mapping within them. They were also shown to modify the expression of genes located on their flanks, sometimes at great distance. Here, we demonstrate, by monitoring these effects at multiple life stages, that these controls over expression are effective throughout mouse development. Similarly, we observe that the more specific spatial expression patterns of CNV genes are maintained through life. However, we find that some brain-expressed genes mapping within CNVs appear to be under compensatory loops only at specific time points, indicating that the effect of CNVs on these genes is modulated during development. Notably, we also observe that CNV genes are significantly enriched within transcripts that show variable time-course expression between strains. Thus, modifying the copy number of a gene may potentially alter not only its expression level, but its timing of expression as well. Summary: We are beginning to understand the phenotypic effects linked to DNA sequences whose copy number changes from one individual to another. Previous work has shown that these copy number variants (CNVs) influence the expression not only of genes located within the rearrangement, but also of genes located at some distance from it. The present work studies these effects at different stages of mouse development, from the two-week embryo to the adult mouse. We observed that some genes expressed in the brain appear to be under stricter control at certain developmental stages, suggesting that the effect of CNVs is modulated differently over the course of life.
Our work on three different mouse strains showed that genes whose expression profile over time differs between strains are enriched in genes located within CNVs. This leads us to think that CNVs influence not only the expression level of genes, but also the times at which they are expressed. Summary for a general audience: Many diseases are due either to a gain of genetic material (a duplication) or to a loss of it (a deletion). Although research aimed at identifying the molecular mechanisms linked to these rearrangements of our genome is progressing continually, most causes of genetic diseases remain to be elucidated. Some parts of our genome are present in a number of copies that differs from one individual to another without causing any disease. These DNA segments that vary in number are called copy number variants (CNVs). They cover about 12% of our genetic material. Studies in various animal models have shown that CNVs influence both the genes inside the CNVs and those in their vicinity. This work studies the effect of CNVs across different stages of mouse development. We demonstrated that DNA segments that vary in copy number have variable effects depending on the stage at which they are measured. Thus, CNVs not only affect the expression of the genes in these regions and in their vicinity, but also influence their expression profiles over time.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

The replication of circular DNA faces topological obstacles that need to be overcome to allow the complete duplication and separation of newly replicated molecules. Small bacterial plasmids provide a perfect model system to study the interplay between DNA helicases, polymerases, topoisomerases and the overall architecture of partially replicated molecules. Recent studies have shown that partially replicated circular molecules have an amazing ability to form various types of structures (supercoils, precatenanes, knots and catenanes) that help to accommodate the dynamic interplay between duplex unwinding at the replication fork and DNA unlinking by topoisomerases.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

In plants, an oligogene family encodes NADP-malic enzymes (NADP-me), which are responsible for various functions and exhibit different kinetics and expression patterns. In particular, a chloroplast isoform of NADP-me plays a key role in one of the three biochemical subtypes of C4 photosynthesis, an adaptation to warm environments that evolved several times independently during angiosperm diversification. By combining genomic and phylogenetic approaches, this study aimed at identifying the molecular mechanisms linked to the recurrent evolutions of C4-specific NADP-me in grasses (Poaceae). Genes encoding NADP-me (nadpme) were retrieved from genomes of model grasses and isolated from a large sample of C3 and C4 grasses. Genomic and phylogenetic analyses showed that 1) the grass nadpme gene family is composed of four main lineages, one of which is expressed in plastids (nadpme-IV), 2) C4-specific NADP-me evolved at least five times independently from nadpme-IV, and 3) some codons driven by positive selection underwent parallel changes during the multiple C4 origins. The C4 NADP-me being expressed in chloroplasts probably constrained its recurrent evolutions from the only plastid nadpme lineage and this common starting point limited the number of evolutionary paths toward a C4 optimized enzyme, resulting in genetic convergence. In light of the history of nadpme genes, an evolutionary scenario of the C4 phenotype using NADP-me is discussed.

Relevance:

10.00% 10.00%

Publisher:

Abstract:

Structural variation, whether it is caused by copy number variants or present in a balanced form, such as reciprocal translocations and inversions, can have a profound effect on the expression of genes mapping within and close to the rearrangement, as well as affecting others genome-wide. These effects can result from altering the copy number of one or more genes or regulatory elements (dosage effect) or from physical disruption of links between regulatory elements and their associated gene or genes, resulting in perturbation of expression. Similarly, large-scale structural variants can result in genome-wide expression changes by altering the positions that chromosomes occupy within the nucleus, potentially disrupting not only local cis interactions, but also trans interactions that occur throughout the genome. Structural variation is, therefore, a significant factor in the study of gene expression and is discussed here in more detail.