949 results for Molecular evolution
Abstract:
Summary: During the evolutionary diversification of organisms, similar ecological constraints led to the recurrent appearance of the same traits (phenotypes) in distant lineages, a phenomenon called convergence. In most cases, the genetic origins of convergent traits remain unknown, but recent studies have traced convergent phenotypes to recurrent alterations of the same gene or, in a few cases, to identical genetic changes. However, these cases remain anecdotal, and there is a need for a study system that evolved several times independently and whose genetic determinism is well resolved and straightforward, such as C4 photosynthesis. This adaptation to warm environments, possibly driven by past decreases in atmospheric CO2, consists of a CO2-concentrating pump created by numerous morphological and biochemical novelties. All genes encoding C4 enzymes already existed in C3 ancestors and are thought to have been recruited through gene duplication followed by neo-functionalization, acquiring the cell-specific expression pattern and altered kinetic properties that characterize C4-specific enzymes. These predictions have so far been tested only in species-poor and ecologically marginal C4 dicots. The monocots, and especially the grass family (Poaceae), the most important C4 family in terms of species number, ecological dominance and economic importance, have largely been overlooked as study systems. This thesis aimed to understand the evolution of the C4 trait in grasses at the molecular level and to use the genetics of C4 photosynthesis to infer the evolutionary history of the C4 phenotype and the selective pressures driving it. A molecular phylogeny of grasses and affiliated monocots identified 17 to 18 independent acquisitions of the C4 pathway in the grass family.
A relaxed molecular clock was used to date these events, and the first C4 origin was estimated to have occurred in the Chloridoideae subfamily between 32 and 25 million years ago, a period when atmospheric CO2 declined abruptly. Likelihood models showed that the probability of evolving the C4 pathway increased strongly after the CO2 decline, confirming low CO2 as a likely driver of C4 photosynthesis evolution. To characterize the genetic changes linked to the numerous C4 origins, genes encoding phosphoenolpyruvate carboxylase (PEPC), the key enzyme responsible for the initial fixation of atmospheric CO2 in the C4 pathway, were isolated from a large sample of C3 and C4 grasses. Phylogenetic analyses were used to reconstruct the evolutionary history of the PEPC multigene family and showed that the evolution of C4-specific PEPC was driven by positive selection on 21 codons simultaneously in up to eight C4 lineages. These selective pressures led to numerous convergent genetic changes in many different C4 clades, highlighting the repeatability of some evolutionary processes, even at the molecular level. PEPC C4-adaptive changes were traced and used to demonstrate multiple appearances of the C4 pathway in clades where species-tree inferences could not distinguish multiple C4 appearances from a single appearance followed by C4-to-C3 reversion. Further investigation of genes involved in only some of the C4 subtypes (those encoding the decarboxylating enzymes NADP-malic enzyme and phosphoenolpyruvate carboxykinase) showed that these C4 enzymes also evolved through strong positive selection and underwent parallel genetic changes during the different C4 origins.
The adaptive changes in these subtype-specific C4 genes were used to retrace the history of the C4 subtype phenotypes, which revealed that the evolution of C4 PEPC and C4 decarboxylating enzymes was in several cases disconnected, emphasizing the multiplicity of the C4 trait and the gradual acquisition of the features that create the CO2 pump. Finally, phylogenetic analyses of a gene encoding Rubisco (the enzyme responsible for fixing CO2 into organic compounds in all photosynthetic organisms) showed that C4 evolution shifted the selective pressures on this gene. Five codons were recurrently mutated to adapt the enzyme kinetics to the high CO2 concentrations of C4 photosynthetic cells. This knowledge could be used to introgress C4-like Rubisco into C3 crops, which could increase yield under the high-CO2 atmosphere predicted for the future. Overall, the phylogenetic framework adopted in this thesis demonstrated the widespread occurrence of genetic convergence in C4-related enzymes. The genetic traces of C4 photosynthesis evolution made it possible to reconstruct events that happened during the last 30 million years and proved the usefulness of studying genes directly responsible for phenotypic variation when inferring the evolutionary history of a given trait. Résumé: During the evolutionary diversification of organisms, similar ecological pressures led to the recurrent appearance of certain traits (phenotypes) in distant lineages, a phenomenon called convergent evolution. In most cases, the genetic origin of convergent traits remains unknown, but recent studies have shown that they were due in some cases to repeated changes in the same gene or, in rare cases, to identical genetic changes. Nevertheless, these cases remain anecdotal, and there is a real need for a study system that has evolved independently many times and whose genetic determinism is clearly identified.
C4 photosynthesis meets these criteria. This adaptation to warm environments, whose evolution may have been promoted by past declines in atmospheric CO2 concentration, consists of numerous morphological and biochemical novelties that create a CO2 pump. All the genes encoding C4 enzymes were already present in C3 ancestors. Their recruitment for C4 photosynthesis is thought to have occurred through gene duplications followed by neo-functionalization, giving them the cell-specific expression and kinetic properties that characterize C4 enzymes. These predictions have so far been tested only in C4 families that contain few species and play a marginal ecological role. The grasses (Poaceae), the most important C4 family in terms of species number, ecological dominance and economic importance, have always been considered a poorly suited study system and have been the subject of few evolutionary investigations. The aim of this thesis was to understand the evolution of C4 photosynthesis in grasses at the genetic level and to use genes to infer the evolution of the C4 phenotype and the selective pressures responsible for it. A molecular phylogeny of the grass family and related monocots identified 17 to 18 independent acquisitions of C4 photosynthesis in grasses. Using a relaxed molecular clock method, these events were dated, and the first C4 origin was estimated to have occurred in the Chloridoideae subfamily 32 to 25 million years ago, a period when atmospheric CO2 concentrations declined abruptly.
Maximum-likelihood models showed that, following the CO2 decline, the probability of evolving C4 photosynthesis increased strongly, confirming that low CO2 concentration is a potential cause of the evolution of C4 photosynthesis. To identify the genetic mechanisms responsible for the repeated evolution of C4 photosynthesis, a segment of the genes encoding phosphoenolpyruvate carboxylase (PEPC), the enzyme responsible for the initial fixation of atmospheric CO2 in C4 plants, was sequenced in about one hundred C3 and C4 grasses. Phylogenetic analyses made it possible to reconstruct the evolutionary history of the PEPC multigene family and showed that the evolution of C4-specific PEPC was caused by positive selection acting on 21 codons, simultaneously in eight different C4 lineages. This positive selection led to a large number of convergent genetic changes in many different clades, illustrating the repeatability of some evolutionary phenomena, even at the genetic level. The C4-related changes in PEPC were used to confirm independent origins of the C4 phenotype in clades where the species tree could not distinguish independent appearances from a single appearance followed by a reversion from C4 to C3. By considering genes encoding proteins involved only in certain C4 subtypes (two decarboxylases, NADP-malic enzyme and phosphoenolpyruvate carboxykinase), further studies showed that these C4 enzymes had also evolved under strong positive selection and undergone parallel genetic changes during the different origins of C4 photosynthesis.
The adaptive changes in these genes, which are linked to only certain C4 subtypes, were used to retrace the history of the C4 subtype phenotypes, revealing that the characters making up the C4 trait evolved, in some cases, in a disconnected manner. This underlines the multiplicity of the C4 trait and the gradual acquisition of the components that contribute to the CO2 pump that is C4 photosynthesis. Finally, phylogenetic analyses of the genes encoding Rubisco (the enzyme responsible for fixing CO2 into organic carbon in all photosynthetic organisms) showed that the evolution of C4 photosynthesis changed the selective pressures on this gene. Five codons were repeatedly mutated to adapt the kinetic properties of Rubisco to the high CO2 concentrations present in the photosynthetic cells of C4 plants. Overall, the phylogenetic approach adopted in this doctoral thesis demonstrated frequent genetic convergence in enzymes linked to C4 photosynthesis. The genetic traces of the evolution of C4 photosynthesis made it possible to reconstruct events that occurred during the last 30 million years and proved the usefulness of studying genes directly responsible for phenotypic variation to infer the evolutionary history of a given trait.
Abstract:
SUMMARY: Almost all land plants form symbiotic associations with mycorrhizal fungi. These below-ground fungi play a key role in terrestrial ecosystems as they regulate nutrient and carbon cycles, and influence soil structure and ecosystem multifunctionality. Up to 80% of plant N and P is provided by mycorrhizal fungi and many plant species depend on these symbionts for growth and survival. Estimates suggest that there are c. 50 000 fungal species that form mycorrhizal associations with c. 250 000 plant species. The development of high-throughput molecular tools has helped us to better understand the biology, evolution, and biodiversity of mycorrhizal associations. Nuclear genome assemblies and gene annotations of 33 mycorrhizal fungal species are now available providing fascinating opportunities to deepen our understanding of the mycorrhizal lifestyle, the metabolic capabilities of these plant symbionts, the molecular dialogue between symbionts, and evolutionary adaptations across a range of mycorrhizal associations. Large-scale molecular surveys have provided novel insights into the diversity, spatial and temporal dynamics of mycorrhizal fungal communities. At the ecological level, network theory makes it possible to analyze interactions between plant-fungal partners as complex underground multi-species networks. Our analysis suggests that nestedness, modularity and specificity of mycorrhizal networks vary and depend on mycorrhizal type. Mechanistic models explaining partner choice, resource exchange, and coevolution in mycorrhizal associations have been developed and are being tested. This review ends with major frontiers for further research.
Abstract:
BACKGROUND: The exceptionally diverse species flocks of cichlid fishes in East Africa are prime examples of parallel adaptive radiations. About 80% of East Africa's more than 1 800 endemic cichlid species, and all species of the flocks of Lakes Victoria and Malawi, belong to a particularly rapidly evolving lineage, the haplochromines. One characteristic feature of the haplochromines is their possession of egg-dummies on the males' anal fins. These egg-spots mimic real eggs and play an important role in the mating system of these maternal mouthbrooding fish. RESULTS: Here, we show that the egg-spots of haplochromines are made up of yellow pigment cells, xanthophores, and that a gene coding for a type III receptor tyrosine kinase, colony-stimulating factor 1 receptor a (csf1ra), is expressed in egg-spot tissue. Molecular evolutionary analyses reveal that the extracellular ligand-binding and receptor-interacting domain of csf1ra underwent adaptive sequence evolution in the ancestral lineage of the haplochromines, coinciding with the emergence of egg-dummies. We also find that csf1ra is expressed in the egg-dummies of a distantly related cichlid species, the ectodine cichlid Ophthalmotilapia ventralis, in which markings with similar functions evolved on the pelvic fin in convergence to those of the haplochromines. CONCLUSION: We conclude that modifications of existing signal transduction mechanisms might have evolved in the haplochromine lineage in association with the origination of anal fin egg-dummies. That positive selection has acted during the evolution of a color gene that seems to be involved in the morphogenesis of a sexually selected trait, the egg-dummies, highlights the importance of further investigations of the comparative genomic basis of the phenotypic diversification of cichlid fishes.
Abstract:
Many root-colonizing pseudomonads are able to promote plant growth by increasing phosphate availability in soil through solubilization of poorly soluble rock phosphates. The major mechanism of phosphate solubilization by pseudomonads is the secretion of gluconic acid, which requires the enzyme glucose dehydrogenase and its cofactor pyrroloquinoline quinone (PQQ). The main aim of this study was to evaluate whether a PQQ biosynthetic gene is suitable for studying the phylogeny of phosphate-solubilizing pseudomonads. To this end, two new primers, which specifically amplify the pqqC gene of the Pseudomonas genus, were designed. pqqC fragments were amplified and sequenced from a Pseudomonas strain collection and from a natural wheat rhizosphere population using cultivation-dependent and cultivation-independent approaches. Phylogenetic trees based on pqqC sequences were compared to trees obtained with the two concatenated housekeeping genes rpoD and gyrB. For both pqqC and rpoD-gyrB, similar main phylogenetic clusters were found. However, in the pqqC but not in the rpoD-gyrB tree, the group of fluorescent pseudomonads producing the antifungal compounds 2,4-diacetylphloroglucinol and pyoluteorin was located outside the Pseudomonas fluorescens group. pqqC sequences from isolated pseudomonads were distributed differently among the identified phylogenetic groups than pqqC sequences derived from the cultivation-independent approach. Comparing pqqC phylogeny and phosphate solubilization activity, we identified one phylogenetic group with high solubilization activity. In summary, we demonstrate that the gene pqqC is a novel molecular marker that can be used as a complement to housekeeping genes for studying the diversity and evolution of plant-beneficial pseudomonads.
Abstract:
Changes in gene expression are thought to underlie many of the phenotypic differences between species. However, large-scale analyses of gene expression evolution were until recently prevented by technological limitations. Here we report the sequencing of polyadenylated RNA from six organs across ten species that represent all major mammalian lineages (placentals, marsupials and monotremes) and birds (the evolutionary outgroup), with the goal of understanding the dynamics of mammalian transcriptome evolution. We show that the rate of gene expression evolution varies among organs, lineages and chromosomes, owing to differences in selective pressures: transcriptome change was slow in nervous tissues and rapid in testes, slower in rodents than in apes and monotremes, and rapid for the X chromosome right after its formation. Although gene expression evolution in mammals was strongly shaped by purifying selection, we identify numerous potentially selectively driven expression switches, which occurred at different rates across lineages and tissues and which probably contributed to the specific organ biology of various mammals.
Abstract:
In this study we have characterized intra-patient length polymorphism in V4 by cloning and sequencing a C2-C4 fragment from HIV plasma RNA in patients at different stages of HIV disease. Clonal analysis of clade B, G, and CRF02 isolates during early infection shows extensive intra-patient V4 variability, due to the presence of indel-associated polymorphism. Indels, coupled with amino acid substitution events, affect the number and distribution of potential N-glycosylation sites, resulting in the coexistence, within the same patient, of V4 subsets, each characterized by different sizes, amino acid sequences, and potential N-glycosylation patterns. In contrast, V3 appears to be relatively homogeneous, with similar V3 sequences associated with significantly different V4 sequences within the same clinical specimen. Based on these data, we propose that during early chronic infection V4 is present as a highly divergent quasispecies, enabling the virus to adopt different conformational structures according to immune constraints and other selective pressures.
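The effect of indels and substitutions on potential N-glycosylation sites can be illustrated with a minimal sketch. It assumes the standard sequon definition (Asn-X-Ser/Thr, where X is any residue except proline); the function name and the three toy clone sequences are hypothetical and are not data from the study.

```python
import re

# Sequon N-X-[S/T], X != P. The lookahead keeps the match one residue wide,
# so overlapping sequons (e.g. "NNTS") are each reported.
SEQUON = re.compile(r"N(?=[^P][ST])")

def pngs_positions(seq: str) -> list[int]:
    """Return 0-based positions of the Asn in each potential N-glycosylation site."""
    ungapped = seq.upper().replace("-", "")  # drop alignment gaps before scanning
    return [m.start() for m in SEQUON.finditer(ungapped)]

# Hypothetical clones from one specimen (illustrative sequences only).
clones = {
    "clone_a": "CNSTQLFNSTWNGT",  # reference-like variant
    "clone_b": "CNSTQLFNPTWNGT",  # S->P substitution abolishes the middle sequon
    "clone_c": "CNSTQLFNGT",      # 4-residue deletion removes one sequon
}

for name, seq in clones.items():
    sites = pngs_positions(seq)
    print(name, len(seq), len(sites), sites)
```

Comparing the length, site count and site positions per clone gives a simple summary of how size and glycosylation pattern co-vary among variants in one specimen.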
Abstract:
BACKGROUND: Along the chromosome of the obligate intracellular bacterium Protochlamydia amoebophila UWE25, we recently described a genomic island, Pam100G. It contains a tra unit likely involved in conjugative DNA transfer and lgrE, a 5.6-kb gene similar to five others of P. amoebophila: lgrA to lgrD and lgrF. We describe here the structure, regulation and evolution of these proteins, termed LGRs because they are encoded by "Large G+C-Rich" genes. RESULTS: No homologs to the whole protein sequence of LGRs were found in other organisms. Phylogenetic analyses suggest that the serial duplications producing the six LGRs occurred relatively recently, and nucleotide usage analyses show that lgrB, lgrE and lgrF were relocated on the chromosome. The C-terminal part of LGRs is homologous to leucine-rich repeat (LRR) domains. Defined by a cumulative alignment score, the 5 to 18 concatenated octacosapeptidic (28-mer) LRRs of LGRs all present a predicted alpha-helix conformation. Their closest homologs are the 28-residue RI-like LRRs of mammalian NODs and the 24-mers of some Ralstonia and Legionella proteins. Interestingly, lgrE, which is present on Pam100G like the tra operon, exhibits Pfam domains related to DNA metabolism. CONCLUSION: Comparison of the LRRs enables us to propose a parsimonious evolutionary scenario for these domains, driven by adjacent concatenations of LRRs. Our model, established on bacterial LRRs, can be challenged in eukaryotic proteins carrying less conserved LRRs, such as NOD proteins and Toll-like receptors.
Abstract:
In the ecologically important arbuscular mycorrhizal fungi (AMF), Sod1 encodes a functional polypeptide that confers increased tolerance to oxidative stress and that is upregulated inside the roots during early steps of the symbiosis with host plants. It is still unclear whether its expression is directed at scavenging reactive oxygen species (ROS) produced by the host, whether it plays a role in the fungus-host dialogue, or whether it is a consequence of oxidative stress from the surrounding environment. All these possibilities are equally likely, and molecular variation at the Sod1 locus may have adaptive implications for one or all of the three functions mentioned. In this paper, we analyzed the diversity of the Sod1 gene in six AMF species, as well as in 14 Glomus intraradices isolates from a single natural population. By sequencing this locus, we identified a large amount of nucleotide and amino acid diversity both among AMF species and among individuals, suggesting a rapid divergence of its codons. The Sod1 gene was monomorphic within each isolate we analyzed, and quantitative PCR strongly suggests that this locus is present as a single copy in G. intraradices. Maximum-likelihood analyses performed using a variety of models of codon evolution indicated that a number of amino acid sites most likely evolved under positive selection among AMF species. In addition, we found that some isolates of G. intraradices from a natural population harbor very divergent orthologous Sod1 sequences, and our analysis suggested that diversifying selection, rather than recombination, was responsible for the persistence of this molecular diversity within the AMF population.
Abstract:
Staphylococcus aureus, especially when it is methicillin resistant, has been recognised as a major cause of nosocomial and community-acquired infections. It has also been shown that certain strains are able to cause clonal epidemics whereas others show a more incidental occurrence. On the basis of this behavioural distinction, a genetic feature underlying this difference in epidemicity can be assumed. Understanding the difference will not only contribute to the development of markers for the identification of epidemic strains but will also shed light on the evolution of clones. Genomes of strains from two independent collections (n=18 and n=10 strains) were analysed. Both collections were composed of carefully selected, genetically diverse strains with clinically well-defined epidemic and sporadic behaviour. Comparative genome hybridisation (CGH) was performed using an Agilent array for one collection (up to 11 probes per open reading frame, ORF) and an Affymetrix array for the other (up to 30 probes per ORF). Presence and absence information for probe homologues and ORFs was used for analysis of molecular variance (AMOVA) at the strain and behaviour levels. Not a single probe showed 100% concordant differences between epidemic and sporadic strains. Moreover, probe differences between groups were always smaller than those within groups. This was also true when the analysis was focussed on presence versus absence of ORFs or when probe information was transformed into allelic profiles. These findings present strong evidence against the presence or absence of a single common specific genetic factor differentiating epidemic from sporadic S. aureus clones.
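The criterion of a probe showing "100% concordant differences" between groups can be sketched as a simple screen over a presence/absence table. This is an illustrative sketch only, not the AMOVA used in the study; the function name, probe identifiers and toy calls are hypothetical.

```python
def perfectly_discriminating(presence: dict[str, list[int]],
                             labels: list[str]) -> list[str]:
    """Return probes whose presence/absence call is constant within each
    behaviour group and differs between the two groups."""
    hits = []
    for probe, calls in presence.items():
        epidemic = {c for c, lab in zip(calls, labels) if lab == "epidemic"}
        sporadic = {c for c, lab in zip(calls, labels) if lab == "sporadic"}
        # One distinct call per group, and the two groups disagree.
        if len(epidemic) == 1 and len(sporadic) == 1 and epidemic != sporadic:
            hits.append(probe)
    return hits

# Toy data: 1 = probe present, 0 = absent; columns follow the order of `labels`.
labels = ["epidemic", "epidemic", "sporadic", "sporadic"]
presence = {
    "probe_1": [1, 1, 1, 1],  # present everywhere: uninformative
    "probe_2": [1, 0, 0, 0],  # varies within the epidemic group
    "probe_3": [1, 1, 0, 1],  # varies within the sporadic group
}

print(perfectly_discriminating(presence, labels))
```

With these toy calls the screen returns an empty list, the same kind of outcome the abstract reports for the real arrays.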
Abstract:
Cancer cells acquire cell-autonomous capacities to undergo limitless proliferation and survival through the activation of oncogenes and inactivation of tumor suppressor genes. Nevertheless, the formation of a clinically relevant tumor requires support from the surrounding normal stroma, also referred to as the tumor microenvironment. Carcinoma-associated fibroblasts, leukocytes, bone marrow-derived cells, blood and lymphatic vascular endothelial cells present within the tumor microenvironment contribute to tumor progression. Recent evidence indicates that the microenvironment provides essential cues to the maintenance of cancer stem cells/cancer initiating cells and to promote the seeding of cancer cells at metastatic sites. Furthermore, inflammatory cells and immunomodulatory mediators present in the tumor microenvironment polarize host immune response toward specific phenotypes impacting tumor progression. A growing number of studies demonstrate a positive correlation between angiogenesis, carcinoma-associated fibroblasts, and inflammatory infiltrating cells and poor outcome, thereby emphasizing the clinical relevance of the tumor microenvironment to aggressive tumor progression. Thus, the dynamic and reciprocal interactions between tumor cells and cells of the tumor microenvironment orchestrate events critical to tumor evolution toward metastasis, and many cellular and molecular elements of the microenvironment are emerging as attractive targets for therapeutic strategies.
Abstract:
In addition to genetic changes affecting the function of gene products, changes in gene expression have been suggested to underlie many or even most of the phenotypic differences among mammals. However, detailed gene expression comparisons were, until recently, restricted to closely related species, owing to technological limitations. Thus, we took advantage of the latest technologies (RNA-Seq) to generate extensive qualitative and quantitative transcriptome data for a unique collection of somatic and germline tissues from representatives of all major mammalian lineages (placental mammals, marsupials and monotremes) and birds, the evolutionary outgroup. In the first major project of my thesis, we performed global comparative analyses of gene expression levels based on these data. Our analyses provided fundamental insights into the dynamics of transcriptome change during mammalian evolution (e.g., the rate of expression change across species, tissues and chromosomes) and allowed the exploration of the functional relevance and phenotypic implications of transcription changes at a genome-wide scale (e.g., we identified numerous potentially selectively driven expression switches). In a second project of my thesis, which was also based on the unique transcriptome data generated in the context of the first project, we focused on the evolution of alternative splicing in mammals. Alternative splicing contributes to transcriptome complexity by generating several transcript isoforms from a single gene, which can thus perform various functions. To complete the global comparative analysis of gene expression changes, we explored patterns of alternative splicing evolution.
This work uncovered several general and unexpected patterns of alternative splicing evolution (e.g., we found that alternative splicing evolves extremely rapidly) as well as a large number of conserved alternative isoforms that may be crucial for the functioning of mammalian organs. Finally, the third project of my PhD consisted of analyzing in detail the unique functional and evolutionary properties of the testis by exploring the extent of its transcriptome complexity. This organ was previously shown to evolve rapidly at both the phenotypic and molecular levels, apparently because of the specific pressures that act on it and are associated with its reproductive function. Moreover, my analyses of the amniote tissue transcriptome data described above revealed strikingly widespread transcriptional activity of both functional and nonfunctional genomic elements in the testis compared to the other organs. To elucidate the cellular source and mechanisms underlying this promiscuous transcription in the testis, we generated deep-coverage RNA-Seq data for all major testis cell types as well as epigenetic data (DNA and histone methylation) using the mouse as a model system. The integration of these complete datasets revealed that meiotic and especially post-meiotic germ cells are the major contributors to the widespread functional and nonfunctional transcriptome complexity of the testis, and that this "promiscuous" spermatogenic transcription results, at least partially, from an overall transcriptionally permissive chromatin state. We hypothesize that this particular open state of the chromatin results from the extensive chromatin remodeling that occurs during spermatogenesis, which ultimately leads to the replacement of histones by protamines in mature spermatozoa.
Our results have important functional and evolutionary implications (e.g., regarding new gene birth and testicular gene expression evolution). Overall, these three large-scale projects of my thesis provide complete and massive datasets that constitute valuable resources for further functional and evolutionary analyses of mammalian genomes.
Abstract:
Inbreeding avoidance is often invoked to explain observed patterns of dispersal, and theoretical models indeed point to a possibly important role. However, while inbreeding load is usually assumed constant in these models, it is actually bound to vary dynamically under the combined influences of mutation, drift, and selection and thus to evolve jointly with dispersal. Here we report the results of individual-based stochastic simulations allowing such a joint evolution. We show that strongly deleterious mutations should play no significant role, owing to the low genomic mutation rate for such mutations. Mildly deleterious mutations, by contrast, may create enough heterosis to affect the evolution of dispersal as an inbreeding-avoidance mechanism, but only provided that they are also strongly recessive. If slightly recessive, they will spread among demes and accumulate at the metapopulation level, thus contributing to mutational load, but not to heterosis. The resulting loss of viability may then combine with demographic stochasticity to promote population fluctuations, which foster indirect incentives for dispersal. Our simulations suggest that, under biologically realistic parameter values, deleterious mutations have a limited impact on the evolution of dispersal, which on average exceeds by only one-third the values expected from kin-competition avoidance.
Abstract:
The genomic era has revealed that the large repertoire of observed animal phenotypes is dependent on changes in the expression patterns of a finite number of genes, which are mediated by a plethora of transcription factors (TFs) with distinct specificities. The dimerization of TFs can also increase the complexity of a genetic regulatory network manifold, by combining a small number of monomers into dimers with distinct functions. Therefore, studying the evolution of these dimerizing TFs is vital for understanding how complexity increased during animal evolution. We focus on the second largest family of dimerizing TFs, the basic-region leucine zipper (bZIP), and infer when it expanded and how bZIP DNA-binding and dimerization functions evolved during the major phases of animal evolution. Specifically, we classify the metazoan bZIPs into 19 families and confirm the ancient nature of at least 13 of these families, predating the split of the cnidaria. We observe fixation of a core dimerization network in the last common ancestor of protostomes-deuterostomes. This was followed by an expansion of the number of proteins in the network, but no major dimerization changes in interaction partners, during the emergence of vertebrates. In conclusion, the bZIPs are an excellent model with which to understand how DNA binding and protein interactions of TFs evolved during animal evolution.