42 resultados para GENE DUPLICATION

em National Center for Biotechnology Information - NCBI


Relevância:

100.00% 100.00%

Publicador:

Resumo:

Changes in genes encoding transcriptional regulators can alter development and are important components of the molecular mechanisms of morphological evolution. MADS-box genes encode transcriptional regulators of diverse and important biological functions. In plants, MADS-box genes regulate flower, fruit, leaf, and root development. Recent sequencing efforts in Arabidopsis have allowed a nearly complete sampling of the MADS-box gene family from a single plant, something that was lacking in previous phylogenetic studies. To test the long-suspected parallel between the evolution of the MADS-box gene family and the evolution of plant form, a polarized gene phylogeny is necessary. Here we suggest that a gene duplication ancestral to the divergence of plants and animals gave rise to two main lineages of MADS-box genes: TypeI and TypeII. We locate the root of the eukaryotic MADS-box gene family between these two lineages. A novel monophyletic group of plant MADS domains (AGL34 like) seems to be more closely related to previously identified animal SRF-like MADS domains to form TypeI lineage. Most other plant sequences form a clear monophyletic group with animal MEF2-like domains to form TypeII lineage. Only plant TypeII members have a K domain that is downstream of the MADS domain in most plant members previously identified. This suggests that the K domain evolved after the duplication that gave rise to the two lineages. Finally, a group of intermediate plant sequences could be the result of recombination events. These analyses may guide the search for MADS-box sequences in basal eukaryotes and the phylogenetic placement of new genes from other plant species.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

Plant-specific polyketide synthase genes constitute a gene superfamily, including universal chalcone synthase [CHS; malonyl-CoA:4-coumaroyl-CoA malonyltransferase (cyclizing) (EC 2.3.1.74)] genes, sporadically distributed stilbene synthase (SS) genes, and atypical, as-yet-uncharacterized CHS-like genes. We have recently isolated from Gerbera hybrida (Asteraceae) an unusual CHS-like gene, GCHS2, which codes for an enzyme with structural and enzymatic properties as well as ontogenetic distribution distinct from both CHS and SS. Here, we show that the GCHS2-like function is encoded in the Gerbera genome by a family of at least three transcriptionally active genes. Conservation within the GCHS2 family was exploited with selective PCR to study the occurrence of GCHS2-like genes in other Asteraceae. Parsimony analysis of the amplified sequences together with CHS-like genes isolated from other taxa of angiosperm subclass Asteridae suggests that GCHS2 has evolved from CHS via a gene duplication event that occurred before the diversification of the Asteraceae. Enzyme activity analysis of proteins produced in vitro indicates that the GCHS2 reaction is a non-SS variant of the CHS reaction, with both different substrate specificity (to benzoyl-CoA) and a truncated catalytic profile. Together with the recent results of Durbin et al. [Durbin, M. L., Learn, G. H., Jr., Huttley, G. A. & Clegg, M. T. (1995) Proc. Natl. Acad. Sci. USA 92, 3338-3342], our study confirms a gene duplication-based model that explains how various related functions have arisen from CHS during plant evolution.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Keratinocyte growth factor (KGF) is a member of the fibroblast growth factor family. Portions of the gene encoding KGF were amplified during primate evolution and are present in multiple nonprocessed copies in the human genome. Nucleotide analysis of a representative sampling of these KGF-like sequences indicated that they were at least 95% identical to corresponding regions of the KGF gene. To localize these sequences to specific chromosomal sites in human and higher primates, we used fluorescence in situ hybridization. In human, using a cosmid probe encoding KGF exon 1, we assigned the location of the KGF gene to chromosome 15q15–21.1. In addition, copies of KGF-like sequences hybridizing only with a cosmid probe encoding exons 2 and 3 were localized to dispersed sites on chromosome 2q21, 9p11, 9q12–13, 18p11, 18q11, 21q11, and 21q21.1. The distribution of KGF-like sequences suggests a role for alphoid DNA in their amplification and dispersion. In chimpanzee, KGF-like sequences were observed at five chromosomal sites, which were each homologous to sites in human, while in gorilla, a subset of four of these homologous sites was identified; in orangutan two sites were identified, while gibbon exhibited only a single site. The chromosomal localization of KGF sequences in human and great ape genomes indicates that amplification and dispersion occurred in multiple discrete steps, with initial KGF gene duplication and dispersion taking place in gibbon and involving loci corresponding to human chromosomes 15 and 21. These findings support the concept of a closer evolutionary relationship of human and chimpanzee and a possible selective pressure for such dispersion during the evolution of higher primates.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The parasitic bacterium Mycoplasma genitalium has a small, reduced genome with close to a basic set of genes. As a first step toward determining the families of protein domains that form the products of these genes, we have used the multiple sequence programs psi-blast and geanfammer to match the sequences of the 467 gene products of M. genitalium to the sequences of the domains that form proteins of known structure [Protein Data Bank (PDB) sequences]. PDB sequences (274) match all of 106 M. genitalium sequences and some parts of another 85; thus, 41% of its total sequences are matched in all or part. The evolutionary relationships of the PDB domains that match M. genitalium are described in the structural classification of proteins (SCOP) database. Using this information, we show that the domains in the matched M. genitalium sequences come from 114 superfamilies and that 58% of them have arisen by gene duplication. This level of duplication is more than twice that found by using pairwise sequence comparisons. The PDB domain matches also describe the domain structure of the matched sequences: just over a quarter contain one domain and the rest have combinations of two or more domains.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

A DNA sequence has been obtained for a 35.6-kb genomic segment from Heliobacillus mobilis that contains a major cluster of photosynthesis genes. A total of 30 ORFs were identified, 20 of which encode enzymes for bacteriochlorophyll and carotenoid biosynthesis, reaction-center (RC) apoprotein, and cytochromes for cyclic electron transport. Donor side electron-transfer components to the RC include a putative RC-associated cytochrome c553 and a unique four-large-subunit cytochrome bc complex consisting of Rieske Fe-S protein (encoded by petC), cytochrome b6 (petB), subunit IV (petD), and a diheme cytochrome c (petX). Phylogenetic analysis of various photosynthesis gene products indicates a consistent grouping of oxygenic lineages that are distinct and descendent from anoxygenic lineages. In addition, H. mobilis was placed as the closest relative to cyanobacteria, which form a monophyletic origin to chloroplast-based photosynthetic lineages. The consensus of the photosynthesis gene trees also indicates that purple bacteria are the earliest emerging photosynthetic lineage. Our analysis also indicates that an ancient gene-duplication event giving rise to the paralogous bchI and bchD genes predates the divergence of all photosynthetic groups. In addition, our analysis of gene duplication of the photosystem I and photosystem II core polypeptides supports a “heterologous fusion model” for the origin and evolution of oxygenic photosynthesis.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

Ubiquitin is a highly conserved protein that is encoded by a multigene family. It is generally believed that this gene family is subject to concerted evolution, which homogenizes the member genes of the family. However, protein homogeneity can be attained also by strong purifying selection. We therefore studied the proportion (pS) of synonymous nucleotide differences between members of the ubiquitin gene family from 28 species of fungi, plants, and animals. The results have shown that pS is generally very high and is often close to the saturation level, although the protein sequence is virtually identical for all ubiquitins from fungi, plants, and animals. A small proportion of species showed a low level of pS values, but these values appeared to be caused by recent gene duplication. It was also found that the number of repeat copies of the gene family varies considerably with species, and some species harbor pseudogenes. These observations suggest that the members of this gene family evolve almost independently by silent nucleotide substitution and are subjected to birth-and-death evolution at the DNA level.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

The hypothesis that morphological evolution may largely result from changes in gene regulation rather than gene structure has been difficult to test. Morphological differences among insects are often apparent in the cuticle structures produced. The dopa decarboxylase (Ddc) and alpha-methyldopa hypersensitive (amd) genes arose from an ancient gene duplication. In Drosophila, they have evolved nonoverlapping functions, including the production of distinct types of cuticle, and for Ddc, the production of the neurotransmitters, dopamine and serotonin. The amd gene is particularly active in the production of specialized flexible cuticles in the developing embryo. We have compared the pattern of amd expression in three Drosophila species. Several regions of expression conserved in all three species but, surprisingly, a unique domain of expression is found in Drosophila simulans that does occur in the closely related (2-5 million years) Drosophila melanogaster or in the more remote (40-50 million years) Drosophila virilis. The "sudden" appearance of a completely new and robust domain of expression provides a glimpse of evolutionary variation resulting from changes in regulation of structural gene expression.

Relevância:

70.00% 70.00%

Publicador:

Resumo:

We have isolated a new hemoglobin gene from soybean. It is expressed in cotyledons, stems of seedlings, roots, young leaves, and in some cells in the nodules that are associated with the nitrogen-fixing Bradyrhizobium symbiont. This contrasts with the expression of the leghemoglobins, which are active only in the infected cells of the nodules. The deduced protein sequence of the new gene shows only 58% similarity to one of the soybean leghemoglobins, but 85-87% similarity to hemoglobins from the nonlegumes Parasponia, Casuarina, and barley. The pattern of expression and the gene sequence indicate that this new gene is a nonsymbiotic legume hemoglobin. The finding of this gene in legumes and similar genes in other species strengthens our previous suggestion that genomes of all plants contain hemoglobin genes. The specialized leghemoglobin gene family may have arisen from a preexisting nonsymbiotic hemoglobin by gene duplication.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Knowledge of the origin and evolution of gene families is critical to our understanding of the evolution of protein function. To gain a detailed understanding of the evolution of the small heat shock proteins (sHSPs) in plants, we have examined the evolutionary history of the chloroplast (CP)-localized sHSPs. Previously, these nuclear-encoded CP proteins had been identified only from angiosperms. This study reveals the presence of the CP sHSPs in a moss, Funaria hygrometrica. Two clones for CP sHSPs were isolated from a F. hygrometrica heat shock cDNA library that represent two distinct CP sHSP genes. Our analysis of the CP sHSPs reveals unexpected evolutionary relationships and patterns of sequence conservation. Phylogenetic analysis of the CP sHSPs with other plant CP sHSPs and eukaryotic, archaeal, and bacterial sHSPs shows that the CP sHSPs are not closely related to the cyanobacterial sHSPs. Thus, they most likely evolved via gene duplication from a nuclear-encoded cytosolic sHSP and not via gene transfer from the CP endosymbiont. Previous sequence analysis had shown that all angiosperm CP sHSPs possess a methionine-rich region in the N-terminal domain. The primary sequence of this region is not highly conserved in the F. hygrometrica CP sHSPs. This lack of sequence conservation indicates that sometime in land plant evolution, after the divergence of mosses from the common ancestor of angiosperms but before the monocot–dicot divergence, there was a change in the selective constraints acting on the CP sHSPs.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Cryptocyanin, a copper-free hexameric protein in crab (Cancer magister) hemolymph, has been characterized and the amino acid sequence has been deduced from its cDNA. It is markedly similar in sequence, size, and structure to hemocyanin, the copper-containing oxygen-transport protein found in many arthropods. Cryptocyanin does not bind oxygen, however, and lacks three of the six highly conserved copper-binding histidine residues of hemocyanin. Cryptocyanin has no phenoloxidase activity, although a phenoloxidase is present in the hemolymph. The concentration of cryptocyanin in the hemolymph is closely coordinated with the molt cycle and reaches levels higher than hemocyanin during premolt. Cryptocyanin resembles insect hexamerins in the lack of copper, molt cycle patterns of biosynthesis, and potential contributions to the new exoskeleton. Phylogenetic analysis of sequence similarities between cryptocyanin and other members of the hemocyanin gene family shows that cryptocyanin is closely associated with crustacean hemocyanins and suggests that cryptocyanin arose as a result of a hemocyanin gene duplication. The presence of both hemocyanin and cryptocyanin in one animal provides an example of how insect hexamerins might have evolved from hemocyanin. Our results suggest that multiple members of the hemocyanin gene family—hemocyanin, cryptocyanin, phenoloxidase, and hexamerins—may participate in two vital functions of molting animals, oxygen binding and molting. Cryptocyanin may provide important molecular data to further investigate evolutionary relationships among all molting animals.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

This paper describes three distinct estrogen receptor (ER) subtypes: ERα, ERβ, and a unique type, ERγ, cloned from a teleost fish, the Atlantic croaker Micropogonias undulatus; the first identification of a third type of classical ER in vertebrate species. Phylogenetic analysis shows that ERγ arose through gene duplication from ERβ early in the teleost lineage and indicates that ERγ is present in other teleosts, although it has not been recognized as such. The Atlantic croaker ERγ shows amino acid differences in regions important for ligand binding and receptor activation that are conserved in all other ERγs. The three ER subtypes are genetically distinct and have different distribution patterns in Atlantic croaker tissues. In addition, ERβ and ERγ fusion proteins can each bind estradiol-17β with high affinity. The presence of three functional ERs in one species expands the role of ER multiplicity in estrogen signaling systems and provides a unique opportunity to investigate the dynamics and mechanisms of ER evolution.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

The aryl hydrocarbon receptor (AHR) is a ligand-activated transcription factor through which halogenated aromatic hydrocarbons such as 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD) cause altered gene expression and toxicity. The AHR belongs to the basic helix–loop–helix/Per-ARNT-Sim (bHLH-PAS) family of transcriptional regulatory proteins, whose members play key roles in development, circadian rhythmicity, and environmental homeostasis; however, the normal cellular function of the AHR is not yet known. As part of a phylogenetic approach to understanding the function and evolutionary origin of the AHR, we sequenced the PAS homology domain of AHRs from several species of early vertebrates and performed phylogenetic analyses of these AHR amino acid sequences in relation to mammalian AHRs and 24 other members of the PAS family. AHR sequences were identified in a teleost (the killifish Fundulus heteroclitus), two elasmobranch species (the skate Raja erinacea and the dogfish Mustelus canis), and a jawless fish (the lamprey Petromyzon marinus). Two putative AHR genes, designated AHR1 and AHR2, were found both in Fundulus and Mustelus. Phylogenetic analyses indicate that the AHR2 genes in these two species are orthologous, suggesting that an AHR gene duplication occurred early in vertebrate evolution and that multiple AHR genes may be present in other vertebrates. Database searches and phylogenetic analyses identified four putative PAS proteins in the nematode Caenorhabditis elegans, including possible AHR and ARNT homologs. Phylogenetic analysis of the PAS gene family reveals distinct clades containing both invertebrate and vertebrate PAS family members; the latter include paralogous sequences that we propose have arisen by gene duplication early in vertebrate evolution. Overall, our analyses indicate that the AHR is a phylogenetically ancient protein present in all living vertebrate groups (with a possible invertebrate homolog), thus providing an evolutionary perspective to the study of dioxin toxicity and AHR function.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Gnathostome vertebrates have multiple members of the Dlx family of transcription factors that are expressed during the development of several tissues considered to be vertebrate synapomorphies, including the forebrain, cranial neural crest, placodes, and pharyngeal arches. The Dlx gene family thus presents an ideal system in which to examine the relationship between gene duplication and morphological innovation during vertebrate evolution. Toward this end, we have cloned Dlx genes from the lamprey Petromyzon marinus, an agnathan vertebrate that occupies a critical phylogenetic position between cephalochordates and gnathostomes. We have identified four Dlx genes in P. marinus, whose orthology with gnathostome Dlx genes provides a model for how this gene family evolved in the vertebrate lineage. Differential expression of these lamprey Dlx genes in the forebrain, cranial neural crest, pharyngeal arches, and sensory placodes of lamprey embryos provides insight into the developmental evolution of these structures as well as a model of regulatory evolution after Dlx gene duplication events.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Pseudogenes are non-functioning copies of genes in genomic DNA, which may either result from reverse transcription from an mRNA transcript (processed pseudogenes) or from gene duplication and subsequent disablement (non-processed pseudogenes). As pseudogenes are apparently ‘dead’, they usually have a variety of obvious disablements (e.g., insertions, deletions, frameshifts and truncations) relative to their functioning homologs. We have derived an initial estimate of the size, distribution and characteristics of the pseudogene population in the Caenorhabditis elegans genome, performing a survey in ‘molecular archaeology’. Corresponding to the 18 576 annotated proteins in the worm (i.e., in Wormpep18), we have found an estimated total of 2168 pseudogenes, about one for every eight genes. Few of these appear to be processed. Details of our pseudogene assignments are available from http://bioinfo.mbb.yale.edu/genome/worm/pseudogene. The population of pseudogenes differs significantly from that of genes in a number of respects: (i) pseudogenes are distributed unevenly across the genome relative to genes, with a disproportionate number on chromosome IV; (ii) the density of pseudogenes is higher on the arms of the chromosomes; (iii) the amino acid composition of pseudogenes is midway between that of genes and (translations of) random intergenic DNA, with enrichment of Phe, Ile, Leu and Lys, and depletion of Asp, Ala, Glu and Gly relative to the worm proteome; and (iv) the most common protein folds and families differ somewhat between genes and pseudogenes—whereas the most common fold found in the worm proteome is the immunoglobulin fold and the most common ‘pseudofold’ is the C-type lectin. In addition, the size of a gene family bears little overall relationship to the size of its corresponding pseudogene complement, indicating a highly dynamic genome. There are in fact a number of families associated with large populations of pseudogenes. For example, one family of seven-transmembrane receptors (represented by gene B0334.7) has one pseudogene for every four genes, and another uncharacterized family (represented by gene B0403.1) is approximately two-thirds pseudogenic. Furthermore, over a hundred apparent pseudogenic fragments do not have any obvious homologs in the worm.

Relevância:

60.00% 60.00%

Publicador:

Resumo:

Plant chloroplasts originated from an endosymbiotic event by which an ancestor of contemporary cyanobacteria was engulfed by an early eukaryotic cell and then transformed into an organelle. Oxygenic photosynthesis is the specific feature of cyanobacteria and chloroplasts, and the photosynthetic machinery resides in an internal membrane system, the thylakoids. The origin and genesis of thylakoid membranes, which are essential for oxygenic photosynthesis, are still an enigma. Vipp1 (vesicle-inducing protein in plastids 1) is a protein located in both the inner envelope and the thylakoids of Pisum sativum and Arabidopsis thaliana. In Arabidopsis disruption of the VIPP1 gene severely affects the plant's ability to form properly structured thylakoids and as a consequence to carry out photosynthesis. In contrast, Vipp1 in Synechocystis appears to be located exclusively in the plasma membrane. Yet, as in higher plants, disruption of the VIPP1 gene locus leads to the complete loss of thylakoid formation. So far VIPP1 genes are found only in organisms carrying out oxygenic photosynthesis. They share sequence homology with a subunit encoded by the bacterial phage shock operon (PspA) but differ from PspA by a C-terminal extension of about 30 amino acids. In two cyanobacteria, Synechocystis and Anabaena, both a VIPP1 and a pspA gene are present, and phylogenetic analysis indicates that VIPP1 originated from a gene duplication of the latter and thereafter acquired its new function. It also appears that the C-terminal extension that discriminates VIPP1 proteins from PspA is important for its function in thylakoid formation.