976 resultados para SEQUENCE EVOLUTION
Resumo:
Molecular phylogenetic studies of homologous sequences of nucleotides often assume that the underlying evolutionary process was globally stationary, reversible, and homogeneous (SRH), and that a model of evolution with one or more site-specific and time-reversible rate matrices (e.g., the GTR rate matrix) is enough to accurately model the evolution of data over the whole tree. However, an increasing body of data suggests that evolution under these conditions is an exception, rather than the norm. To address this issue, several non-SRH models of molecular evolution have been proposed, but they either ignore heterogeneity in the substitution process across sites (HAS) or assume it can be modeled accurately using the distribution. As an alternative to these models of evolution, we introduce a family of mixture models that approximate HAS without the assumption of an underlying predefined statistical distribution. This family of mixture models is combined with non-SRH models of evolution that account for heterogeneity in the substitution process across lineages (HAL). We also present two algorithms for searching model space and identifying an optimal model of evolution that is less likely to over- or underparameterize the data. The performance of the two new algorithms was evaluated using alignments of nucleotides with 10 000 sites simulated under complex non-SRH conditions on a 25-tipped tree. The algorithms were found to be very successful, identifying the correct HAL model with a 75% success rate (the average success rate for assigning rate matrices to the tree's 48 edges was 99.25%) and, for the correct HAL model, identifying the correct HAS model with a 98% success rate. Finally, parameter estimates obtained under the correct HAL-HAS model were found to be accurate and precise. The merits of our new algorithms were illustrated with an analysis of 42 337 second codon sites extracted from a concatenation of 106 alignments of orthologous genes encoded by the nuclear genomes of Saccharomyces cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii, S. castellii, S. kluyveri, S. bayanus, and Candida albicans. Our results show that second codon sites in the ancestral genome of these species contained 49.1% invariable sites, 39.6% variable sites belonging to one rate category (V1), and 11.3% variable sites belonging to a second rate category (V2). The ancestral nucleotide content was found to differ markedly across these three sets of sites, and the evolutionary processes operating at the variable sites were found to be non-SRH and best modeled by a combination of eight edge-specific rate matrices (four for V1 and four for V2). The number of substitutions per site at the variable sites also differed markedly, with sites belonging to V1 evolving slower than those belonging to V2 along the lineages separating the seven species of Saccharomyces. Finally, sites belonging to V1 appeared to have ceased evolving along the lineages separating S. cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii, and S. bayanus, implying that they might have become so selectively constrained that they could be considered invariable sites in these species.
DIFFERENT RATES OF MITOCHONDRIAL-DNA SEQUENCE EVOLUTION IN KIRK DIK-DIK (MADOQUA-KIRKII) POPULATIONS
Resumo:
We have investigated evolutionary rates of the mitochondrial genome among individuals of Madoqua kirkii using the relative rate test. Our results demonstrate that individuals of two chromosome races, East African cytotype A and Southwest African cytotype D, evolve about 2.3 times faster than East African cytotype B. Cytogenetic changes, DNA repair efficiency, mutagens, and more likely, hitherto unrecognized factors will account for the rate difference we have observed. Our results suggest additional caution when using molecular clocks in the estimation of divergence time, even within lineages of closely related taxa. Rate heterogeneity in microevolutionary timescales represents a potentially important aspect of basic evolutionary processes and may provide additional insights into factors which affect genome evolution. (C) 1995 Academic Press, Inc.
Resumo:
The chemokine receptor CCR5 can serve as a coreceptor for M-tropic HIV-1 infection and both M-tropic and T-tropic SIV infection. We sequenced the entire CCR5 gene from 10 nonhuman primates: Pongo pygmaeus, Hylobates leucogenys, Trachypithecus francoisi, Trachypithecus phayrei, Pygathrix nemaeus, Rhinopithecus roxellanae, Rhinopithecus bieti, Rhinopithecus avunculus, Macaca assamensis, and Macaca arctoides. When compared with CCR5 sequences from humans and other primates, our results demonstrate that:(1) nucleotide and amino acid sequences of CCR5 among primates are highly homologous, with variations slightly concentrated on the amino and carboxyl termini; and (2) site Asp13, which is critical for CD4-independent binding of SIV gp120 to Macaca mulatta CCR5, was also present in all other nonhuman primates tested here, suggesting that those nonhuman primate CCR5s might also bind SIV gp120 without the presence of CD4. The topologies of CCR5 gene trees constructed here conflict with the putative opinion that the snub-nosed langurs compose a monophyletic group, suggesting that the CCR5 gene may not be a good genetic marker for low-level phylogenetic analysis. The evolutionary rate of CCR5 was calculated, and our results suggest a slowdown in primates after they diverged from rodents. The synonymous mutation rate of CCR5 in primates is constant, about 1.1 x 10(-9) synonymous mutations per site per year. Comparisons of K-a and K-s suggest that the CCR5 genes have undergone negative or purifying selection. K-a/K-s ratios from cercopithecines and colobines are significantly different, implying that selective pressures have played different roles in the two lineages.
Resumo:
Cellular stresses activate the tumor suppressor p53 protein leading to selective binding to DNA response elements (REs) and gene transactivation from a large pool of potential p53 REs (p53REs). To elucidate how p53RE sequences and local chromatin context interact to affect p53 binding and gene transactivation, we mapped genome-wide binding localizations of p53 and H3K4me3 in untreated and doxorubicin (DXR)-treated human lymphoblastoid cells. We examined the relationships among p53 occupancy, gene expression, H3K4me3, chromatin accessibility (DNase 1 hypersensitivity, DHS), ENCODE chromatin states, p53RE sequence, and evolutionary conservation. We observed that the inducible expression of p53-regulated genes was associated with the steady-state chromatin status of the cell. Most highly inducible p53-regulated genes were suppressed at baseline and marked by repressive histone modifications or displayed CTCF binding. Comparison of p53RE sequences residing in different chromatin contexts demonstrated that weaker p53REs resided in open promoters, while stronger p53REs were located within enhancers and repressed chromatin. p53 occupancy was strongly correlated with similarity of the target DNA sequences to the p53RE consensus, but surprisingly, inversely correlated with pre-existing nucleosome accessibility (DHS) and evolutionary conservation at the p53RE. Occupancy by p53 of REs that overlapped transposable element (TE) repeats was significantly higher (p<10-7) and correlated with stronger p53RE sequences (p<10-110) relative to nonTE-associated p53REs, particularly for MLT1H, LTR10B, and Mer61 TEs. However, binding at these elements was generally not associated with transactivation of adjacent genes. Occupied p53REs located in L2-like TEs were unique in displaying highly negative PhyloP scores (predicted fast-evolving) and being associated with altered H3K4me3 and DHS levels. These results underscore the systematic interaction between chromatin status and p53RE context in the induced transactivation response. This p53 regulated response appears to have been tuned via evolutionary processes that may have led to repression and/or utilization of p53REs originating from primate-specific transposon elements.
Resumo:
Boron abundances have been derived for seven main-sequence B- type stars from Hubble Space Telescope STIS spectra around the B III lambda2066 line. In two stars, boron appears to be undepleted with respect to the presumed initial abundance. In one star, boron is detectable but is clearly depleted. In the other four stars, boron is undetectable, implying depletions of 1-2 dex. Three of these four stars are nitrogen enriched, but the fourth shows no enrichment of nitrogen. Only rotationally induced mixing predicts that boron depletions are unaccompanied by nitrogen enrichments. The inferred rate of boron depletion from our observations is in good agreement with these predictions. Other boron-depleted nitrogen-normal stars are identified from the literature. In addition, several boron- depleted nitrogen-rich stars are identified, and while all fall on the boron-nitrogen trend predicted by rotationally induced mixing, a majority have nitrogen enrichments that are not uniquely explained by rotation. The spectra have also been used to determine iron group (Cr, Mn, Fe, and Ni) abundances. The seven B-type stars have near-solar iron group abundances, as expected for young stars in the solar neighborhood. We have also analyzed the halo B-type star PG 0832 + 676. We find [Fe/H] = -0.88 +/- 0.10, and the absence of the B III line gives the upper limit [B/H] <-2.5. These and other published abundances are used to infer the star's evolutionary status as a post-asymptotic giant branch star.
Sequence, evolution and ligand binding properties of mammalian Duffy antigen/receptor for chemokines
Resumo:
The Duffy antigen/receptor for chemokine, DARC, acts as a widely expressed promiscuous chemokine receptor and as the erythrocyte receptor for Plasmodium vivax. To gain insight into the evolution and structure/function relations of DARC, we analyzed the binding of anti-human Fy monoclonal antibodies (mAbs) and human chemokines to red blood cells (RBCs) from 11 nonhuman primates and two nonprimate mammals, and we elucidated the structures of the DARC genes from gorilla, gibbon, baboon, marmoset, tamarin, night monkey and cattle. CXCL-8 and CCL-5 chemokine binding analysis indicated that the promiscuous binding profile characteristic of DARC is conserved across species. Among three mAbs that detected the Fy6 epitope by flow cytometric analysis of human and chimpanzee RBCs, only one reacted with night monkey and squirrel monkey. Only chimpanzee RBCs bound a significant amount of the anti-Fy3 mAb. Fy3 was also poorly detected on RBCs from gorilla, baboon and rhesus monkey, but not from new world monkeys. Alignment of DARC homologous sequences allowed us to construct a phylogenetic tree in which all branchings were in accordance with current knowledge of primate phylogeny. Although DARC was expected to be under strong internal and external selection pressure, in order to maintain chemokine binding and avoid Plasmodium vivax binding, respectively, our present study did not provide arguments in favor of a selection pressure on the extracellular domains involved in ligand specificity. The amino acid variability of DARC-like polypeptides was found to be well correlated with the hydrophylicity indexes, with the highest divergence on the amino-terminal extracellular domain. Analysis of the deduced amino acid sequences highlighted the conservation of some amino acid residues, which should prove to be critical for the structural and functional properties of DARC.
Resumo:
Models of DNA sequence evolution and methods for estimating evolutionary distances are needed for studying the rate and pattern of molecular evolution and for inferring the evolutionary relationships of organisms or genes. In this dissertation, several new models and methods are developed.^ The rate variation among nucleotide sites: To obtain unbiased estimates of evolutionary distances, the rate heterogeneity among nucleotide sites of a gene should be considered. Commonly, it is assumed that the substitution rate varies among sites according to a gamma distribution (gamma model) or, more generally, an invariant+gamma model which includes some invariable sites. A maximum likelihood (ML) approach was developed for estimating the shape parameter of the gamma distribution $(\alpha)$ and/or the proportion of invariable sites $(\theta).$ Computer simulation showed that (1) under the gamma model, $\alpha$ can be well estimated from 3 or 4 sequences if the sequence length is long; and (2) the distance estimate is unbiased and robust against violations of the assumptions of the invariant+gamma model.^ However, this ML method requires a huge amount of computational time and is useful only for less than 6 sequences. Therefore, I developed a fast method for estimating $\alpha,$ which is easy to implement and requires no knowledge of tree. A computer program was developed for estimating $\alpha$ and evolutionary distances, which can handle the number of sequences as large as 30.^ Evolutionary distances under the stationary, time-reversible (SR) model: The SR model is a general model of nucleotide substitution, which assumes (i) stationary nucleotide frequencies and (ii) time-reversibility. It can be extended to SRV model which allows rate variation among sites. I developed a method for estimating the distance under the SR or SRV model, as well as the variance-covariance matrix of distances. Computer simulation showed that the SR method is better than a simpler method when the sequence length $L>1,000$ bp and is robust against deviations from time-reversibility. As expected, when the rate varies among sites, the SRV method is much better than the SR method.^ The evolutionary distances under nonstationary nucleotide frequencies: The statistical properties of the paralinear and LogDet distances under nonstationary nucleotide frequencies were studied. First, I developed formulas for correcting the estimation biases of the paralinear and LogDet distances. The performances of these formulas and the formulas for sampling variances were examined by computer simulation. Second, I developed a method for estimating the variance-covariance matrix of the paralinear distance, so that statistical tests of phylogenies can be conducted when the nucleotide frequencies are nonstationary. Third, a new method for testing the molecular clock hypothesis was developed in the nonstationary case. ^
Resumo:
An extensive sequence comparison of the chloroplast ndhF gene from all major clades of the largest flowering plant family (Asteraceae) shows that this gene provides approximately 3 times more phylogenetic information than rbcL. This is because it is substantially longer and evolves twice as fast. The 5' region (1380 bp) of ndhF is very different from the 3' region (855 bp) and is similar to rbcL in both the rate and the pattern of sequence change. The 3' region is more A+T-rich, has higher levels of nonsynonymous base substitution, and shows greater transversion bias at all codon positions. These differences probably reflect different functional constraints on the 5' and 3' regions of ndhF. The two patterns of base substitutions of ndhF are particularly advantageous for phylogenetic reconstruction because the conserved and variable segments can be used for older and recent groups, respectively. Phylogenetic analyses of 94 ndhF sequences provided much better resolution of relationships than previous molecular and morphological phylogenies of the Asteraceae. The ndhF tree identified five major clades: (i) the Calyceraceae is the sister family of Asteraceae; (ii) the Barnadesioideae is monophyletic and is the sister group to the rest of the family; (iii) the Cichorioideae and its two basal tribes Mutisieae and Cardueae are paraphyletic; (iv) four tribes of Cichorioideae (Lactuceae, Arctoteae, Liabeae, and Vernonieae) form a monophyletic group, and these are the sister clade of the Asteroideae; and (v) the Asteroideae is monophyletic and includes three major clades.
Resumo:
Ratites are large, flightless birds and include the ostrich, rheas, kiwi, emu, and cassowaries, along with extinct members, such as moa and elephant birds. Previous phylogenetic analyses of complete mitochondrial genome sequences have reinforced the traditional belief that ratites are monophyletic and tinamous are their sister group. However, in these studies ratite monophyly was enforced in the analyses that modeled rate heterogeneity among variable sites. Relaxing this topological constraint results in strong support for the tinamous (which fly) nesting within ratites. Furthermore, upon reducing base compositional bias and partitioning models of sequence evolution among protein codon positions and RNA structures, the tinamou–moa clade grouped with kiwi, emu, and cassowaries to the exclusion of the successively more divergent rheas and ostrich. These relationships are consistent with recent results from a large nuclear data set, whereas our strongly supported finding of a tinamou–moa grouping further resolves palaeognath phylogeny. We infer flight to have been lost among ratites multiple times in temporally close association with the Cretaceous–Tertiary extinction event. This circumvents requirements for transient microcontinents and island chains to explain discordance between ratite phylogeny and patterns of continental breakup. Ostriches may have dispersed to Africa from Eurasia, putting in question the status of ratites as an iconic Gondwanan relict taxon. [Base composition; flightless; Gondwana; mitochondrial genome; Palaeognathae; phylogeny; ratites.]
Resumo:
Growth hormone is a classic molecule in the study of the molecular clock hypothesis as it exhibits a relatively constant rate of evolution in most mammalian orders except primates and artiodactyls, where dramatically enhanced rate of evolution (25-50-fold) has been reported. The rapid evolution of primate growth hormone occurred after the divergence of tarsiers and simians, but before the separation of old world monkeys (OWM) from new world monkeys (NWM). Interestingly, this event of rapid sequence evolution coincided with multiple duplications of the growth hormone gene, suggesting gene duplication as a possible cause of the accelerated sequence evolution. Here we determined 21 different GH-like sequences from four species of OWM and hominoids. Combining with published sequences from OWM and hominoids, our analysis demonstrates that multiple gene duplications and several gene conversion events both occurred in the evolutionary history of this gene family in OWM/hominoids. The episode of recent duplications of CSH-like genes in gibbon is accompanied with rapid sequence evolution likely resulting from relaxation of purifying selection. GHN genes in both hominoids and OWM are under strong purifying selection. In contrast, CSH genes in both lineages are probably not. GHV genes in OWM and hominoids evolved at different evolutionary rates and underwent different selective constraints. Our results disclosed the complex history of the primate growth hormone gene family and raised intriguing questions on the consequences of these evolutionary events. © 2005 Elsevier B.V. All rights reserved.
Resumo:
BACKGROUND: Microsporidia are obligate intracellular, eukaryotic pathogens that infect a wide range of animals from nematodes to humans, and in some cases, protists. The preponderance of evidence as to the origin of the microsporidia reveals a close relationship with the fungi, either within the kingdom or as a sister group to it. Recent phylogenetic studies and gene order analysis suggest that microsporidia share a particularly close evolutionary relationship with the zygomycetes. METHODOLOGY/PRINCIPAL FINDINGS: Here we expanded this analysis and also examined a putative sex-locus for variability between microsporidian populations. Whole genome inspection reveals a unique syntenic gene pair (RPS9-RPL21) present in the vast majority of fungi and the microsporidians but not in other eukaryotic lineages. Two other unique gene fusions (glutamyl-prolyl tRNA synthetase and ubiquitin-ribosomal subunit S30) that are present in metazoans, choanoflagellates, and filasterean opisthokonts are unfused in the fungi and microsporidians. One locus previously found to be conserved in many microsporidian genomes is similar to the sex locus of zygomycetes in gene order and architecture. Both sex-related and sex loci harbor TPT, HMG, and RNA helicase genes forming a syntenic gene cluster. We sequenced and analyzed the sex-related locus in 11 different Encephalitozoon cuniculi isolates and the sibling species E. intestinalis (3 isolates) and E. hellem (1 isolate). There was no evidence for an idiomorphic sex-related locus in this Encephalitozoon species sample. According to sequence-based phylogenetic analyses, the TPT and RNA helicase genes flanking the HMG genes are paralogous rather than orthologous between zygomycetes and microsporidians. CONCLUSION/SIGNIFICANCE: The unique genomic hallmarks between microsporidia and fungi are independent of sequence based phylogenetic comparisons and further contribute to define the borders of the fungal kingdom and support the classification of microsporidia as unusual derived fungi. And the sex/sex-related loci appear to have been subject to frequent gene conversion and translocations in microsporidia and zygomycetes.
Resumo:
The molecular networks regulating the G1-S transition in budding yeast and mammals are strikingly similar in network structure. However, many of the individual proteins performing similar network roles appear to have unrelated amino acid sequences, suggesting either extremely rapid sequence evolution, or true polyphyly of proteins carrying out identical network roles. A yeast/mammal comparison suggests that network topology, and its associated dynamic properties, rather than regulatory proteins themselves may be the most important elements conserved through evolution. However, recent deep phylogenetic studies show that fungal and animal lineages are relatively closely related in the opisthokont branch of eukaryotes. The presence in plants of cell cycle regulators such as Rb, E2F and cyclins A and D, that appear lost in yeast, suggests cell cycle control in the last common ancestor of the eukaryotes was implemented with this set of regulatory proteins. Forward genetics in non-opisthokonts, such as plants or their green algal relatives, will provide direct information on cell cycle control in these organisms, and may elucidate the potentially more complex cell cycle control network of the last common eukaryotic ancestor.
Resumo:
Many bacteria live only within animal cells and infect hosts through cytoplasmic inheritance. These endosymbiotic lineages show distinctive population structure, with small population size and effectively no recombination. As a result, endosymbionts are expected to accumulate mildly deleterious mutations. If these constitute a substantial proportion of new mutations, endosymbionts will show (i) faster sequence evolution and (ii) a possible shift in base composition reflecting mutational bias. Analyses of 16S rDNA of five independently derived endosymbiont clades show, in every case, faster evolution in endosymbionts than in free-living relatives. For aphid endosymbionts (genus Buchnera), coding genes exhibit accelerated evolution and unusually low ratios of synonymous to nonsynonymous substitutions compared to ratios for the same genes for enterics. This concentration of the rate increase in nonsynonymous substitutions is expected under the hypothesis of increased fixation of deleterious mutations. Polypeptides for all Buchnera genes analyzed have accumulated amino acids with codon families rich in A+T, supporting the hypothesis that substitutions are deleterious in terms of polypeptide function. These observations are best explained as the result of Muller's ratchet within small asexual populations, combined with mutational bias. In light of this explanation, two observations reported earlier for Buchnera, the apparent loss of a repair gene and the overproduction of a chaperonin, may reflect compensatory evolution. An alternative hypothesis, involving selection on genomic base composition, is contradicted by the observation that the speedup is concentrated at nonsynonymous sites.
Resumo:
The myc gene family encodes a group of transcription factors that regulate cell proliferation and differentiation. These genes are widely studied because of their importance as proto-oncogenes. Phylogenetic analyses are described here for 45 Myc protein sequences representing c-, N-, L-, S-, and B-myc genes. A gene duplication early in vertebrate evolution produced the c-myc lineage and another lineage that later gave rise to the N- and L-myc lineages by another gene duplication. Evolutionary divergence in the myc gene family corresponds closely to the known branching order of the major vertebrate groups. The patterns of sequence evolution are described for five separate highly conserved regions, and these analyses show that differential rates of sequence divergence (= mosaic evolution) have occurred among conserved motifs. Further, the closely related dimerization partner protein Max exhibits significantly less sequence variability than Myc. It is suggested that the reduced variability in max stems from natural selection acting to preserve dimerization capability with products of myc and related genes.
Resumo:
Context. There is growing evidence that a treatment of binarity amongst OB stars is essential for a full theory of stellar evolution. However the binary properties of massive stars – frequency, mass ratio & orbital separation – are still poorly constrained. Aims. In order to address this shortcoming we have undertaken a multiepoch spectroscopic study of the stellar population of the young massive cluster Westerlund 1. In this paper we present an investigation into the nature of the dusty Wolf-Rayet star and candidate binary W239. Methods. To accomplish this we have utilised our spectroscopic data in conjunction with multi-year optical and near-IR photometric observations in order to search for binary signatures. Comparison of these data to synthetic non-LTE model atmosphere spectra were used to derive the fundamental properties of the WC9 primary. Results. We found W239 to have an orbital period of only ~5.05 days, making it one of the most compact WC binaries yet identified. Analysis of the long term near-IR lightcurve reveals a significant flare between 2004-6. We interpret this as evidence for a third massive stellar component in the system in a long period (>6 yr), eccentric orbit, with dust production occuring at periastron leading to the flare. The presence of a near-IR excess characteristic of hot (~1300 K) dust at every epoch is consistent with the expectation that the subset of persistent dust forming WC stars are short (<1 yr) period binaries, although confirmation will require further observations. Non-LTE model atmosphere analysis of the spectrum reveals the physical properties of the WC9 component to be fully consistent with other Galactic examples. Conclusions. The simultaneous presence of both short period Wolf-Rayet binaries and cool hypergiants within Wd 1 provides compelling evidence for a bifurcation in the post-Main Sequence evolution of massive stars due to binarity. Short period O+OB binaries will evolve directly to the Wolf-Rayet phase, either due to an episode of binary mediated mass loss – likely via case A mass transfer or a contact configuration – or via chemically homogenous evolution. Conversely, long period binaries and single stars will instead undergo a red loop across the HR diagram via a cool hypergiant phase. Future analysis of the full spectroscopic dataset for Wd 1 will constrain the proportion of massive stars experiencing each pathway; hence quantifying the importance of binarity in massive stellar evolution up to and beyond supernova and the resultant production of relativistic remnants.