3 resultados para Bayesian phylogenetic analysis
em Duke University
Resumo:
BACKGROUND: Nonparametric Bayesian techniques have been developed recently to extend the sophistication of factor models, allowing one to infer the number of appropriate factors from the observed data. We consider such techniques for sparse factor analysis, with application to gene-expression data from three virus challenge studies. Particular attention is placed on employing the Beta Process (BP), the Indian Buffet Process (IBP), and related sparseness-promoting techniques to infer a proper number of factors. The posterior density function on the model parameters is computed using Gibbs sampling and variational Bayesian (VB) analysis. RESULTS: Time-evolving gene-expression data are considered for respiratory syncytial virus (RSV), Rhino virus, and influenza, using blood samples from healthy human subjects. These data were acquired in three challenge studies, each executed after receiving institutional review board (IRB) approval from Duke University. Comparisons are made between several alternative means of per-forming nonparametric factor analysis on these data, with comparisons as well to sparse-PCA and Penalized Matrix Decomposition (PMD), closely related non-Bayesian approaches. CONCLUSIONS: Applying the Beta Process to the factor scores, or to the singular values of a pseudo-SVD construction, the proposed algorithms infer the number of factors in gene-expression data. For real data the "true" number of factors is unknown; in our simulations we consider a range of noise variances, and the proposed Bayesian models inferred the number of factors accurately relative to other methods in the literature, such as sparse-PCA and PMD. We have also identified a "pan-viral" factor of importance for each of the three viruses considered in this study. We have identified a set of genes associated with this pan-viral factor, of interest for early detection of such viruses based upon the host response, as quantified via gene-expression data.
Resumo:
Today, the only surviving wild population of giant tortoises in the Indian Ocean occurs on the island of Aldabra. However, giant tortoises once inhabited islands throughout the western Indian Ocean. Madagascar, Africa, and India have all been suggested as possible sources of colonization for these islands. To address the origin of Indian Ocean tortoises (Dipsochelys, formerly Geochelone gigantea), we sequenced the 12S, 16S, and cyt b genes of the mitochondrial DNA. Our phylogenetic analysis shows Dipsochelys to be embedded within the Malagasy lineage, providing evidence that Indian Ocean giant tortoises are derived from a common Malagasy ancestor. This result points to Madagascar as the source of colonization for western Indian Ocean islands by giant tortoises. Tortoises are known to survive long oceanic voyages by floating with ocean currents, and thus, currents flowing northward towards the Aldabra archipelago from the east coast of Madagascar would have provided means for the colonization of western Indian Ocean islands. Additionally, we found an accelerated rate of sequence evolution in the two Malagasy Pyxis species examined. This finding supports previous theories that shorter generation time and smaller body size are related to an increase in mitochondrial DNA substitution rate in vertebrates.
Resumo:
Molecular data have converged on a consensus about the genus-level phylogeny of extant platyrrhine monkeys, but for most extinct taxa and certainly for those older than the Pleistocene we must rely upon morphological evidence from fossils. This raises the question as to how well anatomical data mirror molecular phylogenies and how best to deal with discrepancies between the molecular and morphological data as we seek to extend our phylogenies to the placement of fossil taxa. Here I present parsimony-based phylogenetic analyses of extant and fossil platyrrhines based on an anatomical dataset of 399 dental characters and osteological features of the cranium and postcranium. I sample 16 extant taxa (one from each platyrrhine genus) and 20 extinct taxa of platyrrhines. The tree structure is constrained with a "molecular scaffold" of extant species as implemented in maximum parsimony using PAUP with the molecular-based 'backbone' approach. The data set encompasses most of the known extinct species of platyrrhines, ranging in age from latest Oligocene (∼26 Ma) to the Recent. The tree is rooted with extant catarrhines, and Late Eocene and Early Oligocene African anthropoids. Among the more interesting patterns to emerge are: (1) known early platyrrhines from the Late Oligocene through Early Miocene (26-16.5Ma) represent only stem platyrrhine taxa; (2) representatives of the three living platyrrhine families first occur between 15.7 Ma and 13.5 Ma; and (3) recently extinct primates from the Greater Antilles (Cuba, Jamaica, Hispaniola) are sister to the clade of extant platyrrhines and may have diverged in the Early Miocene. It is probable that the crown platyrrhine clade did not originate before about 20-24 Ma, a conclusion consistent with the phylogenetic analysis of fossil taxa presented here and with recent molecular clock estimates. The following biogeographic scenario is consistent with the phylogenetic findings and climatic and geologic evidence: Tropical South America has been a center for platyrrhine diversification since platyrrhines arrived on the continent in the middle Cenozoic. Platyrrhines dispersed from tropical South America to Patagonia at ∼25-24 Ma via a "Paraná Portal" through eastern South America across a retreating Paranense Sea. Phylogenetic bracketing suggests Antillean primates arrived via a sweepstakes route or island chain from northern South America in the Early Miocene, not via a proposed land bridge or island chain (GAARlandia) in the Early Oligocene (∼34 Ma). Patagonian and Antillean platyrrhines went extinct without leaving living descendants, the former at the end of the Early Miocene and the latter within the past six thousand years. Molecular evidence suggests crown platyrrhines arrived in Central America by crossing an intermittent connection through the Isthmus of Panama at or after 3.5Ma. Any more ancient Central American primates, should they be discovered, are unlikely to have given rise to the extant Central American taxa in situ.