29 results for Comparative genomics
at Duke University
Abstract:
BACKGROUND: The evolutionary relationships of modern birds are among the most challenging to understand in systematic biology and have been debated for centuries. To address this challenge, we assembled or collected the genomes of 48 avian species spanning most orders of birds, including all Neognathae and two of the five Palaeognathae orders, and used the genomes to construct a genome-scale avian phylogenetic tree and perform comparative genomics analyses (Jarvis et al. in press; Zhang et al. in press). Here we release assemblies and datasets associated with the comparative genome analyses, which include 38 newly sequenced avian genomes plus previously released or simultaneously released genomes of Chicken, Zebra finch, Turkey, Pigeon, Peregrine falcon, Duck, Budgerigar, Adelie penguin, Emperor penguin and the Medium Ground Finch. We hope that this resource will serve future efforts in phylogenomics and comparative genomics. FINDINGS: The 38 bird genomes were sequenced using the Illumina HiSeq 2000 platform and assembled using a whole genome shotgun strategy. The 48 genomes were categorized into two groups according to the N50 scaffold size of the assemblies: a high depth group comprising 23 species sequenced at high coverage (>50X) with multiple insert size libraries resulting in N50 scaffold sizes greater than 1 Mb (except the White-throated Tinamou and Bald Eagle); and a low depth group comprising 25 species sequenced at low coverage (~30X) with two insert size libraries resulting in an average N50 scaffold size of about 50 kb. Repetitive elements comprised 4%-22% of the bird genomes. The assembled scaffolds allowed the homology-based annotation of 13,000 to 17,000 protein-coding genes in each avian genome relative to chicken, zebra finch and human, as well as comparative and sequence conservation analyses.
CONCLUSIONS: Here we release full genome assemblies of 38 newly sequenced avian species, link genome assembly downloads for 7 of the remaining 10 species, and provide a guideline of genomic data that has been generated and used in our Avian Phylogenomics Project. To the best of our knowledge, the Avian Phylogenomics Project is the biggest vertebrate comparative genomics project to date. The genomic data presented here is expected to accelerate further analyses in many fields, including phylogenetics, comparative genomics, evolution, neurobiology, developmental biology, and other related areas.
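The abstract above groups assemblies by N50 scaffold size without defining the statistic. As a minimal sketch, assuming the usual definition (the length of the scaffold at which the cumulative length, counted from the longest scaffold down, first reaches half of the total assembly length), N50 could be computed as follows. The function name `n50` and the example lengths are illustrative and do not come from the project's data or pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    Assumes the common definition: sort scaffolds from longest to
    shortest and report the length of the scaffold at which the
    running total first reaches at least half of the assembly size.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare doubled running total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical toy assembly of seven scaffolds (300 units in total).
toy_lengths = [100, 80, 50, 30, 20, 10, 10]
toy_n50 = n50(toy_lengths)
```

With these toy lengths the two longest scaffolds already cover 180 of 300 units, so the sketch reports the second scaffold's length as the N50.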
Abstract:
Improvements in genomic technology, both in the increased speed and reduced cost of sequencing, have expanded the appreciation of the abundance of human genetic variation. However the sheer amount of variation, as well as the varying type and genomic content of variation, poses a challenge in understanding the clinical consequence of a single mutation. This work uses several methodologies to interpret the observed variation in the human genome, and presents novel strategies for the prediction of allele pathogenicity.
Using the zebrafish model system as an in vivo assay of allele function, we identified a novel driver of Bardet-Biedl Syndrome (BBS) in CEP76. A combination of targeted sequencing of 785 cilia-associated genes in a cohort of BBS patients and subsequent in vivo functional assays recapitulating the human phenotype gave strong evidence for the role of CEP76 mutations in the pathology of an affected family. This portion of the work demonstrated the necessity of functional testing in validating disease-associated mutations, and added to the catalogue of known BBS disease genes.
Further study into the role of copy-number variations (CNVs) in a cohort of BBS patients showed the significant contribution of CNVs to disease pathology. Using high-density array comparative genomic hybridization (aCGH) we were able to identify pathogenic CNVs as small as several hundred bp. Dissection of constituent genes and in vivo experiments investigating epistatic interactions between affected genes allowed for an appreciation of several paradigms by which CNVs can contribute to disease. This study revealed that the contribution of CNVs to disease in BBS patients is much higher than previously expected, and demonstrated the need to consider CNV contribution in future (and retrospective) investigations of human genetic disease.
Finally, we used a combination of comparative genomics and in vivo complementation assays to identify second-site compensatory modification of pathogenic alleles. These pathogenic alleles, which are found compensated in other species (termed compensated pathogenic deviations [CPDs]), represent a significant fraction (3% to 10%) of human disease-associated alleles. In silico pathogenicity prediction algorithms, a valuable method of allele prioritization, often misrepresent these alleles as benign, leading to omission of possibly informative variants in studies of human genetic disease. We created a mathematical model that was able to predict CPDs and putative compensatory sites, and functionally showed in vivo that second-site mutation can mitigate the pathogenicity of disease alleles. Additionally, we made publicly available an in silico module for the prediction of CPDs and modifier sites.
These studies have advanced the ability to interpret the pathogenicity of multiple types of human variation and have made tools available for others to do the same.
Abstract:
This is a crucial transition time for human genetics in general, and for HIV host genetics in particular. After years of equivocal results from candidate gene analyses, several genome-wide association studies have been published that looked at plasma viral load or disease progression. Results from other studies that used various large-scale approaches (siRNA screens, transcriptome or proteome analysis, comparative genomics) have also shed new light on retroviral pathogenesis. However, most of the inter-individual variability in response to HIV-1 infection remains to be explained: genome resequencing and systems biology approaches are now required to progress toward a better understanding of the complex interactions between HIV-1 and its human host.
Abstract:
Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants.
Abstract:
Dopamine is an important central nervous system transmitter that functions through two classes of receptors (D1 and D2) to influence a diverse range of biological processes in vertebrates. With roles in regulating neural activity, behavior, and gene expression, there has been great interest in understanding the function and evolution of dopamine and its receptors. In this study, we use a combination of sequence analyses, microsynteny analyses, and phylogenetic relationships to identify and characterize both the D1 (DRD1A, DRD1B, DRD1C, and DRD1E) and D2 (DRD2, DRD3, and DRD4) dopamine receptor gene families in 43 recently sequenced bird genomes representing the major ordinal lineages across the avian family tree. We show that the common ancestor of all birds possessed at least seven D1 and D2 receptors, followed by subsequent independent losses in some lineages of modern birds. Through comparisons with other vertebrate and invertebrate species we show that two of the D1 receptors, DRD1A and DRD1B, and two of the D2 receptors, DRD2 and DRD3, originated from a whole genome duplication event early in the vertebrate lineage, providing the first conclusive evidence of the origin of these highly conserved receptors. Our findings provide insight into the evolutionary development of an important modulatory component of the central nervous system in vertebrates, and will help further unravel the complex evolutionary and functional relationships among dopamine receptors.
Abstract:
PURPOSE: Mammography is known to be one of the most difficult radiographic exams to interpret. Mammography has important limitations, including the superposition of normal tissue that can obscure a mass, chance alignment of normal tissue to mimic a true lesion and the inability to derive volumetric information. It has been shown that stereomammography can overcome these deficiencies by showing that layers of normal tissue lie at different depths. If standard stereomammography (i.e., a single stereoscopic pair consisting of two projection images) can significantly improve lesion detection, how will multiview stereoscopy (MVS), where many projection images are used, compare to mammography? The aim of this study was to assess the relative performance of MVS compared to mammography for breast mass detection. METHODS: The MVS image sets consisted of the 25 raw projection images acquired over an arc of approximately 45 degrees using a Siemens prototype breast tomosynthesis system. The mammograms were acquired using a commercial Siemens FFDM system. The raw data were taken from both of these systems for 27 cases and realistic simulated mass lesions were added to duplicates of the 27 images at the same local contrast. The images with lesions (27 mammography and 27 MVS) and the images without lesions (27 mammography and 27 MVS) were then postprocessed to provide comparable and representative image appearance across the two modalities. All 108 image sets were shown to five full-time breast imaging radiologists in random order on a state-of-the-art stereoscopic display. The observers were asked to give a confidence rating for each image (0 for lesion definitely not present, 100 for lesion definitely present). The ratings were then compiled and processed using ROC and variance analysis. RESULTS: The mean AUC for the five observers was 0.614 +/- 0.055 for mammography and 0.778 +/- 0.052 for multiview stereoscopy. The difference of 0.164 +/- 0.065 was statistically significant with a p-value of 0.0148. CONCLUSIONS: The differences in the AUCs and the p-value suggest that multiview stereoscopy has a statistically significant advantage over mammography in the detection of simulated breast masses. This highlights the dominance of anatomical noise compared to quantum noise for breast mass detection. It also shows that significant lesion detection can be achieved with MVS without any of the artifacts associated with tomosynthesis.
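The abstract reports AUC values derived from 0 to 100 confidence ratings but does not say how the ROC analysis was carried out. As a rough illustration only, and not the study's method, one common nonparametric estimate of the AUC is the Mann-Whitney statistic: the fraction of (lesion-present, lesion-absent) image pairs in which the lesion-present image received the higher rating, with ties counted as one half. The function name and the ratings below are hypothetical.

```python
def auc_from_ratings(present, absent):
    """Nonparametric (Mann-Whitney) estimate of the area under the ROC curve.

    present: confidence ratings given to images that contain a lesion
    absent:  confidence ratings given to images without a lesion
    """
    if not present or not absent:
        raise ValueError("need ratings for both image classes")
    wins = 0.0
    for p in present:
        for a in absent:
            if p > a:
                wins += 1.0   # correctly ordered pair
            elif p == a:
                wins += 0.5   # tie counts as half
    return wins / (len(present) * len(absent))


# Hypothetical ratings from a single observer (not data from the study).
lesion_ratings = [90, 70, 40]
normal_ratings = [60, 40, 10]
observer_auc = auc_from_ratings(lesion_ratings, normal_ratings)
```

An AUC of 0.5 corresponds to chance ordering and 1.0 to perfect separation; averaging such per-observer values gives a mean AUC of the kind quoted above, although the variance analysis mentioned in the abstract is not reproduced here.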
Abstract:
Co-occurrence of HIV and substance abuse is associated with poor outcomes for HIV-related health and substance use. Integration of substance use and medical care holds promise for HIV patients, yet few integrated treatment models have been reported. Most of the reported models lack data on treatment outcomes in diverse settings. This study examined the substance use outcomes of an integrated treatment model for patients with both HIV and substance use at three different clinics. Sites differed by type and degree of integration, with one integrated academic medical center, one co-located academic medical center, and one co-located community health center. Participants (n=286) received integrated substance use and HIV treatment for 12 months and were interviewed at 6-month intervals. We used linear generalized estimating equation regression analysis to examine changes in Addiction Severity Index (ASI) alcohol and drug severity scores. To test whether our treatment was differentially effective across sites, we compared a full model including site by time point interaction terms to a reduced model including only site fixed effects. Alcohol severity scores decreased significantly at 6 and 12 months. Drug severity scores decreased significantly at 12 months. Once baseline severity variation was incorporated into the model, there was no evidence of variation in alcohol or drug score changes by site. Substance use outcomes did not differ by age, gender, income, or race. This integrated treatment model offers an option for treating diverse patients with HIV and substance use in a variety of clinic settings. Studies with control groups are needed to confirm these findings.
Abstract:
BACKGROUND: The nutrient-sensing Tor pathway governs cell growth and is conserved in nearly all eukaryotic organisms from unicellular yeasts to multicellular organisms, including humans. Tor is the target of the immunosuppressive drug rapamycin, which in complex with the prolyl isomerase FKBP12 inhibits Tor functions. Rapamycin is a gold standard drug for organ transplant recipients that was approved by the FDA in 1999 and is finding additional clinical indications as a chemotherapeutic and antiproliferative agent. Capitalizing on the plethora of recently sequenced genomes, we have conducted comparative genomic studies to annotate the Tor pathway throughout the fungal kingdom and related unicellular opisthokonts, including Monosiga brevicollis, Salpingoeca rosetta, and Capsaspora owczarzaki. RESULTS: Interestingly, the Tor signaling cascade is absent in three microsporidian species with available genome sequences, the only known instance of a eukaryotic group lacking this conserved pathway. The microsporidia are obligate intracellular pathogens with highly reduced genomes, and we hypothesize that they lost the Tor pathway as they adapted and streamlined their genomes for intracellular growth in a nutrient-rich environment. Two TOR paralogs are present in several fungal species as a result of either a whole genome duplication or independent gene/segmental duplication events. One such event was identified in the amphibian pathogen Batrachochytrium dendrobatidis, a chytrid responsible for worldwide amphibian declines and extinctions. CONCLUSIONS: The repeated independent duplications of the TOR gene in the fungal kingdom might reflect selective pressure acting upon this kinase that populates two proteinaceous complexes with different cellular roles. These comparative genomic analyses illustrate the evolutionary trajectory of a central nutrient-sensing cascade that enables diverse eukaryotic organisms to respond to their natural environments.
Abstract:
We consider the problem of variable selection in regression modeling in high-dimensional spaces where there is known structure among the covariates. This is an unconventional variable selection problem for two reasons: (1) the dimension of the covariate space is comparable to, and often much larger than, the number of subjects in the study, and (2) the covariate space is highly structured, and in some cases it is desirable to incorporate this structural information into the model building process. We approach this problem through the Bayesian variable selection framework, where we assume that the covariates lie on an undirected graph and formulate an Ising prior on the model space for incorporating structural information. Certain computational and statistical problems arise that are unique to such high-dimensional, structured settings, the most interesting being the phenomenon of phase transitions. We propose theoretical and computational schemes to mitigate these problems. We illustrate our methods on two different graph structures: the linear chain and the regular graph of degree k. Finally, we use our methods to study a specific application in genomics: the modeling of transcription factor binding sites in DNA sequences. © 2010 American Statistical Association.
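For orientation, an Ising prior over variable-inclusion indicators is often written in the following form. This is a generic textbook parametrization given as an assumption; the abstract does not state the exact form used, and the symbols $\gamma$, $a$, $b$, and $E$ are introduced here for illustration only.

```latex
% gamma_i = 1 if covariate i is included in the model, 0 otherwise.
% E is the edge set of the undirected graph on the p covariates.
% a controls overall sparsity; b > 0 rewards including neighbouring covariates together.
P(\gamma) \;\propto\; \exp\!\Big( a \sum_{i=1}^{p} \gamma_i \;+\; b \sum_{(i,j) \in E} \gamma_i \gamma_j \Big),
\qquad \gamma \in \{0,1\}^{p}
```

Under this form, setting $b = 0$ recovers independent inclusion indicators, while large $b$ strongly couples neighbours. That coupling is the kind of regime in which the phase-transition behaviour mentioned in the abstract would be expected to matter.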
Abstract:
BACKGROUND: Previous clinical efficacy trials failed to support the continued development of recombinant gp120 (rgp120) as a candidate HIV vaccine. However, the recent RV144 HIV vaccine trial in Thailand showed that a prime/boost immunization strategy involving priming with canarypox vCP1521 followed by boosting with rgp120 could provide significant, although modest, protection from HIV infection. Based on these results, there is renewed interest in the development of rgp120 based antigens for follow up vaccine trials, where this immunization approach can be applied to other cohorts at high risk for HIV infection. Of particular interest are cohorts in Africa, India, and China that are infected with clade C viruses. METHODOLOGY/PRINCIPAL FINDINGS: A panel of 10 clade C rgp120 envelope proteins was expressed in 293 cells, purified by immunoaffinity chromatography, and used to immunize guinea pigs. The resulting sera were collected and analyzed in checkerboard experiments for rgp120 binding, V3 peptide binding, and CD4 blocking activity. Virus neutralization studies were carried out with two different assays and two different panels of clade C viruses. A high degree of cross reactivity against clade C and clade B viruses and viral proteins was observed. Most, but not all of the immunogens tested elicited antibodies that neutralized tier 1 clade B viruses, and some sera neutralized multiple clade C viruses. Immunization with rgp120 from the CN97001 strain of HIV appeared to elicit higher cross neutralizing antibody titers than the other antigens tested. CONCLUSIONS/SIGNIFICANCE: While all of the clade C antigens tested were immunogenic, some were more effective than others in eliciting virus neutralizing antibodies. Neutralization titers did not correlate with rgp120 binding, V3 peptide binding, or CD4 blocking activity. CN97001 rgp120 elicited the highest level of neutralizing antibodies, and should be considered for further HIV vaccine development studies.
Abstract:
Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution.
Abstract:
Most studies that apply qualitative comparative analysis (QCA) rely on macro-level data, but an increasing number of studies focus on units of analysis at the micro or meso level (i.e., households, firms, protected areas, communities, or local governments). For such studies, qualitative interview data are often the primary source of information. Yet, so far no procedure is available describing how to calibrate qualitative data as fuzzy sets. The authors propose a technique to do so and illustrate it using examples from a study of Guatemalan local governments. By spelling out the details of this important analytic step, the authors aim to contribute to the growing literature on best practice in QCA. © The Author(s) 2012.
Abstract:
Addressing global fisheries overexploitation requires better understanding of how small-scale fishing communities in developing countries limit access to fishing grounds. We analyze how well a system based on individual licenses and a common property-rights regime each generate incentives for self-governance and conservation of fishery resources. Using a qualitative before-after-control-impact approach, we compare two neighbouring fishing communities in the Gulf of California, Mexico. Both were initially governed by the same permit system, are situated in the same ecosystem, use similar harvesting technology, and have overharvested similar species. One community changed to a common property-rights regime, enabling the emergence of access controls and avoiding overexploitation of benthic resources, while the other community still relies on the permit system. We discuss the roles played by power, institutions, socio-historic, and biophysical factors in the development of access controls. © 2012 The Author(s).
Abstract:
Systematic reviews comparing the effectiveness of strategies to prevent, detect, and treat chronic kidney disease are needed to inform patient care. We engaged stakeholders in the chronic kidney disease community to prioritize topics for future comparative effectiveness research systematic reviews. We developed a preliminary list of suggested topics, and stakeholders refined and ranked topics based on their importance. Among 46 topics identified, stakeholders nominated 18 as 'high' priority. Most pertained to strategies to slow disease progression, including: (a) treat proteinuria, (b) improve access to care, (c) treat hypertension, (d) use health information technology, and (e) implement dietary strategies. Most (15 of 18) topics had been previously studied with two or more randomized controlled trials, indicating feasibility of rigorous systematic reviews. Chronic kidney disease topics rated by stakeholders as 'high priority' are varied in scope and may lead to quality systematic reviews impacting practice and policy.