917 resultados para whole genome duplication
Resumo:
Cytosine methylation is important for transposon silencing and epigenetic regulation of endogenous genes, although the extent to which this DNA modification functions to regulate the genome is still unknown. Here we report the first comprehensive DNA methylation map of an entire genome, at 35 base pair resolution, using the flowering plant Arabidopsis thaliana as a model. We find that pericentromeric heterochromatin, repetitive sequences, and regions producing small interfering RNAs are heavily methylated. Unexpectedly, over one-third of expressed genes contain methylation within transcribed regions, whereas only approximately 5% of genes show methylation within promoter regions. Interestingly, genes methylated in transcribed regions are highly expressed and constitutively active, whereas promoter-methylated genes show a greater degree of tissue-specific expression. Whole-genome tiling-array transcriptional profiling of DNA methyltransferase null mutants identified hundreds of genes and intergenic noncoding RNAs with altered expression levels, many of which may be epigenetically controlled by DNA methylation.
Resumo:
Background: Various evolutionary models have been proposed to interpret the fate of paralogous duplicates, which provides substrates on which evolution selection could act. In particular, domestication, as a special selection, has played important role in crop cultivation with divergence of many genes controlling important agronomic traits. Recent studies have indicated that a pair of duplicate genes was often sub-functionalized from their ancestral functions held by the parental genes. We previously demonstrated that the rice cell-wall invertase (CWI) gene GIF1 that plays an important role in the grain-filling process was most likely subjected to domestication selection in the promoter region. Here, we report that GIF1 and another CWI gene OsCIN1 constitute a pair of duplicate genes with differentiated expression and function through independent selection. Results: Through synteny analysis, we show that GIF1 and another cell-wall invertase gene OsCIN1 were paralogues derived from a segmental duplication originated during genome duplication of grasses. Results based on analyses of population genetics and gene phylogenetic tree of 25 cultivars and 25 wild rice sequences demonstrated that OsCIN1 was also artificially selected during rice domestication with a fixed mutation in the coding region, in contrast to GIF1 that was selected in the promoter region. GIF1 and OsCIN1 have evolved into different expression patterns and probable different kinetics parameters of enzymatic activity with the latter displaying less enzymatic activity. Overexpression of GIF1 and OsCIN1 also resulted in different phenotypes, suggesting that OsCIN1 might regulate other unrecognized biological process. Conclusion: How gene duplication and divergence contribute to genetic novelty and morphological adaptation has been an interesting issue to geneticists and biologists. Our discovery that the duplicated pair of GIF1 and OsCIN1 has experiencedsub-functionalization implies that selection could act independently on each duplicate towards different functional specificity, which provides a vivid example for evolution of genetic novelties in a model crop. Our results also further support the established hypothesis that gene duplication with sub-functionalization could be one solution for genetic adaptive conflict.
Resumo:
Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.
Resumo:
Multiple type I interferons (IFNs) have recently been identified in salmonids, containing two or four conserved cysteines. In this work, a novel two-cysteine containing (2C) IFN gene was identified in rainbow trout. This novel trout IFN gene (termed IFN5) formed a phylogenetic group that is distinct from the other three salmonid IFN groups sequenced to date and had a close evolutionary relationship with IFNs from advanced fish species. Our data demonstrate that two subgroups are apparent within each of the 2C and 4C type I IFNs, an evolutionary outcome possibly due to two rounds of genome duplication events that have occurred within teleosts. We have examined gene expression of the trout 2C type I IFN in cultured cells following stimulation with lipopolysaccharide, phytohaemagglutinin, polyI:C or recombinant IFN, or after transfection with polyI:C. The kinetics of gene expression was also studied after viral infection. Analysis of the regulatory elements in the IFN promoter region predicted several binding sites for key transcription factors that potentially play an important role in mediating IFN5 gene expression.
Resumo:
Cyanobacteria are the oldest life form making important contributions to global CO2 fixation on the Earth. Phycobilisomes (PBSs) are the major light harvesting systems of most cyanobacteria species. Recent availability of the whole genome database of cyanobacteria provides us a global and further view on the complex structural PBSs. A PBSs linker family is crucial in structure and function of major light-harvesting PBSs complexes. Linker polypeptides are considered to have the same ancestor with other phycobiliproteins (PBPs), and might have been diverged and evolved under particularly selective forces together. In this paper, a total of 192 putative linkers including 167 putative PBSs-associated linker genes and 25 Ferredoxin-NADP oxidoreductase (FNR) genes were detected through whole genome analysis of all 25 cyanobacterial genomes (20 finished and 5 in draft state). We compared the PBSs linker family of cyanobacteria in terms of gene structure, chromosome location, conservation domain, and polymorphic variants, and discussed the features and functions of the PBSs linker family. Most of PBSs-associated linkers in PBSs linker family are assembled into gene clusters with PBPs. A phylogenetic analysis based on protein data demonstrates a possibility of six classes of the linker family in cyanobacteria. Emergence, divergence, and disappearance of PBSs linkers among cyanobacterial species were due to speciation, gene duplication, gene transfer, or gene loss, and acclimation to various environmental selective pressures especially light.
Resumo:
BACKGROUND: The availability of multiple avian genome sequence assemblies greatly improves our ability to define overall genome organization and reconstruct evolutionary changes. In birds, this has previously been impeded by a near intractable karyotype and relied almost exclusively on comparative molecular cytogenetics of only the largest chromosomes. Here, novel whole genome sequence information from 21 avian genome sequences (most newly assembled) made available on an interactive browser (Evolution Highway) was analyzed. RESULTS: Focusing on the six best-assembled genomes allowed us to assemble a putative karyotype of the dinosaur ancestor for each chromosome. Reconstructing evolutionary events that led to each species' genome organization, we determined that the fastest rate of change occurred in the zebra finch and budgerigar, consistent with rapid speciation events in the Passeriformes and Psittaciformes. Intra- and interchromosomal changes were explained most parsimoniously by a series of inversions and translocations respectively, with breakpoint reuse being commonplace. Analyzing chicken and zebra finch, we found little evidence to support the hypothesis of an association of evolutionary breakpoint regions with recombination hotspots but some evidence to support the hypothesis that microchromosomes largely represent conserved blocks of synteny in the majority of the 21 species analyzed. All but one species showed the expected number of microchromosomal rearrangements predicted by the haploid chromosome count. Ostrich, however, appeared to retain an overall karyotype structure of 2n=80 despite undergoing a large number (26) of hitherto un-described interchromosomal changes. CONCLUSIONS: Results suggest that mechanisms exist to preserve a static overall avian karyotype/genomic structure, including the microchromosomes, with widespread interchromosomal change occurring rarely (e.g., in ostrich and budgerigar lineages). Of the species analyzed, the chicken lineage appeared to have undergone the fewest changes compared to the dinosaur ancestor.
Resumo:
Ferns are one of the few remaining major clades of land plants for which a complete genome sequence is lacking. Knowledge of genome space in ferns will enable broad-scale comparative analyses of land plant genes and genomes, provide insights into genome evolution across green plants, and shed light on genetic and genomic features that characterize ferns, such as their high chromosome numbers and large genome sizes. As part of an initial exploration into fern genome space, we used a whole genome shotgun sequencing approach to obtain low-density coverage (∼0.4X to 2X) for six fern species from the Polypodiales (Ceratopteris, Pteridium, Polypodium, Cystopteris), Cyatheales (Plagiogyria), and Gleicheniales (Dipteris). We explore these data to characterize the proportion of the nuclear genome represented by repetitive sequences (including DNA transposons, retrotransposons, ribosomal DNA, and simple repeats) and protein-coding genes, and to extract chloroplast and mitochondrial genome sequences. Such initial sweeps of fern genomes can provide information useful for selecting a promising candidate fern species for whole genome sequencing. We also describe variation of genomic traits across our sample and highlight some differences and similarities in repeat structure between ferns and seed plants.
Resumo:
BACKGROUND: Parrots belong to a group of behaviorally advanced vertebrates and have an advanced ability of vocal learning relative to other vocal-learning birds. They can imitate human speech, synchronize their body movements to a rhythmic beat, and understand complex concepts of referential meaning to sounds. However, little is known about the genetics of these traits. Elucidating the genetic bases would require whole genome sequencing and a robust assembly of a parrot genome. FINDINGS: We present a genomic resource for the budgerigar, an Australian Parakeet (Melopsittacus undulatus) -- the most widely studied parrot species in neuroscience and behavior. We present genomic sequence data that includes over 300× raw read coverage from multiple sequencing technologies and chromosome optical maps from a single male animal. The reads and optical maps were used to create three hybrid assemblies representing some of the largest genomic scaffolds to date for a bird; two of which were annotated based on similarities to reference sets of non-redundant human, zebra finch and chicken proteins, and budgerigar transcriptome sequence assemblies. The sequence reads for this project were in part generated and used for both the Assemblathon 2 competition and the first de novo assembly of a giga-scale vertebrate genome utilizing PacBio single-molecule sequencing. CONCLUSIONS: Across several quality metrics, these budgerigar assemblies are comparable to or better than the chicken and zebra finch genome assemblies built from traditional Sanger sequencing reads, and are sufficient to analyze regions that are difficult to sequence and assemble, including those not yet assembled in prior bird genomes, and promoter regions of genes differentially regulated in vocal learning brain regions. This work provides valuable data and material for genome technology development and for investigating the genomics of complex behavioral traits.
Resumo:
Propionibacterium acnes is an anaerobic Gram-positive bacterium that forms part of the normal human cutaneous microbiota and is thought to play a central role in acne vulgaris, a chronic inflammatory disease of the pilosebaceous unit (I. Kurokawa et al., Exp. Dermatol. 18:821-832, 2009). Here we present the whole genome sequence of P. acnes type IB strain 6609, which was recovered from a skin sample from a woman with no recorded acne history and is thus considered a nonpathogenic strain (I. Nagy, Microbes Infect. 8:2195-2205, 2006).
Resumo:
Propionibacterium acnes is an anaerobic Gram-positive bacterium that has been linked to a wide range of opportunistic human infections and conditions, most notably acne vulgaris (I. Kurokawa et al., Exp. Dermatol. 18:821-832, 2009). We now present the whole-genome sequences of three P. acnes strains from the type IA(2) cluster which were recovered from ophthalmic infections (A. McDowell et al., Microbiology 157:1990-2003, 2011).
Resumo:
Summary: Genome duplications and polyploidization events are thought to have played relevant roles in the early stages of vertebrate evolution, in particular near the time of divergence of the lamprey lineage. Additional genome duplications, specifically in ray-finned fish, may have occurred before the divergence of the teleosts. The role of polyploidization in vertebrate genome evolution is a thriving area of research. Sturgeons (order Acipenseriformes) provide a unique model for the investigation of genome duplication, with existing species possessing 120, 250 or 360 chromosomes. In the present study, data from 240 sturgeon specimens representing 11 species were used for analysis of ploidy levels. Allele numbers were assessed at eleven microsatellite loci. The results provide further evidence for functional diploidy, tetraploidy and hexaploidy in species possessing 120, 250 and 360 chromosomes, respectively. The analysis also uncovered novel evidence for functional hexaploidy in the shortnose sturgeon (Acipenser brevirostrum). In conclusion, the process of functional genome reduction is demonstrated to be an on-going process in this fish lineage. © 2013 Blackwell Verlag GmbH.
Resumo:
To assess factors influencing the success of whole-genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases or families across a broad spectrum of disorders in whom previous screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritization. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease-causing variants in 21% of cases, with the proportion increasing to 34% (23/68) for mendelian disorders and 57% (8/14) in family trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, although only 4 were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis but also highlight many outstanding challenges.
Resumo:
Therapies that are safe, effective, and not vulnerable to developing resistance are highly desirable to counteract bacterial infections. Host-directed therapeutics is an antimicrobial approach alternative to conventional antibiotics based on perturbing host pathways subverted by pathogens during their life cycle by using host-directed drugs. In this study, we identified and evaluated the efficacy of a panel of host-directed drugs against respiratory infection by nontypeable Haemophilus influenzae (NTHi). NTHi is an opportunistic pathogen that is an important cause of exacerbation of chronic obstructive pulmonary disease (COPD). We screened for host genes differentially expressed upon infection by the clinical isolate NTHi375 by analyzing cell whole-genome expression profiling and identified a repertoire of host target candidates that were pharmacologically modulated. Based on the proposed relationship between NTHi intracellular location and persistence, we hypothesized that drugs perturbing host pathways used by NTHi to enter epithelial cells could have antimicrobial potential against NTHi infection. Interfering drugs were tested for their effects on bacterial and cellular viability, on NTHi-epithelial cell interplay, and on mouse pulmonary infection. Glucocorticoids and statins lacked in vitro and/or in vivo efficacy. Conversely, the sirtuin-1 activator resveratrol showed a bactericidal effect against NTHi, and the PDE4 inhibitor rolipram showed therapeutic efficacy by lowering NTHi375 counts intracellularly and in the lungs of infected mice. PDE4 inhibition is currently prescribed in COPD, and resveratrol is an attractive geroprotector for COPD treatment. Together, these results expand our knowledge of NTHi-triggered host subversion and frame the antimicrobial potential of rolipram and resveratrol against NTHi respiratory infection.
Resumo:
The Neolithic and Bronze Age transitions were profound cultural shifts catalyzed in parts of Europe by migrations, first of early farmers from the Near East and then Bronze Age herders from the Pontic Steppe. However, a decades-long, unresolved controversy is whether population change or cultural adoption occurred at the Atlantic edge, within the British Isles. We address this issue by using the first whole genome data from prehistoric Irish individuals. A Neolithic woman (3343–3020 cal BC) from a megalithic burial (10.3× coverage) possessed a genome of predominantly Near Eastern origin. She had some hunter–gatherer ancestry but belonged to a population of large effective size, suggesting a substantial influx of early farmers to the island. Three Bronze Age individuals from Rathlin Island (2026–1534 cal BC), including one high coverage (10.5×) genome, showed substantial Steppe genetic heritage indicating that the European population upheavals of the third millennium manifested all of the way from southern Siberia to the western ocean. This turnover invites the possibility of accompanying introduction of Indo-European, perhaps early Celtic, language. Irish Bronze Age haplotypic similarity is strongest within modern Irish, Scottish, and Welsh populations, and several important genetic variants that today show maximal or very high frequencies in Ireland appear at this horizon. These include those coding for lactase persistence, blue eye color, Y chromosome R1b haplotypes, and the hemochromatosis C282Y allele; to our knowledge, the first detection of a known Mendelian disease variant in prehistory. These findings together suggest the establishment of central attributes of the Irish genome 4,000 y ago.
Resumo:
With the availability of new generation sequencing technologies, bacterial genome projects have undergone a major boost. Still, chromosome completion needs a costly and time-consuming gap closure, especially when containing highly repetitive elements. However, incomplete genome data may be sufficiently informative to derive the pursued information. For emerging pathogens, i.e. newly identified pathogens, lack of release of genome data during gap closure stage is clearly medically counterproductive. We thus investigated the feasibility of a dirty genome approach, i.e. the release of unfinished genome sequences to develop serological diagnostic tools. We showed that almost the whole genome sequence of the emerging pathogen Parachlamydia acanthamoebae was retrieved even with relatively short reads from Genome Sequencer 20 and Solexa. The bacterial proteome was analyzed to select immunogenic proteins, which were then expressed and used to elaborate the first steps of an ELISA. This work constitutes the proof of principle for a dirty genome approach, i.e. the use of unfinished genome sequences of pathogenic bacteria, coupled with proteomics to rapidly identify new immunogenic proteins useful to develop in the future specific diagnostic tests such as ELISA, immunohistochemistry and direct antigen detection. Although applied here to an emerging pathogen, this combined dirty genome sequencing/proteomic approach may be used for any pathogen for which better diagnostics are needed. These genome sequences may also be very useful to develop DNA based diagnostic tests. All these diagnostic tools will allow further evaluations of the pathogenic potential of this obligate intracellular bacterium.