987 results for Eukaryotic Genomes


Relevance: 20.00%

Abstract:

Next-generation sequencing (NGS) technologies have become the standard for data generation in population genomics studies such as the 1000 Genomes Project (1000G). However, these techniques are known to be problematic when applied to highly polymorphic genomic regions, such as the human leukocyte antigen (HLA) genes. Because accurate genotype calls and allele frequency estimates are crucial to population genomics analyses, it is important to assess the reliability of NGS data. Here, we evaluate the reliability of genotype calls and allele frequency estimates of the single-nucleotide polymorphisms (SNPs) reported by 1000G (phase I) at five HLA genes (HLA-A, -B, -C, -DRB1, and -DQB1). We take advantage of the availability of HLA Sanger sequencing for 930 of the 1092 1000G samples and use it as a gold standard to benchmark the 1000G data. We document that 18.6% of SNP genotype calls in HLA genes are incorrect and that allele frequencies are estimated with an error greater than ±0.1 at approximately 25% of the SNPs in HLA genes. We found a bias toward overestimation of the reference allele frequency in the 1000G data, indicating that mapping bias is an important cause of error in frequency estimation in this dataset. We provide a list of sites that have poor allele frequency estimates and discuss the outcomes of including those sites in different kinds of analyses. Because the HLA region is the most polymorphic in the human genome, our results provide insights into the challenges of using NGS data in other genomic regions of high diversity.
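
The two quantities benchmarked in this abstract, the genotype error rate and the error in the estimated allele frequency, can be illustrated with a minimal Python sketch. It assumes that each sample's genotype at one SNP is available as an unordered pair of alleles from both the NGS calls and the Sanger gold standard. The function names and the toy data are hypothetical and are not taken from the study.

```python
from typing import List, Tuple

Genotype = Tuple[str, str]  # unordered pair of alleles at one SNP (assumed encoding)


def genotype_error_rate(calls: List[Genotype], gold: List[Genotype]) -> float:
    """Fraction of genotype calls that differ from the gold standard."""
    if len(calls) != len(gold) or not gold:
        raise ValueError("calls and gold must be non-empty and the same length")
    # Sorting makes ("A", "G") and ("G", "A") compare as the same genotype.
    mismatches = sum(sorted(c) != sorted(g) for c, g in zip(calls, gold))
    return mismatches / len(gold)


def allele_frequency(genotypes: List[Genotype], allele: str) -> float:
    """Frequency of `allele` among all chromosomes in a diploid sample."""
    if not genotypes:
        raise ValueError("genotypes must be non-empty")
    return sum(g.count(allele) for g in genotypes) / (2 * len(genotypes))


def reference_frequency_bias(
    calls: List[Genotype], gold: List[Genotype], ref_allele: str
) -> float:
    """Estimated minus gold-standard reference allele frequency.

    A positive value means the calls overestimate the reference allele.
    """
    return allele_frequency(calls, ref_allele) - allele_frequency(gold, ref_allele)


# Toy example for a single SNP with reference allele "A" (invented data).
gold = [("A", "G"), ("G", "G"), ("A", "G"), ("A", "A")]
calls = [("A", "A"), ("G", "G"), ("G", "A"), ("A", "A")]
print(genotype_error_rate(calls, gold))            # one of four calls is wrong
print(reference_frequency_bias(calls, gold, "A"))  # positive: reference overestimated
```

In this toy case a single heterozygote miscalled as a reference homozygote is enough to push the frequency error above the ±0.1 threshold mentioned in the abstract.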

Relevance: 20.00%

Abstract:

The increase in publicly available sequencing data has allowed for rapid progress in our understanding of genome composition. As new information becomes available, we should constantly update and reanalyze existing and newly acquired data. In this report, we focus on transposable elements (TEs), which make up a significant portion of nearly all sequenced genomes. Our ability to accurately identify and classify these sequences is critical to understanding their impact on host genomes. At the same time, as we demonstrate in this report, problems with existing classification schemes have led to significant misunderstandings of the evolution of both TE sequences and their host genomes. In a pioneering publication, Finnegan (1989) proposed classifying all TE sequences into two classes based on transposition mechanisms and structural features: the retrotransposons (class I) and the DNA transposons (class II). We have retraced how ideas regarding TE classification and annotation in both prokaryotic and eukaryotic scientific communities have changed over time. This has led us to observe that: (1) a number of TEs have convergent structural features and/or transposition mechanisms that have led to misleading conclusions regarding their classification, (2) the evolution of TEs is similar to that of viruses in having several unrelated origins, and (3) there might be at least 8 classes and 12 orders of TEs, including 10 novel orders. In an effort to address these classification issues, we propose: (1) the outline of a universal TE classification, (2) a set of methods and classification rules that could be used by all scientific communities involved in the study of TEs, and (3) a 5-year schedule for the establishment of an International Committee for Taxonomy of Transposable Elements (ICTTE).

Relevance: 20.00%

Abstract:

Quest for Orthologs (QfO) is a community effort that aims to improve and benchmark orthology predictions. As quality assessment assumes prior knowledge of species phylogenies, we investigated the congruency between existing species trees by comparing the relationships of 147 QfO reference organisms from six Tree of Life (ToL)/species tree projects: the National Center for Biotechnology Information (NCBI) taxonomy, Opentree of Life, the sequenced species/species ToL, the 16S ribosomal RNA (rRNA) database, and trees published by Ciccarelli et al. (Ciccarelli FD, et al. 2006. Toward automatic reconstruction of a highly resolved tree of life. Science 311:1283-1287) and by Huerta-Cepas et al. (Huerta-Cepas J, Marcet-Houben M, Gabaldon T. 2014. A nested phylogenetic reconstruction approach provides scalable resolution in the eukaryotic Tree Of Life. PeerJ PrePrints 2:223). Our study reveals that each species tree suggests a different phylogeny: 87 of the 146 (60%) possible splits of a dichotomous and rooted tree are congruent, while all other splits are incongruent in at least one of the species trees. Topological differences are observed not only at deep speciation events, but also within younger clades, such as Hominidae, Rodentia, Laurasiatheria, or rosids. The evolutionary relationships of 27 archaea and bacteria are highly inconsistent. By assessing 458,108 gene trees from 65 genomes, we show that consistent species topologies are more often supported by gene phylogenies than contradicting ones. The largest concordant species tree includes at most 77 of the QfO reference organisms. Results are summarized in the form of a consensus ToL (http://swisstree.vital-it.ch/species_tree) that can serve different benchmarking purposes.
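
The idea of counting splits that are congruent across several rooted species trees can be sketched as follows. This is only an illustration under simplifying assumptions: trees are written as nested Python tuples of species names, all trees contain the same species, and a split is identified with the set of leaves below an internal node. The function names and the toy trees are hypothetical and do not reproduce the QfO analysis.

```python
from typing import FrozenSet, List, Set, Tuple, Union

# A rooted tree is either a leaf name or a tuple of subtrees (assumed encoding).
Tree = Union[str, Tuple["Tree", ...]]


def clades(tree: Tree) -> Set[FrozenSet[str]]:
    """Return the leaf set below every internal node of a rooted tree."""
    found: Set[FrozenSet[str]] = set()

    def leaves(node: Tree) -> FrozenSet[str]:
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(leaves(child) for child in node))
        found.add(below)
        return below

    leaves(tree)
    return found


def shared_clades(trees: List[Tree]) -> Set[FrozenSet[str]]:
    """Clades that appear in every tree, i.e. splits that are congruent."""
    if not trees:
        raise ValueError("at least one tree is required")
    return set.intersection(*(clades(t) for t in trees))


# Two toy species trees that disagree about the closest relative of "chimp".
tree_a: Tree = ((("human", "chimp"), "mouse"), "yeast")
tree_b: Tree = (("human", ("chimp", "mouse")), "yeast")

congruent = shared_clades([tree_a, tree_b])
print(len(congruent), "of", len(clades(tree_a)), "clades in tree_a are shared")
```

Here the root clade and the clade containing human, chimp, and mouse are shared, while the human and chimp grouping is found only in the first tree.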

Relevance: 20.00%

Abstract:

Background: Information about the composition of regulatory regions is of great value for designing experiments to functionally characterize gene expression. The multiplicity of available applications to predict transcription factor binding sites in a particular locus contrasts with the substantial computational expertise required to use them, which may constitute a barrier for the experimental community. Results: CBS (Conserved regulatory Binding Sites, http://compfly.bio.ub.es/CBS) is a public platform of evolutionarily conserved binding sites and enhancers predicted in multiple Drosophila genomes that is furnished with published chromatin signatures associated with transcriptionally active regions and other experimental sources of information. Rapid access to this novel body of knowledge through a user-friendly web interface enables non-expert users to identify the binding sequences available for any particular gene, transcription factor, or genome region. Conclusions: The CBS platform is a powerful resource that provides tools for data mining individual sequences and groups of co-expressed genes with epigenomics information to conduct regulatory screenings in Drosophila.

Relevance: 20.00%

Abstract:

Horizontal gene transfer is central to microbial evolution because it enables genetic regions to spread horizontally through diverse communities. However, how gene transfer exerts such a strong effect is not understood. Here we develop an eco-evolutionary model and show how genetic transfer, even when rare, can transform the evolution and ecology of microbes. We recapitulate existing models, which suggest that asexual reproduction will overpower horizontal transfer and greatly limit its effects. We then show that allowing immigration completely changes these predictions. With migration, the rates and impacts of horizontal transfer are greatly increased, and transfer is most frequent for loci under positive natural selection. Our analysis explains how ecologically important loci can sweep through competing strains and species. In this way, microbial genomes can evolve to become ecologically diverse, with different genomic regions encoding partially overlapping, but distinct, ecologies. Under these conditions ecological species do not exist, because genes, not species, inhabit niches.

Relevance: 20.00%

Abstract:

Members of the 'Mycoplasma mycoides cluster' represent important livestock pathogens worldwide. Mycoplasma mycoides subsp. mycoides is the etiologic agent of contagious bovine pleuropneumonia (CBPP), which is still endemic in many parts of Africa. We report the genome sequences and annotation of two frequently used challenge strains of Mycoplasma mycoides subsp. mycoides, Afadé and B237. The information provided will enable downstream 'omics' applications such as proteomics, transcriptomics and reverse vaccinology approaches. Despite the absence of Mycoplasma pneumoniae-like cyto-adhesion encoding genes, the two strains showed the presence of protrusions. This phenotype is likely encoded by another set of genes.

Relevance: 20.00%

Abstract:

Genome duplications increase genetic diversity and may facilitate the evolution of gene subfunctions. Little attention, however, has focused on the evolutionary impact of lineage-specific gene loss. Here, we show that identifying lineage-specific gene loss after genome duplication is important for understanding the evolution of gene subfunctions in surviving paralogs and for improving functional connectivity among human and model organism genomes. We examine the general principles of gene loss following duplication, coupled with expression analysis of the retinaldehyde dehydrogenase Aldh1a gene family during retinoic acid signaling in eye development as a case study. Humans have three ALDH1A genes, but teleosts have just one or two. We used comparative genomics and conserved syntenies to identify loss of ohnologs (paralogs derived from genome duplication) and to clarify uncertain phylogenies. Analysis showed that Aldh1a1 and Aldh1a2 form a clade that is sister to Aldh1a3-related genes. Genome comparisons showed secondary loss of aldh1a1 in teleosts, revealing that Aldh1a1 is not a tetrapod innovation, and that aldh1a3 was recently lost in medaka, making it the first known vertebrate with a single aldh1a gene. Interestingly, results revealed asymmetric distribution of surviving ohnologs between co-orthologous teleost chromosome segments, suggesting that local genome architecture can influence ohnolog survival. We propose a model that reconstructs the chromosomal history of the Aldh1a family in the ancestral vertebrate genome, coupled with the evolution of gene functions in surviving Aldh1a ohnologs after R1, R2, and R3 genome duplications. Results provide evidence for early subfunctionalization and late subfunction-partitioning and suggest a mechanistic model based on altered regulation leading to heterochronic gene expression to explain the acquisition or modification of subfunctions by surviving ohnologs that preserve unaltered ancestral developmental programs in the face of gene loss.

Relevance: 20.00%

Abstract:

Based on Darwin's concept of the tree of life, vertical inheritance was thought to be dominant, with mutations, deletions, and duplications shaping the genomes of living organisms. In the current genomic era, increasing data indicate that vertical and lateral gene inheritance interact in space and time to drive genome evolution, particularly among microorganisms sharing a given ecological niche. As a paradigm of diversity and of survival in a variety of cell types, intracellular microorganisms, and notably intracellular bacteria, were considered less prone to lateral genetic exchanges. Such specialized microorganisms generally have a smaller gene repertoire because they rely on their host's factors for some basic regulatory and metabolic functions. Here we review events of lateral gene transfer (LGT) that illustrate the genetic exchanges among intra-amoebal microorganisms or between the microorganism and its amoebal host. We tentatively investigate the functions of laterally transferred genes in light of the interaction with their host, as they should confer a selective advantage and success to the amoeba-resisting microorganisms (ARMs).

Relevance: 20.00%

Abstract:

Mitochondrial genomes (mitogenomes) are useful and relatively accessible sources of molecular data to explore and understand the evolutionary history and relationships of eukaryotic organisms across diverse taxonomic levels. The availability of complete mitogenomes from Platyhelminthes is limited; of the 40 or so published, most are from parasitic flatworms (Neodermata). Here, we present the mitogenomes of two free-living flatworms (Tricladida): the complete genome of the freshwater species Crenobia alpina (Planariidae) and a nearly complete genome of the land planarian Obama sp. (Geoplanidae). Moreover, we have reannotated the published mitogenome of the species Dugesia japonica (Dugesiidae). This contribution almost doubles the total number of mtDNAs published for Tricladida, a species-rich group including model organisms and economically important invasive species. We took the opportunity to conduct comparative mitogenomic analyses between available free-living and selected parasitic flatworms in order to gain insights into the putative effect of life cycle on nucleotide composition through mutation and natural selection. Unexpectedly, we did not find any molecular hallmark of selective relaxation in the mitogenomes of parasitic flatworms; on the contrary, three out of the four studied free-living triclad mitogenomes exhibit higher A+T content and selective relaxation levels. Additionally, we provide new and valuable molecular data to develop markers for future phylogenetic studies on planariids and geoplanids.
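
A+T content, one of the nucleotide composition measures compared in this abstract, is simple to compute from a sequence string. The sketch below is a minimal illustration, assuming the sequence is given as plain text and that ambiguous symbols such as N should be ignored. The function name `at_content` and the example sequence are hypothetical.

```python
def at_content(sequence: str) -> float:
    """Return the fraction of A and T among the unambiguous bases (A, C, G, T)."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(b in "AT" for b in bases) / len(bases)


# Invented fragment; lower-case letters and N are handled as described above.
fragment = "ATGCatNN"
print(f"A+T content: {at_content(fragment):.3f}")
```

Ignoring ambiguous bases is a design choice made for this sketch; other conventions, such as counting them in the denominator, give slightly different values.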

Relevance: 20.00%

Abstract:

Organismic-centered Darwinism, in order to use direct phenotypes to measure natural selection's effect, requires genome harmony and uniform coherence, plus large population sizes. However, modern gene-centered Darwinism has found new interpretations of data that speak of genomic incoherence and disharmony. As a result of these two conflicting positions, a conceptual crisis in biology has arisen. My position is that the presence of small, even pocket-size, demes is instrumental in generating divergence and phenotypic crisis. Moreover, the presence of parasitic genomes, as in acanthocephalan worms, which even manipulate suicidal behavior in their hosts; segregation distorters that change meiosis and Mendelian ratios; selfish genes and selfish whole chromosomes, such as B-chromosomes in grasshoppers; P-elements in Drosophila; driving Y-chromosomes that manipulate sex ratios, making males more frequent, as in Hamilton's X-linked drive; and male strategists and outlaw genes are eloquent examples of the presence of real conflicting genomes and of a non-uniform phenotypic coherence and genome harmony. Thus, we are proposing that overall incoherence and disharmony generate disorder but also more biodiversity and creativeness. Finally, if genes can manipulate natural selection, they can multiply mutations or undesirable characteristics, even lethal or detrimental ones, hence the accumulation of genetic loads. Outlaw genes can change what is adaptively convenient, even in a direction that moves the trait away from the optimum. The optimum can be "negotiated" among the variants, not only because pleiotropic effects demand it, but also, in some cases, because selfish, outlaw, P-elements or extended phenotypic manipulation require it. With organismic Darwinism, the genome in the population and in the individual was thought to act harmoniously without conflicts, and genotypes were thought to march towards greater adaptability. Modern Darwinism has a gene-centered vision in which genes, as natural selection's objects, can move in dissonance in the direction that benefits their multiplication. Thus, we have greater opportunities for genomes in permanent conflict.

Relevance: 20.00%

Abstract:

Preference for specific protein substrates together with differential sensitivity to activators and inhibitors has allowed classification of serine/threonine protein phosphatases (PPs) into four major types designated types 1, 2A, 2B and 2C (PP1, PP2A, PP2B and PP2C, respectively). Comparison of sequences within their catalytic domains has indicated that PP1, PP2A and PP2B are members of the same gene family named PPP. On the other hand, the type 2C enzyme does not share sequence homology with the PPP members and thus represents another gene family, known as PPM. In this report we briefly summarize some of our studies about the role of serine/threonine phosphatases in growth and differentiation of three different eukaryotic models: Blastocladiella emersonii, Neurospora crassa and Dictyostelium discoideum. Our observations suggest that PP2C is the major phosphatase responsible for dephosphorylation of amidotransferase, an enzyme that controls cell wall synthesis during Blastocladiella emersonii zoospore germination. We also report the existence of a novel acid- and thermo-stable protein purified from Neurospora crassa mycelia, which specifically inhibits the PP1 activity of this fungus and mammals. Finally, we comment on our recent results demonstrating that Dictyostelium discoideum expresses a gene that codes for PP1, although this activity has never been demonstrated biochemically in this organism.

Relevance: 20.00%

Abstract:

Elongation factor 1A is a highly conserved protein that participates in translation. We report the occurrence of two genes homologous to the eukaryotic Elongation Factor 1A in Bradysia hygida and describe the partial cloning and characterization of the B. hygida eukaryotic Elongation Factor 1A-F1 (BheEF1A-F1) gene. The pattern of BheEF1A-F1 expression in the salivary gland at the end of the fourth larval instar was investigated using real-time PCR. The results showed that BheEF1A-F1 expression levels are relatively constant at the time when rapid changes in protein synthesis occur in this tissue. In situ hybridization experiments coupled to Southern blot analyses showed that the BheEF1A-F1 gene is located at position 3d of the A chromosome and a second gene homologous to eEF1A is located at position 6a of the X chromosome. Southern blot analyses showed that both the BheEF1A-F1 gene and the second gene homologous to eEF1A constitute non-amplified genes. The present results contribute to the molecular characterization of a sciarid eEF1A gene.

Relevance: 20.00%

Abstract:

The biological functions of the BC047440 gene, which is highly expressed in hepatocellular carcinoma (HCC), are unknown. The objective of this study was to construct antisense eukaryotic expression vectors of the gene for inhibiting HepG2 cell proliferation and suppressing their xenograft tumorigenicity. The full-length BC047440 cDNA was cloned from human primary HCC by RT-PCR. BC047440 gene fragments were ligated with pMD18-T simple vectors and subsequently with pcDNA3.1(+) plasmids to construct the recombinant antisense eukaryotic vector pcDNA3.1(+)BC047440AS. The endogenous BC047440 mRNA abundance in target gene-transfected, vector-transfected and naive HepG2 cells was semiquantitatively analyzed by RT-PCR, and cell proliferation was measured by the MTT assay. Cell cycle distribution and apoptosis were profiled by flow cytometry. The in vivo xenograft experiment was performed on nude mice to examine the effects of the antisense vector on tumorigenicity. BC047440 cDNA fragments were inserted in reverse orientation into pcDNA3.1(+) plasmids. The antisense vector significantly reduced the endogenous BC047440 mRNA abundance by 41% in HepG2 cells and inhibited their proliferation in vitro (P < 0.01). More cells were arrested by the antisense vector at the G1 phase in an apoptosis-independent manner (P = 0.014). Additionally, transfection with pcDNA3.1(+)BC047440AS significantly reduced the xenograft tumorigenicity in nude mice. As a novel cell cycle regulator associated with HCC, the BC047440 gene was involved in cell proliferation in vitro and xenograft tumorigenicity in vivo through apoptosis-independent mechanisms.

Relevance: 20.00%

Abstract:

Lichens are symbiotic organisms consisting of a fungal partner and a photosynthetic partner, which can be either an alga or a cyanobacterium. In some lichen species the symbiosis is tripartite, with the relationship including both an alga and a cyanobacterium alongside the primary symbiont, the fungus. The lichen symbiosis is an evolutionarily old adaptation to life on land, and many extant fungal species have evolved from lichenised ancestors. Lichens inhabit a wide range of habitats and are capable of living in harsh environments and on nutrient-poor substrates, such as bare rocks, often enduring frequent cycles of drying and wetting. Most lichen species are desiccation tolerant: they can survive long periods of dehydration and rapidly resume photosynthesis upon rehydration. The molecular mechanisms behind lichen desiccation tolerance are still largely uncharacterised, and little information is available for any lichen species at the genomic or transcriptomic level. The emergence of high-throughput next-generation sequencing (NGS) technologies and the subsequent decrease in the cost of sequencing new genomes and transcriptomes have enabled non-model organism research at the whole-genome level. In this doctoral work, the transcriptome and genome of the grey reindeer lichen, Cladonia rangiferina, were sequenced, de novo assembled and characterised using NGS and traditional expressed sequence tag (EST) technologies. RNA extraction methods were optimised to improve the yield and quality of RNA extracted from lichen tissue. The effects of rehydration and desiccation on C. rangiferina gene expression at the whole-transcriptome level were studied, and the most differentially expressed genes were identified. The secondary metabolites present in C. rangiferina decreased the quality (integrity, optical characteristics and utility for sensitive molecular biological applications) of the extracted RNA, requiring an optimised RNA extraction method for isolating sufficient quantities of high-quality RNA from lichen tissue in a time- and cost-efficient manner. The de novo assembly of the transcriptome of C. rangiferina was used to produce a set of contiguous unigene sequences that were used to investigate the biological functions and pathways active in a hydrated lichen thallus. The de novo assembly of the genome yielded an assembly containing mostly genes derived from the fungal partner. The assembly was of sufficient quality, was similar in size to other lichen-forming fungal genomes and included most of the core eukaryotic genes. Differences in gene expression were detected in all studied stages of desiccation and rehydration, but the largest changes occurred during the early stages of rehydration. The most differentially expressed genes did not have any annotations, making them potentially lichen-specific genes, but several genes known to participate in environmental stress tolerance in other organisms were also identified as differentially expressed.