116 results for Transcriptomes
Abstract:
Searching for matches between large collections of short (14-30 nucleotides) words and sequence databases comprising full genomes or transcriptomes is a common task in biological sequence analysis. We investigated the performance of simple indexing strategies for handling such tasks and developed two programs, fetchGWI and tagger, that index either the database or the query set. Either strategy outperforms megablast for searches with more than 10,000 probes. FetchGWI is shown to be a versatile tool for rapidly searching multiple genomes, whose performance is limited in most cases by the speed of access to the filesystem. We have made publicly available a Web interface for searching the human, mouse, and several other genomes and transcriptomes with oligonucleotide queries.
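The database-indexing strategy described above can be illustrated with a minimal sketch. This is not the fetchGWI implementation: the function names, the toy sequences, and the fixed word length `k` are assumptions made for illustration, and only exact matches of probes that are exactly `k` nucleotides long are handled.

```python
from collections import defaultdict


def build_index(database, k):
    """Map every k-length word in the database to its (sequence id, offset) hits."""
    index = defaultdict(list)
    for seq_id, seq in database.items():
        for pos in range(len(seq) - k + 1):
            index[seq[pos:pos + k]].append((seq_id, pos))
    return index


def search(index, probes):
    """Look up each probe in the index; probes without a match map to an empty list."""
    return {probe: index.get(probe, []) for probe in probes}


# Hypothetical toy data; real databases hold full genomes or transcriptomes.
database = {"chr_demo": "ACGTACGTTAGC", "tx_demo": "TTAGCACGTA"}
index = build_index(database, k=5)
hits = search(index, ["ACGTA", "TTAGC", "GGGGG"])
print(hits)
```

Building the index costs one pass over the database, after which each probe is a single dictionary lookup. This is one plausible reason an indexed search can overtake an alignment-based tool once the probe set grows large.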
Abstract:
Aldosterone and vasopressin are responsible for the final adjustment of sodium and water reabsorption in the kidney. In principal cells of the kidney cortical collecting duct (CCD), the integral response to aldosterone and the long-term functional effects of vasopressin depend on transcription. In this study, we analyzed the transcriptome of a highly differentiated mouse clonal CCD principal cell line (mpkCCD(cl4)) and the changes in the transcriptome induced by aldosterone and vasopressin. Serial analysis of gene expression (SAGE) was performed on untreated cells and on cells treated with either aldosterone or vasopressin for 4 h. The transcriptomes in these three experimental conditions were determined by sequencing 169,721 transcript tags from the corresponding SAGE libraries. Limiting the analysis to tags that occurred twice or more in the data set, 14,654 different transcripts were identified, 3,642 of which do not match known mouse sequences. Statistical comparison (at P < 0.05 level) of the three SAGE libraries revealed 34 AITs (aldosterone-induced transcripts), 29 ARTs (aldosterone-repressed transcripts), 48 VITs (vasopressin-induced transcripts) and 11 VRTs (vasopressin-repressed transcripts). A selection of the differentially-expressed, hormone-specific transcripts (5 VITs, 2 AITs and 1 ART) has been validated in the mpkCCD(cl4) cell line either by Northern blot hybridization or reverse transcription-PCR. The hepatocyte nuclear transcription factor HNF-3-alpha (VIT39), the receptor activity modifying protein RAMP3 (VIT48), and the glucocorticoid-induced leucine zipper protein (GILZ) (AIT28) are candidate proteins playing a role in physiological responses of this cell line to vasopressin and aldosterone.
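The filtering step mentioned above (keeping only tags that occur twice or more in the data set) could look roughly like the following sketch. The tag strings and the function name are hypothetical; the actual SAGE processing pipeline is not described in the abstract.

```python
from collections import Counter


def recurrent_tags(tags, min_count=2):
    """Count tag occurrences and keep only tags seen at least min_count times."""
    counts = Counter(tags)
    return {tag: n for tag, n in counts.items() if n >= min_count}


# Hypothetical tags pooled from all libraries; singletons are dropped.
demo_tags = ["CATGAAAC", "CATGAAAC", "CATGTTTG",
             "CATGCCCA", "CATGCCCA", "CATGCCCA"]
kept = recurrent_tags(demo_tags)
print(kept)
```

Dropping singletons is a common way to reduce the influence of sequencing errors, at the cost of losing genuinely rare transcripts.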
Abstract:
Staphylococcus aureus infections involve numerous adhesins and toxins, whose expression depends on complex regulatory networks. Adhesins include a family of surface proteins covalently attached to the peptidoglycan via a conserved LPXTG motif. Here we determined the protein and mRNA expression of LPXTG-proteins of S. aureus Newman in time-course experiments, and their relation to fibrinogen adherence in vitro. Experiments were performed with mutants in the global accessory-gene regulator (agr), surface protein A (Spa), and fibrinogen-binding protein A (ClfA), as well as during growth in iron-rich or iron-poor media. Surface proteins were recovered by trypsin-shaving of live bacteria. Released peptides were analyzed by liquid chromatography coupled to tandem mass-spectrometry. To unambiguously identify peptides unique to LPXTG-proteins, the analytical conditions were refined using a reference library of S. aureus LPXTG-proteins heterogeneously expressed in surrogate Lactococcus lactis. Transcriptomes were determined by microarrays. Sixteen of the 18 LPXTG-proteins present in S. aureus Newman were detected by proteomics. Nine LPXTG-proteins showed a bell-shaped agr-like expression that was abrogated in agr-negative mutants, including Spa, fibronectin-binding protein A (FnBPA), ClfA, iron-binding IsdA and IsdB, immunomodulator SasH, functionally uncharacterized SasD, biofilm-related SasG and methicillin resistance-related FmtB. However, only Spa and SasH modified their proteomic and mRNA profiles in parallel in the parent and its agr- mutant, whereas all other LPXTG-proteins modified their proteomic profiles independently of their mRNA. Moreover, ClfA became highly transcribed and active in fibrinogen-adherence tests during late growth (24 h), whereas it remained poorly detected by proteomics. On the other hand, iron-regulated IsdA-B-C increased their protein expression more than 10-fold in iron-poor conditions.
Thus, proteomic, transcriptomic, and adherence-phenotype demonstrated differential profiles in S. aureus. Moreover, trypsin peptide signatures suggested differential protein domain exposures in various environments, which might be relevant for anti-adhesin vaccines. A comprehensive understanding of the S. aureus physiology should integrate all three approaches.
Abstract:
A simple way to quickly optimize microsatellites in nonmodel organisms is to reuse loci available in closely related taxa; however, this approach can be limited by the stochastic and low cross-amplification success experienced in some groups (e.g. amphibians). An efficient alternative is to develop loci from transcriptome sequences. Transcriptomic microsatellites have been found to vary in their levels of cross-species amplification and variability, but this has to date never been tested in amphibians. Here, we compare the patterns of cross-amplification and levels of polymorphism of 18 published anonymous microsatellites isolated from genomic DNA vs. 17 loci derived from a transcriptome, across nine species of tree frogs (Hyla arborea and Hyla cinerea group). We established a clear negative relationship between divergence time and amplification success, which was much steeper for anonymous than transcriptomic markers, with half-lives (time at which 50% of the markers still amplify) of 1.1 and 37 My, respectively. Transcriptomic markers are significantly less polymorphic than anonymous loci, but remain variable across diverged taxa. We conclude that the exploitation of amphibian transcriptomes for developing microsatellites seems an optimal approach for multispecies surveys (e.g. analyses of hybrid zones, comparative linkage mapping), whereas anonymous microsatellites may be more informative for fine-scale analyses of intraspecific variation. Moreover, our results confirm the pattern that microsatellite cross-amplification is greatly variable among amphibians and should be assessed independently within target lineages. Finally, we provide a bank of microsatellites for Palaearctic tree frogs (so far only available for H. arborea), which will be useful for conservation and evolutionary studies in this radiation.
Abstract:
BACKGROUND: Understanding how alternative phenotypes arise from the same genome is a major challenge in modern biology. Eusociality in insects requires the evolution of two alternative phenotypes - workers, who sacrifice personal reproduction, and queens, who realize that reproduction. Extensive work on honeybees and ants has revealed the molecular basis of derived queen and worker phenotypes in highly eusocial lineages, but we lack equivalent deep-level analyses of wasps and of primitively eusocial species, the latter of which can reveal how phenotypic decoupling first occurs in the early stages of eusocial evolution. RESULTS: We sequenced 20 Gbp of transcriptomes derived from brains of different behavioral castes of the primitively eusocial tropical paper wasp Polistes canadensis. Surprisingly, 75% of the 2,442 genes differentially expressed between phenotypes were novel, having no significant homology with described sequences. Moreover, 90% of these novel genes were significantly upregulated in workers relative to queens. Differential expression of novel genes in the early stages of sociality may be important in facilitating the evolution of worker behavioral complexity in eusocial evolution. We also found surprisingly low correlation in the identity and direction of expression of differentially expressed genes across similar phenotypes in different social lineages, supporting the idea that social evolution in different lineages requires substantial de novo rewiring of molecular pathways. CONCLUSIONS: These genomic resources for aculeate wasps and first transcriptome-wide insights into the origin of castes bring us closer to a more general understanding of eusocial evolution and how phenotypic diversity arises from the same genome.
Abstract:
Alternative splicing (AS) has the potential to greatly expand the functional repertoire of mammalian transcriptomes. However, few variant transcripts have been characterized functionally, making it difficult to assess the contribution of AS to the generation of phenotypic complexity and to study the evolution of splicing patterns. We have compared the AS of 309 protein-coding genes in the human ENCODE pilot regions against their mouse orthologs in unprecedented detail, utilizing traditional transcriptomic and RNAseq data. The conservation status of every transcript has been investigated, and each functionally categorized as coding (separated into coding sequence [CDS] or nonsense-mediated decay [NMD] linked) or noncoding. In total, 36.7% of human and 19.3% of mouse coding transcripts are species specific, and we observe a 3.6-fold excess of human NMD transcripts compared with mouse; in contrast to previous studies, the majority of species-specific AS is unlinked to transposable elements. We observe one conserved CDS variant and one conserved NMD variant per 2.3 and 11.4 genes, respectively. Subsequently, we identify and characterize equivalent AS patterns for 22.9% of these CDS or NMD-linked events in nonmammalian vertebrate genomes, and our data indicate that functional NMD-linked AS is more widespread and ancient than previously thought. Furthermore, although we observe an association between conserved AS and elevated sequence conservation, as previously reported, we emphasize that 30% of conserved AS exons display sequence conservation below the average score for constitutive exons. In conclusion, we demonstrate the value of detailed comparative annotation in generating a comprehensive set of AS transcripts, increasing our understanding of AS evolution in vertebrates. Our data support a model whereby the acquisition of functional AS has occurred throughout vertebrate evolution and is considered alongside amino acid change as a key mechanism in gene evolution.
Abstract:
BACKGROUND: Modern sequencing technologies have massively increased the amount of data available for comparative genomics. Whole-transcriptome shotgun sequencing (RNA-seq) provides a powerful basis for comparative studies. In particular, this approach holds great promise for emerging model species in fields such as evolutionary developmental biology (evo-devo). RESULTS: We have sequenced early embryonic transcriptomes of two non-drosophilid dipteran species: the moth midge Clogmia albipunctata, and the scuttle fly Megaselia abdita. Our analysis includes a third, published, transcriptome for the hoverfly Episyrphus balteatus. These emerging models for comparative developmental studies close an important phylogenetic gap between Drosophila melanogaster and other insect model systems. In this paper, we provide a comparative analysis of early embryonic transcriptomes across species, and use our data for a phylogenomic re-evaluation of dipteran phylogenetic relationships. CONCLUSIONS: We show how comparative transcriptomics can be used to create useful resources for evo-devo, and to investigate phylogenetic relationships. Our results demonstrate that de novo assembly of short (Illumina) reads yields high-quality, high-coverage transcriptomic data sets. We use these data to investigate deep dipteran phylogenetic relationships. Our results, based on a concatenation of 160 orthologous genes, provide support for the traditional view of Clogmia being the sister group of Brachycera (Megaselia, Episyrphus, Drosophila), rather than that of Culicomorpha (which includes mosquitoes and blackflies).
Abstract:
Phylogenetic trees representing the evolutionary relationships of homologous genes are the entry point for many evolutionary analyses. For instance, the use of a phylogenetic tree can aid in the inference of orthology and paralogy relationships, and in the detection of relevant evolutionary events such as gene family expansions and contractions, horizontal gene transfer, recombination or incomplete lineage sorting. Similarly, given the plurality of evolutionary histories among genes encoded in a given genome, there is a need for the combined analysis of genome-wide collections of phylogenetic trees (phylomes). Here, we introduce a new release of PhylomeDB (http://phylomedb.org), a public repository of phylomes. Currently, PhylomeDB hosts 120 public phylomes, comprising >1.5 million maximum likelihood trees and multiple sequence alignments. In the current release, phylogenetic trees are annotated with taxonomic, protein-domain arrangement, functional and evolutionary information. PhylomeDB is also a major source for phylogeny-based predictions of orthology and paralogy, covering >10 million proteins across 1059 sequenced species. Here we describe newly implemented PhylomeDB features, and discuss a benchmark of the orthology predictions provided by the database, the impact of proteome updates and the use of the phylome approach in the analysis of newly sequenced genomes and transcriptomes.
Abstract:
Root systems consist of different root types (RTs) with distinct developmental and functional characteristics. RTs may be individually reprogrammed in response to their microenvironment to maximize adaptive plasticity. Molecular understanding of such specific remodeling, although crucial for crop improvement, is limited. Here, RT-specific transcriptomes of adult rice crown, large and fine lateral roots were assessed, revealing molecular evidence for functional diversity among individual RTs. Of the three rice RTs, crown roots displayed a significant enrichment of transcripts associated with phytohormones and secondary cell wall (SCW) metabolism, whereas lateral RTs showed a greater accumulation of transcripts related to mineral transport. In nature, arbuscular mycorrhizal (AM) symbiosis represents the default state of most root systems and is known to modify root system architecture. Rice RTs become heterogeneously colonized by AM fungi, with large laterals preferentially entering into the association. However, RT-specific transcriptional responses to AM symbiosis were quantitatively most pronounced for crown roots despite their modest physical engagement in the interaction. Furthermore, colonized crown roots adopted an expression profile more related to mycorrhizal large lateral than to noncolonized crown roots, suggesting a fundamental reprogramming of crown root character. Among these changes, a significant reduction in SCW transcripts was observed that was correlated with an alteration of SCW composition as determined by mass spectrometry. The combined change in SCW, hormone- and transport-related transcript profiles across the RTs indicates a previously overlooked switch of functional relationships among RTs during AM symbiosis, with a potential impact on root system architecture and functioning.
Abstract:
OBJECTIVE: To develop predictive models for early triage of burn patients based on hypersusceptibility to repeated infections. BACKGROUND: Infection remains a major cause of mortality and morbidity after severe trauma, demanding new strategies to combat infections. Models for infection prediction are lacking. METHODS: Secondary analysis of 459 burn patients (≥16 years old) with 20% or more total body surface area burns recruited from 6 US burn centers. We compared blood transcriptomes with a 180-hour cutoff on the injury-to-transcriptome interval of 47 patients (≤1 infection episode) to those of 66 hypersusceptible patients [multiple (≥2) infection episodes (MIE)]. We used LASSO regression to select biomarkers and multivariate logistic regression to build models, the accuracy of which was assessed by area under the receiver operating characteristic curve (AUROC) and cross-validation. RESULTS: Three predictive models were developed using covariates of (1) clinical characteristics; (2) expression profiles of 14 genomic probes; (3) combining (1) and (2). The genomic and clinical models were highly predictive of MIE status [AUROC_Genomic = 0.946 (95% CI: 0.906-0.986); AUROC_Clinical = 0.864 (CI: 0.794-0.933); AUROC_Genomic/AUROC_Clinical P = 0.044]. The combined model has an increased AUROC_Combined of 0.967 (CI: 0.940-0.993) compared with the individual models (AUROC_Combined/AUROC_Clinical P = 0.0069). Hypersusceptible patients show early alterations in immune-related signaling pathways, epigenetic modulation, and chromatin remodeling. CONCLUSIONS: Early triage of burn patients more susceptible to infections can be made using clinical characteristics and/or genomic signatures. The genomic signature suggests that new insights into the pathophysiology of hypersusceptibility to infection may lead to novel therapeutic or prophylactic targets.
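For readers unfamiliar with the accuracy metric used above, the following sketch shows one standard way to compute an AUROC: as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The labels and scores are invented for illustration and are not data from the study; the function name is likewise an assumption.

```python
def auroc(labels, scores):
    """Rank-based AUROC: the fraction of (positive, negative) pairs in which
    the positive case has the higher score; tied pairs count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical example: 1 = multiple infection episodes, 0 = at most one.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
print(auroc(labels, scores))
```

An AUROC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which gives a scale for reading the reported values of 0.864 to 0.967.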
Abstract:
About 50% of living species are holometabolan insects. Therefore, unraveling the origin of insect metamorphosis from the hemimetabolan (gradual metamorphosis) to the holometabolan (sudden metamorphosis at the end of the life cycle) mode is equivalent to explaining how all this biodiversity originated. One of the problems with studying the evolution from hemimetaboly to holometaboly is that most information is available only in holometabolan species. Within the hemimetabolan group, our model, the cockroach Blattella germanica, is the most studied species. However, given that the study of adult morphogenesis at the organismic level is still complex, we focused on the study of the tergal gland (TG) as a minimal model of metamorphosis. The TG is formed in tergites 7 and 8 (T7-8) in the last days of the last nymphal instar (nymph 6). The comparative study of four T7-T8 transcriptomes provided us with crucial keys to TG formation, but also essential information about the mechanisms and circuitry that allow the shift from nymphal to adult morphogenesis.
Abstract:
Male germ cell differentiation, spermatogenesis, is an exceptional developmental process that produces a massive amount of genetically unique spermatozoa. The complexity of this process along with the technical limitations in the germline research has left many aspects of spermatogenesis poorly understood. Post-meiotic haploid round spermatids possess the most complex transcriptomes of the whole body. Correspondingly, efficient and accurate control mechanisms are necessary to deal with the huge diversity of transcribed RNAs in these cells. The high transcriptional activity in round spermatids is accompanied by the presence of an uncommonly large cytoplasmic ribonucleoprotein granule, called the chromatoid body (CB), that is conjectured to participate in the RNA post-transcriptional regulation. However, very little is known about the possible mechanisms of the CB function. The development of a procedure to isolate CBs from mouse testes was this study’s objective. Anti-MVH immunoprecipitation of cross-linked CBs from a fractionated testicular cell lysate was optimized to yield considerable quantities of pure and intact CBs from mouse testes. This protocol produced reliable and reproducible data from the subsequent analysis of CB’s protein and RNA components. We found that the majority of the CB’s proteome consists of RNA-binding proteins that associate functionally with different pathways. We also demonstrated notable localization patterns of one of the CB transient components, SAM68, and showed that its ablation does not change the general composition or structure of the CB. CB-associated RNA analysis revealed a strong accumulation of PIWI-interacting RNAs (piRNAs), mRNAs and long non-coding RNAs (lncRNAs) in the CB. When the CB transcriptome and proteome analysis results were combined, the most pronounced molecular functions in the CB were related to the piRNA pathway, RNA post-transcriptional processing and CB structural scaffolding.
In addition, we demonstrated that the CB is a target for the main RNA flux from the nucleus throughout all steps of round spermatid development. Moreover, we provided preliminary evidence that those isolated CBs slice target RNAs in vitro in an ATP-dependent manner. Altogether, these results strongly suggest that the CB functions involve RNA-related and RNA-mediated mechanisms. All the existing data support the hypothesis that the CB coordinates the highly complex haploid transcriptome during the preparation of the male gametes for fertilization. Thereby, this study provides a fundamental basis for the future functional analyses of ribonucleoprotein granules and also offers important insights into the mechanisms governing male fertility.
Abstract:
Lichens are symbiotic organisms, which consist of the fungal partner and the photosynthetic partner, which can be either an alga or a cyanobacterium. In some lichen species the symbiosis is tripartite, where the relationship includes both an alga and a cyanobacterium alongside the primary symbiont, fungus. The lichen symbiosis is an evolutionarily old adaptation to life on land and many extant fungal species have evolved from lichenised ancestors. Lichens inhabit a wide range of habitats and are capable of living in harsh environments and on nutrient poor substrates, such as bare rocks, often enduring frequent cycles of drying and wetting. Most lichen species are desiccation tolerant, and they can survive long periods of dehydration, but can rapidly resume photosynthesis upon rehydration. The molecular mechanisms behind lichen desiccation tolerance are still largely uncharacterised and little information is available for any lichen species at the genomic or transcriptomic level. The emergence of the high-throughput next generation sequencing (NGS) technologies and the subsequent decrease in the cost of sequencing new genomes and transcriptomes has enabled non-model organism research on the whole genome level. In this doctoral work the transcriptome and genome of the grey reindeer lichen, Cladonia rangiferina, were sequenced, de novo assembled and characterised using NGS and traditional expressed sequence tag (EST) technologies. RNA extraction methods were optimised to improve the yield and quality of RNA extracted from lichen tissue. The effects of rehydration and desiccation on C. rangiferina gene expression on whole transcriptome level were studied and the most differentially expressed genes were identified.
The secondary metabolites present in C. rangiferina decreased the quality (integrity, optical characteristics and utility for sensitive molecular biological applications) of the extracted RNA, requiring an optimised RNA extraction method for isolating sufficient quantities of high-quality RNA from lichen tissue in a time- and cost-efficient manner. The de novo assembly of the transcriptome of C. rangiferina was used to produce a set of contiguous unigene sequences that were used to investigate the biological functions and pathways active in a hydrated lichen thallus. The de novo assembly of the genome yielded an assembly containing mostly genes derived from the fungal partner. The assembly was of sufficient quality, in size similar to other lichen-forming fungal genomes and included most of the core eukaryotic genes. Differences in gene expression were detected in all studied stages of desiccation and rehydration, but the largest changes occurred during the early stages of rehydration. The most differentially expressed genes did not have any annotations, making them potentially lichen-specific genes, but several genes known to participate in environmental stress tolerance in other organisms were also identified as differentially expressed.
Abstract:
Lysinuric protein intolerance (LPI) is a recessively inherited disorder characterised by reduced plasma and increased urinary levels of cationic amino acids (CAAs), protein malnutrition, growth failure and hyperlipidemia. Some patients develop severe immunological, renal and pulmonary complications. All Finnish patients share the same LPIFin mutation in the SLC7A7 gene that encodes CAA transporter y+LAT1. The aim of this study was to examine molecular factors contributing to the various symptoms, systemic metabolic and lipid profiles, and innate immune responses in LPI. The transcriptomes, metabolomes and lipidomes were analysed in whole-blood cells and plasma using RNA microarrays and gas or liquid chromatography-mass spectrometry techniques, respectively. Toll-like receptor (TLR) signalling in monocyte-derived macrophages exposed to pathogens was scrutinised using qRT-PCR and the Luminex technology. Altered levels of transcripts participating in amino acid transport, immune responses, apoptosis and pathways of hepatic and renal metabolism were identified in the LPI whole-blood cells. The patients had increased non-essential amino acid, triacylglycerol and fatty acid levels, and decreased plasma levels of phosphatidylcholines and practically all essential amino acids. In addition, elevated plasma levels of eight metabolites, long-chain triacylglycerols, two chemoattractant chemokines and nitric oxide correlated with the reduced glomerular function in the patients with kidney disease. Accordingly, it can be hypothesised that the patients have increased autophagy, inflammation, oxidative stress and apoptosis, leading to hepatic steatosis, uremic toxicity and altered intestinal microbe metabolism. Furthermore, the LPI macrophages showed disruption in the TLR2/1, TLR4 and TLR9 pathways, suggesting innate immune dysfunctions with an excessive response to bacterial infections but a deficient viral DNA response.
Abstract:
The plant family Apocynaceae accumulates thousands of monoterpene indole alkaloids (MIAs) which originate, biosynthetically, from the common secoiridoid intermediate, strictosidine, that is formed from the condensation of tryptophan and secologanin molecules. MIAs demonstrate remarkable structural diversity and have pharmaceutically valuable biological activities. For example, a subunit of the potent anti-neoplastic molecules vincristine and vinblastine is the aspidosperma alkaloid, vindoline. Vindoline accumulates to trace levels under natural conditions. Research programs have determined that there is significant developmental and light regulation involved in the biosynthesis of this MIA. Furthermore, the biosynthetic pathway leading to vindoline is split among at least five independent cell types. Little is known of how intermediates are shuttled between these cell types. The late stage events in vindoline biosynthesis involve six enzymatic steps from tabersonine. The fourth biochemical step in this pathway is an indole N-methylation performed by a recently identified N-methyltransferase (NMT). For almost twenty years the gene encoding this NMT had eluded discovery; however, in 2010 Liscombe et al. reported the identification of a γ-tocopherol C-methyltransferase homologue capable of indole N-methylating 2,3-dihydrotabersonine, and Virus Induced Gene Silencing (VIGS) suppression of the messenger has since proven its involvement in vindoline biosynthesis. Recent large scale sequencing initiatives, performed on non-model medicinal plant transcriptomes, have permitted identification of candidate genes presumably involved in MIA biosynthesis, never seen before in plant specialized metabolism research.
Probing the transcriptome assemblies of Catharanthus roseus (L.) G.Don, Vinca minor L., Rauwolfia serpentina (L.) Benth ex Kurz, Tabernaemontana elegans, and Amsonia hubrichtii with the nucleotide sequence of the N-methyltransferase involved in vindoline biosynthesis revealed eight new homologous methyltransferases. This thesis describes the identification, molecular cloning, recombinant expression and biochemical characterization of two picrinine NMTs, one from V. minor and one from R. serpentina, a perivine NMT from C. roseus, and an ajmaline NMT from R. serpentina. While these TLMTs were expressed and functional in planta, they were active at relatively low levels and their N-methylated alkaloid products were not apparent from our alkaloid isolates of the plants. It appears that, for the most part, these TLMTs participate in apparently silent biochemical pathways, awaiting the appropriate developmental and environmental cues for activity.