964 results for TRANSCRIPTOME ANALYSIS


Relevance:

30.00%

Publisher:

Abstract:

In male tephritid fruit flies of the genus Bactrocera, feeding on secondary plant compounds (sensu lato male lures = methyl eugenol, raspberry ketone and zingerone) increases male mating success. Ingested male lures alter the male pheromonal blend, normally making it more attractive to females, and this is considered the primary mechanism for the enhanced mating success. However, the male lures raspberry ketone and zingerone are known, across a diverse range of other organisms, to be involved in increasing energy metabolism. If this also occurs in Bactrocera, then this may represent an additional benefit to males, as courtship is metabolically expensive and lure feeding may increase a fly's short-term energy. We tested this hypothesis by performing comparative RNA-seq analysis between zingerone-fed and unfed males of Bactrocera tryoni. We also carried out behavioural assays with zingerone- and cuelure-fed males to test whether they became more active. RNA-seq analysis revealed, in zingerone-fed flies, up-regulation of 3183 genes, with transcripts homologous to those known to regulate intermale aggression, pheromone synthesis, mating and accessory gland proteins, along with significant enrichment of several energy metabolic pathways and gene ontology terms. Behavioural assays showed significant increases in locomotor activity, weight reduction and successful mating after mounting, all direct or indirect measures of increased activity. These results suggest that feeding on lures leads to complex physiological changes, which result in more competitive males. These results do not negate the pheromone effect, but do strongly suggest that the phytochemical-induced sexual selection is governed by both female preference and male competitive mechanisms.

Relevance:

30.00%

Publisher:

Abstract:

Pangasianodon hypophthalmus is a commercially important freshwater fish used in inland aquaculture in the Mekong Delta, Vietnam. The current study, using Ion Torrent technology, generated EST resources from the kidney for Tra catfish reared at a salinity level of 9 ppt. We obtained 2,623,929 reads after trimming and processing, with an average length of 104 bp. De novo assemblies were generated using CLC Genomics Workbench, Trinity and Velvet/Oases, with the best overall contig performance resulting from the CLC assembly. De novo assembly using CLC yielded 29,940 contigs, allowing identification of 5,710 putative genes when compared with the NCBI non-redundant database. A large number of single nucleotide polymorphisms (SNPs) were also detected. The sequence collection generated in our study represents the most comprehensive transcriptomic resource for P. hypophthalmus available to date.

Relevance:

30.00%

Publisher:

Abstract:

This thesis studies human gene expression space using high throughput gene expression data from DNA microarrays. In molecular biology, high throughput techniques allow numerical measurements of expression of tens of thousands of genes simultaneously. In a single study, this data is traditionally obtained from a limited number of sample types with a small number of replicates. For organism-wide analysis, this data has been largely unavailable and the global structure of the human transcriptome has remained unknown. This thesis introduces a human transcriptome map of different biological entities and an analysis of its general structure. The map is constructed from gene expression data from the two largest public microarray data repositories, GEO and ArrayExpress. The creation of this map contributed to the development of ArrayExpress by identifying and retrofitting the previously unusable and missing data and by improving the access to its data. It also contributed to the creation of several new tools for microarray data manipulation and the establishment of data exchange between GEO and ArrayExpress. The data integration for the global map required creation of a new large ontology of human cell types, disease states, organism parts and cell lines. The ontology was used in a new text mining and decision tree based method for automatic conversion of human readable free text microarray data annotations into categorised format. Data comparability, and minimisation of the systematic measurement errors that are characteristic of each laboratory in this large cross-laboratory integrated dataset, was ensured by computing a range of microarray data quality metrics and excluding incomparable data. The structure of a global map of human gene expression was then explored by principal component analysis and hierarchical clustering, using heuristics and help from another purpose-built sample ontology. 
A preface and motivation to the construction and analysis of a global map of human gene expression is given by the analysis of two microarray datasets of human malignant melanoma. The analysis of these sets incorporates an indirect comparison of statistical methods for finding differentially expressed genes and points to the need to study gene expression on a global level.

Relevance:

30.00%

Publisher:

Abstract:

Gene expression is one of the most critical factors influencing the phenotype of a cell. As a result of several technological advances, measuring gene expression levels has become one of the most common molecular biological measurements to study the behaviour of cells. The scientific community has produced an enormous and constantly increasing collection of gene expression data from various human cells, from both healthy and pathological conditions. However, while each of these studies is informative and enlightening in its own context and research setup, diverging methods and terminologies make it very challenging to integrate existing gene expression data into a more comprehensive view of human transcriptome function. On the other hand, bioinformatic science advances only through data integration and synthesis. The aim of this study was to develop biological and mathematical methods to overcome these challenges and to construct an integrated database of the human transcriptome, as well as to demonstrate its usage. The methods developed in this study can be divided into two distinct parts. First, the biological and medical annotation of the existing gene expression measurements needed to be encoded by systematic vocabularies. There was no single existing biomedical ontology or vocabulary suitable for this purpose. Thus, a new annotation terminology was developed as a part of this work. The second part was to develop mathematical methods correcting the noise and systematic differences/errors in the data caused by various array generations. Additionally, there was a need to develop suitable computational methods for sample collection and archiving, unique sample identification, database structures, data retrieval and visualization. Bioinformatic methods were developed to analyze gene expression levels and putative functional associations of human genes by using the integrated gene expression data. 
Also a method to interpret individual gene expression profiles across all the healthy and pathological tissues of the reference database was developed. As a result of this work 9783 human gene expression samples measured by Affymetrix microarrays were integrated to form a unique human transcriptome resource GeneSapiens. This makes it possible to analyse expression levels of 17330 genes across 175 types of healthy and pathological human tissues. Application of this resource to interpret individual gene expression measurements allowed identification of tissue of origin with 92.0% accuracy among 44 healthy tissue types. Systematic analysis of transcriptional activity levels of 459 kinase genes was performed across 44 healthy and 55 pathological tissue types and a genome wide analysis of kinase gene co-expression networks was done. This analysis revealed biologically and medically interesting data on putative kinase gene functions in health and disease. Finally, we developed a method for alignment of gene expression profiles (AGEP) to perform analysis for individual patient samples to pinpoint gene- and pathway-specific changes in the test sample in relation to the reference transcriptome database. We also showed how large-scale gene expression data resources can be used to quantitatively characterize changes in the transcriptomic program of differentiating stem cells. Taken together, these studies indicate the power of systematic bioinformatic analyses to infer biological and medical insights from existing published datasets as well as to facilitate the interpretation of new molecular profiling data from individual patients.

Relevance:

30.00%

Publisher:

Abstract:

Dinoflagellates possess many physiological processes that appear to be under post-transcriptional control. However, the extent to which their genes are regulated post-transcriptionally remains unresolved. To gain insight into the roles of differential mRNA stability and de novo transcription in dinoflagellates, we biosynthetically labeled RNA with 4-thiouracil to isolate newly transcribed and pre-existing RNA pools in Karenia brevis. These isolated fractions were then used for analysis of global mRNA stability and de novo transcription by hybridization to a K. brevis microarray. Global K. brevis mRNA half-lives were calculated from the ratio of newly transcribed to pre-existing RNA for 7086 array features using the online software HALO (Half-life Organizer). Overall, mRNA half-lives were substantially longer than reported in other organisms studied at the global level, ranging from 42 minutes to greater than 144 h, with a median of 33 hours. Consistent with well-documented trends observed in other organisms, housekeeping processes, including energy metabolism and transport, were significantly enriched in the most highly stable messages. Shorter-lived transcripts included a higher proportion of transcriptional regulation, stress response, and other response/regulatory processes. One such family of proteins involved in post-transcriptional regulation in chloroplasts and mitochondria, the pentatricopeptide repeat (PPR) proteins, had dramatically shorter half-lives when compared to the arrayed transcriptome. As transcript abundances for PPR proteins were previously observed to rapidly increase in response to nutrient addition, we queried the newly synthesized RNA pools at 1 and 4 h following nitrate addition to N-depleted cultures. Transcriptome-wide there was little evidence of increases in the rate of de novo transcription during the first 4 h, relative to that in N-depleted cells, and no evidence for increased PPR protein transcription. 
These results lend support to the growing consensus of post-transcriptional control of gene expression in dinoflagellates.
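
As a rough illustration of how a half-life can follow from the ratio of newly transcribed to pre-existing RNA, the sketch below assumes steady-state transcript levels, first-order decay, and no dilution by cell growth. Under those assumptions the ratio R after a labeling time t satisfies R = exp(λt) − 1, so the half-life is t · ln 2 / ln(1 + R). This is not claimed to be the formula HALO applies, which may include normalization or growth corrections; the function name and the labeling time are hypothetical.

```python
import math


def half_life_from_ratio(ratio_new_to_old: float, labeling_time_h: float) -> float:
    """Estimate an mRNA half-life in hours (hypothetical helper).

    Assumes steady state, first-order decay, and no growth dilution, so that
    new/pre-existing = exp(decay_rate * t) - 1.
    """
    if ratio_new_to_old <= 0 or labeling_time_h <= 0:
        raise ValueError("ratio and labeling time must both be positive")
    # Solve the steady-state relation for the decay rate (per hour).
    decay_rate = math.log1p(ratio_new_to_old) / labeling_time_h
    # Convert the decay rate into a half-life.
    return math.log(2) / decay_rate


if __name__ == "__main__":
    # Illustrative values only: equal new and pre-existing pools after 1 h of labeling.
    print(half_life_from_ratio(1.0, 1.0))
```

A small ratio after a fixed labeling time corresponds to a long half-life, which matches the direction of the result reported in the abstract for highly stable messages.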

Relevance:

30.00%

Publisher:

Abstract:

Understanding the dynamics of the eukaryotic transcriptome is essential for studying the complexity of transcriptional regulation and its impact on phenotype. However, comprehensive studies of transcriptomes at single base resolution are rare, even for modern organisms, and lacking for rice. Here, we present the first transcriptome atlas for eight organs of cultivated rice. Using high-throughput paired-end RNA-seq, we unambiguously detected transcripts expressed at an extremely low level, as well as a substantial number of novel transcripts, exons, and untranslated regions. An analysis of alternative splicing in the rice transcriptome revealed that alternative cis-splicing occurred in approximately 33% of all rice genes. This is far more than previously reported. In addition, we identified 234 putative chimeric transcripts that seem to be produced by trans-splicing, indicating that transcript fusion events are more common than expected. In-depth analysis revealed a multitude of fusion transcripts that might be by-products of alternative splicing. Validation and chimeric transcript structural analysis provided evidence that some of these transcripts are likely to be functional in the cell. Taken together, our data provide extensive evidence that transcriptional regulation in rice is vastly more complex than previously believed.

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: There is considerable interest in the development of methods to efficiently identify all coding variants present in large sample sets of humans. There are three approaches possible: whole-genome sequencing, whole-exome sequencing using exon capture methods, and RNA-Seq. While whole-genome sequencing is the most complete, it remains sufficiently expensive that cost effective alternatives are important. RESULTS: Here we provide a systematic exploration of how well RNA-Seq can identify human coding variants by comparing variants identified through high coverage whole-genome sequencing to those identified by high coverage RNA-Seq in the same individual. This comparison allowed us to directly evaluate the sensitivity and specificity of RNA-Seq in identifying coding variants, and to evaluate how key parameters such as the degree of coverage and the expression levels of genes interact to influence performance. We find that although only 40% of exonic variants identified by whole genome sequencing were captured using RNA-Seq, this number rose to 81% when concentrating on genes known to be well-expressed in the source tissue. We also find that a high false positive rate can be problematic when working with RNA-Seq data, especially at higher levels of coverage. CONCLUSIONS: We conclude that as long as a tissue relevant to the trait under study is available and suitable quality control screens are implemented, RNA-Seq is a fast and inexpensive alternative approach for finding coding variants in genes with sufficiently high expression levels.
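
The kind of comparison described in this abstract can be sketched as a set operation on two call sets. The example below is a minimal illustration, assuming variants are represented as (chromosome, position, reference allele, alternate allele) tuples and that the whole-genome calls are treated as the reference truth; both are assumptions made here, not details from the study. Because true negatives are not defined for a pair of call sets, the sketch reports the share of RNA-Seq calls absent from the whole-genome set rather than a specificity value. The function name and sample variants are hypothetical.

```python
from typing import Dict, Iterable, Tuple

Variant = Tuple[str, int, str, str]  # (chromosome, position, ref allele, alt allele)


def compare_call_sets(wgs_calls: Iterable[Variant],
                      rnaseq_calls: Iterable[Variant]) -> Dict[str, float]:
    """Compare RNA-Seq calls against whole-genome calls (hypothetical helper).

    Assumes the whole-genome calls are the reference set.
    """
    wgs = set(wgs_calls)
    rna = set(rnaseq_calls)
    shared = wgs & rna
    # Fraction of whole-genome variants that RNA-Seq also recovered.
    sensitivity = len(shared) / len(wgs) if wgs else float("nan")
    # Fraction of RNA-Seq calls with no support in the whole-genome set.
    unsupported = len(rna - wgs) / len(rna) if rna else float("nan")
    return {"sensitivity": sensitivity, "unsupported_fraction": unsupported}


if __name__ == "__main__":
    # Made-up variants for illustration only.
    wgs = [("chr1", 100, "A", "G"), ("chr1", 250, "C", "T"), ("chr2", 40, "G", "A"),
           ("chr2", 90, "T", "C"), ("chr3", 10, "A", "T")]
    rna = [("chr1", 100, "A", "G"), ("chr2", 40, "G", "A"), ("chr5", 7, "C", "G")]
    print(compare_call_sets(wgs, rna))
```

Restricting both sets to variants in well-expressed genes before the comparison would mirror the stratified analysis that the abstract reports.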

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: Since mature erythrocytes are terminally differentiated cells without nuclei and organelles, it is commonly thought that they do not contain nucleic acids. In this study, we have re-examined this issue by analyzing the transcriptome of a purified population of human mature erythrocytes from individuals with normal hemoglobin (HbAA) and homozygous sickle cell disease (HbSS). METHODS AND FINDINGS: Using a combination of microarray analysis, real-time RT-PCR and Northern blots, we found that mature erythrocytes, while lacking ribosomal and large-sized RNAs, contain abundant and diverse microRNAs. MicroRNA expression of erythrocytes was different from that of reticulocytes and leukocytes, and contributed the majority of the microRNA expression in whole blood. When we used microRNA microarrays to analyze erythrocytes from HbAA and HbSS individuals, we noted a dramatic difference in their microRNA expression pattern. We found that miR-320 played an important role in the down-regulation of its target gene, CD71, during reticulocyte terminal differentiation. Further investigation revealed that poor expression of miR-320 in HbSS cells was associated with their defective down-regulation of CD71 during terminal differentiation. CONCLUSIONS: In summary, we have discovered significant microRNA expression in human mature erythrocytes, which is dramatically altered in HbSS erythrocytes and their defect in terminal differentiation. Thus, the global analysis of microRNA expression in circulating erythrocytes can provide mechanistic insights into the disease phenotypes of erythrocyte diseases.

Relevance:

30.00%

Publisher:

Abstract:

The adoption of antisense gene silencing as a novel disinfectant for prokaryotic organisms is hindered by poor silencing efficiencies. Few studies have considered the effects of off-targets on silencing efficiencies, especially in prokaryotic organisms. In this computational study, a novel algorithm was developed that determined and sorted the number of off-targets as a function of alignment length in Escherichia coli K-12 MG1655 and Mycobacterium tuberculosis H37Rv. The mean number of off-targets per single location was calculated to be 14.1 ± 13.3 and 36.1 ± 58.5 for the genomes of E. coli K-12 MG1655 and M. tuberculosis H37Rv, respectively. Furthermore, when the entire transcriptome was analyzed, it was found that there was no general gene location that could be targeted to minimize or maximize the number of off-targets. In an effort to determine the effects of off-targets on silencing efficiencies, previously published studies were used. Analyses with acpP, ino1, and marORAB revealed a statistically significant relationship between the number of short alignment length off-target hybrids and the efficacy of the antisense gene silencing, suggesting that the minimization of off-targets may be beneficial for antisense gene silencing in prokaryotic organisms.
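
The abstract does not describe the algorithm itself, so the sketch below is only one simple way to count off-targets as a function of alignment length. It assumes an off-target is any other position in a single sequence where the stretch starting at the target location recurs as an exact match on the same strand. The real method may handle mismatches, both strands, or multiple transcripts differently; the function name and the toy sequence are hypothetical.

```python
from typing import Dict, Iterable


def off_targets_by_length(sequence: str, start: int,
                          lengths: Iterable[int]) -> Dict[int, int]:
    """Count exact-match off-targets per alignment length (hypothetical helper).

    For each length k, counts positions other than `start` where the k-mer
    beginning at `start` occurs again in `sequence`.
    """
    counts: Dict[int, int] = {}
    for k in lengths:
        probe = sequence[start:start + k]
        if k <= 0 or len(probe) < k:
            raise ValueError(f"length {k} does not fit at position {start}")
        hits = 0
        pos = sequence.find(probe)
        while pos != -1:
            if pos != start:  # the intended target itself is not an off-target
                hits += 1
            pos = sequence.find(probe, pos + 1)  # allow overlapping matches
        counts[k] = hits
    return counts


if __name__ == "__main__":
    # Toy sequence for illustration only.
    print(off_targets_by_length("ACGTACGTTTACG", 0, [3, 4, 5]))
```

Longer alignment lengths can only keep or reduce the count at a given location, since every longer match contains the shorter one.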

Relevance:

30.00%

Publisher:

Abstract:

Copepods of the genus Calanus are key zooplankton species in temperate to arctic marine ecosystems. Despite their ecological importance, species identification remains challenging. Furthermore, the recent report of hybrids among Calanus species highlights the need for diagnostic nuclear markers to efficiently identify parental species and hybrids. Using next-generation sequencing analysis of both the genome and transcriptome from two sibling species, Calanus finmarchicus and Calanus glacialis, we developed a panel of 12 nuclear insertion/deletion markers. All the markers showed species-specific amplicon length. Furthermore, most of the markers were successfully amplified in other Calanus species, allowing the molecular identification of Calanus helgolandicus, Calanus hyperboreus and Calanus marshallae.

Relevance:

30.00%

Publisher:

Abstract:

Amphibian skin secretions are rich in antimicrobial peptides that act as important components of an innate immune system. Here, we describe a novel “shotgun” skin peptide precursor cloning technique that facilitates rapid access to these genetically encoded molecules and effects their subsequent identification and structural characterization from the secretory peptidome. Adopting this approach on a skin secretion-derived library from a hitherto unstudied Chinese species of frog, we identified a family of novel antimicrobial peptide homologs, named pelophylaxins, that belong to previously identified families (ranatuerins, brevinins and temporins) found predominantly in the skin secretions from frogs of the genus Rana. These data further substantiate the scientifically robust nature of applying parallel transcriptome and peptidome analyses on frog defensive skin secretions that can be obtained in a non-invasive, non-destructive manner. In addition, the present data illustrate that rapid structural characterization of frog skin secretion peptides can be achieved from an unstudied species without prior knowledge of primary structures of endogenous peptides.

Relevance:

30.00%

Publisher:

Abstract:

Peptidomics is a powerful set of tools for the identification, structural elucidation and discovery of novel regulatory peptides and for monitoring the degradation pathways of structurally and catalytically important proteins. Amphibian skin secretions, arising from specialized granular glands, often contain complex peptidomes containing many components of entirely novel structure and unique site-substituted analogues of known peptide families. Following the discovery that the granular gland transcriptome is present in such secretions in a PCR-amenable form, we designed a strategy for peptide structural characterization involving the integration of ‘shotgun’ cloning of cDNAs encoding peptide precursors, deduction of putative bioactive peptide structures, and confirmation of these structures using tandem MS/MS sequencing. Here, we illustrate this strategy by means of elucidation of the primary structures of nigrocin-2 homologues from the defensive skin secretions of four species of Chinese Odorrana frogs, O. schmackeri, O. livida, O. hejiangensis and O. versabilis. Synthetic replicates of the peptides were found to possess antimicrobial activity. Nigrocin-2 peptides occur widely in the skin secretions of Asian ranid frogs and in those of the Odorrana group, and are particularly well-represented and of diverse structure in some species. Integration of the molecular analytical technologies described provides a means for rapid structural characterization of novel peptides from complex natural libraries in the absence of systematic online database information.

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: To date, there are no clinically reliable predictive markers of response to the current treatment regimens for advanced colorectal cancer. The aim of the current study was to compare and assess the power of transcriptional profiling using a generic microarray and a disease-specific transcriptome-based microarray. We also examined the biological and clinical relevance of the disease-specific transcriptome.

METHODS: DNA microarray profiling was carried out on isogenic sensitive and 5-FU-resistant HCT116 colorectal cancer cell lines using the Affymetrix HG-U133 Plus2.0 array and the Almac Diagnostics Colorectal cancer disease specific Research tool. In addition, DNA microarray profiling was also carried out on pre-treatment metastatic colorectal cancer biopsies using the colorectal cancer disease specific Research tool. The two microarray platforms were compared based on detection of probesets and biological information.

RESULTS: The results demonstrated that the disease-specific transcriptome-based microarray was able to out-perform the generic genomic-based microarray on a number of levels including detection of transcripts and pathway analysis. In addition, the disease-specific microarray contains a high percentage of antisense transcripts and further analysis demonstrated that a number of these exist in sense:antisense pairs. Comparison between cell line models and metastatic CRC patient biopsies further demonstrated that a number of the identified sense:antisense pairs were also detected in CRC patient biopsies, suggesting potential clinical relevance.

CONCLUSIONS: Analysis from our in vitro and clinical experiments has demonstrated that many transcripts exist in sense:antisense pairs including IGF2BP2, which may have a direct regulatory function in the context of colorectal cancer. While the functional relevance of the antisense transcripts has been established by many studies, their functional role is currently unclear; however, the numbers that have been detected by the disease-specific microarray would suggest that they may be important regulatory transcripts. This study has demonstrated the power of a disease-specific transcriptome-based approach and highlighted the potential novel biologically and clinically relevant information that is gained when using such a methodology.

Relevance:

30.00%

Publisher:

Abstract:

mRNA chimeras from chromosomal translocations often play a role as transforming oncogenes. However, cancer transcriptomes also contain mRNA chimeras that may play a role in tumor development, which arise as transcriptional or post-transcriptional events. To identify such chimeras, we developed a deterministic screening strategy for long-range sequence analysis. High-throughput, long-read sequencing was then performed on cDNA libraries from major tumor histotypes and corresponding normal tissues. These analyses led to the identification of 378 chimeras, with an unexpectedly high frequency of expression (~2 × 10^-5 of all mRNA). Functional assays in breast and ovarian cancer cell lines showed that a large fraction of mRNA chimeras regulates cell replication. Strikingly, chimeras were shown to include both positive and negative regulators of cell growth, which functioned as such in a cell-type-specific manner. Replication-controlling chimeras were found to be expressed by most cancers from breast, ovary, colon, uterus, kidney, lung, and stomach, suggesting a widespread role in tumor development.

Relevance:

30.00%

Publisher:

Abstract:

The aim of this study was to characterize the transcriptome of a balanced polymorphism, under the regulation of a single gene, for phosphate fertilizer responsiveness/arsenate tolerance in wild grass Holcus lanatus genotypes screened from the same habitat.

De novo transcriptome sequencing, RNAseq (RNA sequencing) and single nucleotide polymorphism (SNP) calling were conducted on RNA extracted from H. lanatus. Roche 454 sequencing data were assembled into c. 22 000 isotigs, and paired-end Illumina reads for phosphorus-starved (P−) and phosphorus-treated (P+) genovars of tolerant (T) and nontolerant (N) phenotypes were mapped to this reference transcriptome.

Heatmaps of the gene expression data showed strong clustering of each P+/P− treated genovar, as well as clustering by N/T phenotype. Statistical analysis identified 87 isotigs to be significantly differentially expressed between N and T phenotypes and 258 between P+ and P− treated plants. SNPs and transcript expression that systematically differed between N and T phenotypes had regulatory function, namely proteases, kinases and ribonuclear RNA-binding protein and transposable elements.

A single gene for arsenate tolerance led to distinct phenotype transcriptomes and SNP profiles, with large differences in upstream post-translational and post-transcriptional regulatory genes rather than in genes directly involved in P nutrition transport and metabolism per se.