33 resultados para gene insertions
Resumo:
MOTIVATION: Synthetic lethal interactions represent pairs of genes whose individual mutations are not lethal, while the double mutation of both genes does incur lethality. Several studies have shown a correlation between functional similarity of genes and their distances in networks based on synthetic lethal interactions. However, there is a lack of algorithms for predicting gene function from synthetic lethality interaction networks. RESULTS: In this article, we present a novel technique called kernelROD for gene function prediction from synthetic lethal interaction networks based on kernel machines. We apply our novel algorithm to Gene Ontology functional annotation prediction in yeast. Our experiments show that our method leads to improved gene function prediction compared with state-of-the-art competitors and that combining genetic and congruence networks leads to a further improvement in prediction accuracy.
Resumo:
Understanding the regulatory mechanisms that are responsible for an organism's response to environmental change is an important issue in molecular biology. A first and important step towards this goal is to detect genes whose expression levels are affected by altered external conditions. A range of methods to test for differential gene expression, both in static as well as in time-course experiments, have been proposed. While these tests answer the question whether a gene is differentially expressed, they do not explicitly address the question when a gene is differentially expressed, although this information may provide insights into the course and causal structure of regulatory programs. In this article, we propose a two-sample test for identifying intervals of differential gene expression in microarray time series. Our approach is based on Gaussian process regression, can deal with arbitrary numbers of replicates, and is robust with respect to outliers. We apply our algorithm to study the response of Arabidopsis thaliana genes to an infection by a fungal pathogen using a microarray time series dataset covering 30,336 gene probes at 24 observed time points. In classification experiments, our test compares favorably with existing methods and provides additional insights into time-dependent differential expression.
Resumo:
A nonparametric Bayesian extension of Factor Analysis (FA) is proposed where observed data $\mathbf{Y}$ is modeled as a linear superposition, $\mathbf{G}$, of a potentially infinite number of hidden factors, $\mathbf{X}$. The Indian Buffet Process (IBP) is used as a prior on $\mathbf{G}$ to incorporate sparsity and to allow the number of latent features to be inferred. The model's utility for modeling gene expression data is investigated using randomly generated data sets based on a known sparse connectivity matrix for E. Coli, and on three biological data sets of increasing complexity.
Resumo:
Small RNAs have several important biological functions. MicroRNAs (miRNAs) and trans-acting small interfering RNAs (tasiRNAs) regulate mRNA stability and translation, and siRNAs cause post-transcriptional gene silencing of transposons, viruses and transgenes and are important in both the establishment and maintenance of cytosine DNA methylation. Here, we study the role of the four Arabidopsis thaliana DICER-LIKE genes (DCL1-DCL4) in these processes. Sequencing of small RNAs from a dcl2 dcl3 dcl4 triple mutant showed markedly reduced tasiRNA and siRNA production and indicated that DCL1, in addition to its role as the major enzyme for processing miRNAs, has a previously unknown role in the production of small RNAs from endogenous inverted repeats. DCL2, DCL3 and DCL4 showed functional redundancy in siRNA and tasiRNA production and in the establishment and maintenance of DNA methylation. Our studies also suggest that asymmetric DNA methylation can be maintained by pathways that do not require siRNAs.