976 resultados para Gene Selection
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
The dataset contains raw data (quantification cycle) for a study which determined the most suitable hepatic reference genes for normalisation of qPCR data orginating from juvenile Atlantic salmon (14 days) exposed to 14 and 22 degrees C. These results will be useful for anyone wanting to study the effects of climate change/elevated temperature on reproductive physiology of fish (and perhaphs other vertebrates).
Resumo:
The dataset contains raw data (quantification cycle) for a study which determined the most suitable hepatic reference genes for normalisation of qPCR data orginating from adult (entire reproductive season) Atlantic salmon (14 days) exposed to 14 and 22 degrees C. These results will be useful for anyone wanting to study the effects of climate change/elevated temperature on reproductive physiology of fish (and perhaphs other vertebrates). In addition, a target gene (vitellogenin) has normalised using an inappropriate and an 'ideal' reference gene to demonstrate the consequences of using an unstable reference gene for normalisation. For the adult experiment, maiden and repeat adult females were held at the Salmon Enterprises of Tasmania (SALTAS) Wayatinah Hatchery (Tasmania, Australia) at ambient temperature and photoperiod in either 200 (maidens) or 50 (repeats) m3 circular tanks at stocking densities of 12-18, and 24-36 kg m-3 for maidens and repeats, respectively, until transfered to the experimental tanks.
Resumo:
Today, quantitative real-time PCR is the method of choice for rapid and reliable quantification of mRNA transcription. However, for an exact comparison of mRNA transcription in different samples or tissues it is crucial to choose the appropriate reference gene. Recently glyceraldehyde 3-phosphate dehydrogenase and P-actin have been used for that purpose. However, it has been reported that these genes as well as alternatives, like rRNA genes, are unsuitable references, because their transcription is significantly regulated in various experimental settings and variable in different tissues. Therefore, quantitative real-time PCR was used to determine the mRNA transcription profiles of 13 putative reference genes, comparing their transcription in 16 different tissues and in CCRF-HSB-2 cells stimulated with 12-O-tetradecanoylphorbol-13-acetate and ionomycin. Our results show that Classical reference genes are indeed unsuitable, whereas the RNA polymerase II gene was the gene with the most constant expression in different tissues and following stimulation in CCRF-HSB-2 cells. (C) 2003 Elsevier Inc. All rights reserved.
Resumo:
This work was supported by a Knowledge Transfer Network BBSRC Industrial Case (#414 BB/L502467/1) studentship in association Zoetis Inc.
Resumo:
2011
Resumo:
In the context of cancer diagnosis and treatment, we consider the problem of constructing an accurate prediction rule on the basis of a relatively small number of tumor tissue samples of known type containing the expression data on very many (possibly thousands) genes. Recently, results have been presented in the literature suggesting that it is possible to construct a prediction rule from only a few genes such that it has a negligible prediction error rate. However, in these results the test error or the leave-one-out cross-validated error is calculated without allowance for the selection bias. There is no allowance because the rule is either tested on tissue samples that were used in the first instance to select the genes being used in the rule or because the cross-validation of the rule is not external to the selection process; that is, gene selection is not performed in training the rule at each stage of the cross-validation process. We describe how in practice the selection bias can be assessed and corrected for by either performing a cross-validation or applying the bootstrap external to the selection process. We recommend using 10-fold rather than leave-one-out cross-validation, and concerning the bootstrap, we suggest using the so-called. 632+ bootstrap error estimate designed to handle overfitted prediction rules. Using two published data sets, we demonstrate that when correction is made for the selection bias, the cross-validated error is no longer zero for a subset of only a few genes.
Resumo:
Many learning problems require handling high dimensional datasets with a relatively small number of instances. Learning algorithms are thus confronted with the curse of dimensionality, and need to address it in order to be effective. Examples of these types of data include the bag-of-words representation in text classification problems and gene expression data for tumor detection/classification. Usually, among the high number of features characterizing the instances, many may be irrelevant (or even detrimental) for the learning tasks. It is thus clear that there is a need for adequate techniques for feature representation, reduction, and selection, to improve both the classification accuracy and the memory requirements. In this paper, we propose combined unsupervised feature discretization and feature selection techniques, suitable for medium and high-dimensional datasets. The experimental results on several standard datasets, with both sparse and dense features, show the efficiency of the proposed techniques as well as improvements over previous related techniques.
Resumo:
Feature selection is a central problem in machine learning and pattern recognition. On large datasets (in terms of dimension and/or number of instances), using search-based or wrapper techniques can be cornputationally prohibitive. Moreover, many filter methods based on relevance/redundancy assessment also take a prohibitively long time on high-dimensional. datasets. In this paper, we propose efficient unsupervised and supervised feature selection/ranking filters for high-dimensional datasets. These methods use low-complexity relevance and redundancy criteria, applicable to supervised, semi-supervised, and unsupervised learning, being able to act as pre-processors for computationally intensive methods to focus their attention on smaller subsets of promising features. The experimental results, with up to 10(5) features, show the time efficiency of our methods, with lower generalization error than state-of-the-art techniques, while being dramatically simpler and faster.
Resumo:
Recent investigations have shown that the maintenance of genomic imprinting of the murine insulin-like growth factor 2 (Igf2) gene involves at least two factors: the DNA (cytosine-5-)-methyltransferase activity, which is required to preserve the paternal specific expression of Igf2, and the H19 gene (lying 90 kb downstream of Igf2 gene), which upon inactivation leads to relaxation of the Igf2 imprint. It is not yet clear how these two factors are related to each other in the process of maintenance of Igf2 imprinting and, in particular, whether the latter is acting through cis elements or whether the H19 RNA itself is involved. By using Southern blots and the bisulfite genomic-sequencing technique, we have investigated the allelic methylation patterns (epigenotypes) of the Igf2 gene in two strains of mouse with distinct deletions of the H19 gene. The results show that maternal transmission of H19 gene deletions leads the maternal allele of Igf2 to adopt the epigenotype of the paternal allele and indicate that this phenomenon is influenced directly or indirectly by the H19 gene expression. More importantly, the bisulfite genomic-sequencing allowed us to show that the methylation pattern of the paternal allele of the Igf2 gene is affected in trans by deletions of the active maternal allele of the H19 gene. Selection during development for the appropriate expression of Igf2, dosage-dependent factors that bind to the Igf2 gene, or methylation transfer between the parental alleles could be involved in this trans effect.
Resumo:
Suboptimal maternal nutrition during gestation results in the establishment of long-term phenotypic changes and an increased disease risk in the offspring. To elucidate how such environmental sensitivity results in physiological outcomes, the molecular characterisation of these offspring has become the focus of many studies. However, the likely modification of key cellular processes such as metabolism in response to maternal undernutrition raises the question of whether the genes typically used as reference constants in gene expression studies are suitable controls. Using a mouse model of maternal protein undernutrition, we have investigated the stability of seven commonly used reference genes (18s, Hprt1, Pgk1, Ppib, Sdha, Tbp and Tuba1) in a variety of offspring tissues including liver, kidney, heart, retro-peritoneal and inter-scapular fat, extra-embryonic placenta and yolk sac, as well as in the preimplantation blastocyst and blastocyst-derived embryonic stem cells. We find that although the selected reference genes are all highly stable within this system, they show tissue, treatment and sex-specific variation. Furthermore, software-based selection approaches rank reference genes differently and do not always identify genes which differ between conditions. Therefore, we recommend that reference gene selection for gene expression studies should be thoroughly validated for each tissue of interest. © 2011 Elsevier Inc.
Resumo:
Purpose: The diagnosis of prostate cancer in men with persistently increased prostate specific antigen after a negative prostate biopsy has become a great challenge for urologists and pathologists. We analyzed the diagnostic value of 6 genes in the tissue of patients with prostate cancer. Materials and Methods: The study was comprised of 50 patients with localized disease who underwent radical prostatectomy. Gene selection was based on a previous microarray analysis. Among 4,147 genes with different expressions between 2 pools of patients 6 genes (PSMA, TMEFF2, GREB1, TH1L, IgH3 and PGC) were selected. These genes were tested for diagnostic value using the quantitative reverse transcription polymerase chain reaction method. Initially malignant tissue samples from 33 patients were analyzed and in the second part of the study we analyzed benign tissue samples from the other 17 patients with prostate cancer. The control group was comprised of tissue samples of patients with benign prostatic hyperplasia. Results: Analysis of malignant prostatic tissue demonstrated that prostate specific membrane antigen was over expressed (mean 9 times) and pepsinogen C was under expressed (mean 1.3 X 10(-4) times) in all cases compared to benign prostatic hyperplasia. The other 4 tested genes showed a variable expression pattern not allowing for differentiation between benign and malignant cases. When we tested these results in the benign prostate tissues from patients with cancer, pepsinogen C maintained the expression pattern. In terms of prostate specific membrane antigen, despite over expression in most cases (mean 12 times), 2 cases (12%) presented with under expression. Conclusions: Pepsinogen C tissue expression may constitute a powerful adjunctive method to prostate biopsy in the diagnosis of prostate cancer cases.
Resumo:
Aims: The physiological examination of amylase production by Aeromonas hydrophila JMP636 and identification of the mechanism of regulation. Methods and Results: Aeromonas hydrophila JMP636 was grown with single, then dual carbon sources; the growth cycle was followed and amylase activity throughout was monitored. The levels of cAMP, a known secondary messenger for the regulatory gene crp, were also examined. Amylase activity was regulated by catabolite repression. Physiological studies revealed that JMP636 exhibited both diauxic growth, with two carbon sources, and the 'acid toxicity' effect on glucose. The crp gene was cloned, expressed and inactivated from the JMP636 chromosome. Catabolite repression of amylase production and the 'acid toxicity' effect both require crp and were linked to cAMP levels. Conclusions: Regulation of amylase production was predicted to follow the model CRP-mediated cAMP-dependent Escherichia coli catabolite regulation system. Significance and Impact of the Study: This work provides an understanding of the physiology of the opportunistic pathogen Aer. hydrophila through identification of the mechanism of catabolite repression of amylase production and the existence of crp within this cell. It also provides a broader knowledge of global gene regulation and suggests regulatory mechanisms of other Aer. hydrophila gene/s.
Resumo:
Array technologies have made it possible to record simultaneously the expression pattern of thousands of genes. A fundamental problem in the analysis of gene expression data is the identification of highly relevant genes that either discriminate between phenotypic labels or are important with respect to the cellular process studied in the experiment: for example cell cycle or heat shock in yeast experiments, chemical or genetic perturbations of mammalian cell lines, and genes involved in class discovery for human tumors. In this paper we focus on the task of unsupervised gene selection. The problem of selecting a small subset of genes is particularly challenging as the datasets involved are typically characterized by a very small sample size ?? the order of few tens of tissue samples ??d by a very large feature space as the number of genes tend to be in the high thousands. We propose a model independent approach which scores candidate gene selections using spectral properties of the candidate affinity matrix. The algorithm is very straightforward to implement yet contains a number of remarkable properties which guarantee consistent sparse selections. To illustrate the value of our approach we applied our algorithm on five different datasets. The first consists of time course data from four well studied Hematopoietic cell lines (HL-60, Jurkat, NB4, and U937). The other four datasets include three well studied treatment outcomes (large cell lymphoma, childhood medulloblastomas, breast tumors) and one unpublished dataset (lymph status). We compared our approach both with other unsupervised methods (SOM,PCA,GS) and with supervised methods (SNR,RMB,RFE). The results clearly show that our approach considerably outperforms all the other unsupervised approaches in our study, is competitive with supervised methods and in some case even outperforms supervised approaches.