3 resultados para cDNA microarrays
em Repositório Científico do Instituto Politécnico de Lisboa - Portugal
Resumo:
Microarray allow to monitoring simultaneously thousands of genes, where the abundance of the transcripts under a same experimental condition at the same time can be quantified. Among various available array technologies, double channel cDNA microarray experiments have arisen in numerous technical protocols associated to genomic studies, which is the focus of this work. Microarray experiments involve many steps and each one can affect the quality of raw data. Background correction and normalization are preprocessing techniques to clean and correct the raw data when undesirable fluctuations arise from technical factors. Several recent studies showed that there is no preprocessing strategy that outperforms others in all circumstances and thus it seems difficult to provide general recommendations. In this work, it is proposed to use exploratory techniques to visualize the effects of preprocessing methods on statistical analysis of cancer two-channel microarray data sets, where the cancer types (classes) are known. For selecting differential expressed genes the arrow plot was used and the graph of profiles resultant from the correspondence analysis for visualizing the results. It was used 6 background methods and 6 normalization methods, performing 36 pre-processing methods and it was analyzed in a published cDNA microarray database (Liver) available at http://genome-www5.stanford.edu/ which microarrays were already classified by cancer type. All statistical analyses were performed using the R statistical software.
Resumo:
Background: A common task in analyzing microarray data is to determine which genes are differentially expressed across two (or more) kind of tissue samples or samples submitted under experimental conditions. Several statistical methods have been proposed to accomplish this goal, generally based on measures of distance between classes. It is well known that biological samples are heterogeneous because of factors such as molecular subtypes or genetic background that are often unknown to the experimenter. For instance, in experiments which involve molecular classification of tumors it is important to identify significant subtypes of cancer. Bimodal or multimodal distributions often reflect the presence of subsamples mixtures. Consequently, there can be genes differentially expressed on sample subgroups which are missed if usual statistical approaches are used. In this paper we propose a new graphical tool which not only identifies genes with up and down regulations, but also genes with differential expression in different subclasses, that are usually missed if current statistical methods are used. This tool is based on two measures of distance between samples, namely the overlapping coefficient (OVL) between two densities and the area under the receiver operating characteristic (ROC) curve. The methodology proposed here was implemented in the open-source R software. Results: This method was applied to a publicly available dataset, as well as to a simulated dataset. We compared our results with the ones obtained using some of the standard methods for detecting differentially expressed genes, namely Welch t-statistic, fold change (FC), rank products (RP), average difference (AD), weighted average difference (WAD), moderated t-statistic (modT), intensity-based moderated t-statistic (ibmT), significance analysis of microarrays (samT) and area under the ROC curve (AUC). On both datasets all differentially expressed genes with bimodal or multimodal distributions were not selected by all standard selection procedures. We also compared our results with (i) area between ROC curve and rising area (ABCR) and (ii) the test for not proper ROC curves (TNRC). We found our methodology more comprehensive, because it detects both bimodal and multimodal distributions and different variances can be considered on both samples. Another advantage of our method is that we can analyze graphically the behavior of different kinds of differentially expressed genes. Conclusion: Our results indicate that the arrow plot represents a new flexible and useful tool for the analysis of gene expression profiles from microarrays.
Resumo:
High salinity causes remarkable losses in rice productivity worldwide mainly because it inhibits growth and reduces grain yield. To cope with environmental changes, plants evolved several adaptive mechanisms, which involve the regulation of many stress-responsive genes. Among these, we have chosen OsRMC to study its transcriptional regulation in rice seedlings subjected to high salinity. Its transcription was highly induced by salt treatment and showed a stress-dose-dependent pattern. OsRMC encodes a receptor-like kinase described as a negative regulator of salt stress responses in rice. To investigate how OsRMC is regulated in response to high salinity, a salt-induced rice cDNA expression library was constructed and subsequently screened using the yeast one-hybrid system and the OsRMC promoter as bait. Thereby, two transcription factors (TFs), OsEREBP1 and OsEREBP2, belonging to the AP2/ERF family were identified. Both TFs were shown to bind to the same GCC-like DNA motif in OsRMC promoter and to negatively regulate its gene expression. The identified TFs were characterized regarding their gene expression under different abiotic stress conditions. This study revealed that OsEREBP1 transcript level is not significantly affected by salt, ABA or severe cold (5 °C) and is only slightly regulated by drought and moderate cold. On the other hand, the OsEREBP2 transcript level increased after cold, ABA, drought and high salinity treatments, indicating that OsEREBP2 may play a central role mediating the response to different abiotic stresses. Gene expression analysis in rice varieties with contrasting salt tolerance further suggests that OsEREBP2 is involved in salt stress response in rice.