3 resultados para Misclassification
em National Center for Biotechnology Information - NCBI
Resumo:
The comparative typing of matched tumor and blood DNAs at dinucleotide repeat (microsatellite) loci has revealed in tumor DNA the presence of alleles that are not observed in normal DNA. The occurrence of these additional alleles is possibly due to replication errors (RERs). Although this observation has led to the recognition of a subtype of colorectal cancer with a high incidence of RERs (caused by a deficiency in DNA mismatch repair), a thorough analysis of the RER frequency in a consecutive series of colorectal cancers had not been reported. It is shown here that the extensive typing of 88 colorectal tumors reveals a bimodal distribution for the frequency of RER at microsatellite loci. Within the major mode (75 tumors, RER− subtype), the probability that a locus exhibited instability did not differ significantly among loci and tumors, being 0.02. The subsequent development of a statistical test for an operational discrimination between the RER− and RER+ subtypes indicated that the probability of misclassification did not exceed 0.001 in this series. The frequency of K-ras mutation was found to be equivalent in the two subtypes. However, in the RER+ tumors, the p53 gene mutation was less frequently detected, the adenomatous polyposis coli (APC) mutation was rare, and the biallelic inactivation of either of these genes was not observed. Furthermore, the concomitant occurrence of APC and tumor growth factor β receptor type II gene alterations was found only once. These data suggest that the repertoires of genes that are frequently altered in RER+ and RER− tumors may be more different than previously thought.
Resumo:
We present statistical methods for analyzing replicated cDNA microarray expression data and report the results of a controlled experiment. The study was conducted to investigate inherent variability in gene expression data and the extent to which replication in an experiment produces more consistent and reliable findings. We introduce a statistical model to describe the probability that mRNA is contained in the target sample tissue, converted to probe, and ultimately detected on the slide. We also introduce a method to analyze the combined data from all replicates. Of the 288 genes considered in this controlled experiment, 32 would be expected to produce strong hybridization signals because of the known presence of repetitive sequences within them. Results based on individual replicates, however, show that there are 55, 36, and 58 highly expressed genes in replicates 1, 2, and 3, respectively. On the other hand, an analysis by using the combined data from all 3 replicates reveals that only 2 of the 288 genes are incorrectly classified as expressed. Our experiment shows that any single microarray output is subject to substantial variability. By pooling data from replicates, we can provide a more reliable analysis of gene expression data. Therefore, we conclude that designing experiments with replications will greatly reduce misclassification rates. We recommend that at least three replicates be used in designing experiments by using cDNA microarrays, particularly when gene expression data from single specimens are being analyzed.