373 resultados para unsupervised
Resumo:
Peripheral T-cell lymphoma, not otherwise specified is a heterogeneous group of aggressive neoplasms with indistinct borders. By gene expression profiling we previously reported unsupervised clusters of peripheral T-cell lymphomas, not otherwise specified correlating with CD30 expression. In this work we extended the analysis of peripheral T-cell lymphoma molecular profiles to prototypical CD30(+) peripheral T-cell lymphomas (anaplastic large cell lymphomas), and validated mRNA expression profiles at the protein level. Existing transcriptomic datasets from peripheral T-cell lymphomas, not otherwise specified and anaplastic large cell lymphomas were reanalyzed. Twenty-one markers were selected for immunohistochemical validation on 80 peripheral T-cell lymphoma samples (not otherwise specified, CD30(+) and CD30(-); anaplastic large cell lymphomas, ALK(+) and ALK(-)), and differences between subgroups were assessed. Clinical follow-up was recorded. Compared to CD30(-) tumors, CD30(+) peripheral T-cell lymphomas, not otherwise specified were significantly enriched in ALK(-) anaplastic large cell lymphoma-related genes. By immunohistochemistry, CD30(+) peripheral T-cell lymphomas, not otherwise specified differed significantly from CD30(-) samples [down-regulated expression of T-cell receptor-associated proximal tyrosine kinases (Lck, Fyn, Itk) and of proteins involved in T-cell differentiation/activation (CD69, ICOS, CD52, NFATc2); upregulation of JunB and MUM1], while overlapping with anaplastic large cell lymphomas. CD30(-) peripheral T-cell lymphomas, not otherwise specified tended to have an inferior clinical outcome compared to the CD30(+) subgroups. In conclusion, we show molecular and phenotypic features common to CD30(+) peripheral T-cell lymphomas, and significant differences between CD30(-) and CD30(+) peripheral T-cell lymphomas, not otherwise specified, suggesting that CD30 expression might delineate two biologically distinct subgroups.
Resumo:
The 2009-2010 Data Fusion Contest organized by the Data Fusion Technical Committee of the IEEE Geoscience and Remote Sensing Society was focused on the detection of flooded areas using multi-temporal and multi-modal images. Both high spatial resolution optical and synthetic aperture radar data were provided. The goal was not only to identify the best algorithms (in terms of accuracy), but also to investigate the further improvement derived from decision fusion. This paper presents the four awarded algorithms and the conclusions of the contest, investigating both supervised and unsupervised methods and the use of multi-modal data for flood detection. Interestingly, a simple unsupervised change detection method provided similar accuracy as supervised approaches, and a digital elevation model-based predictive method yielded a comparable projected change detection map without using post-event data.
Resumo:
Background: HSTL is a rare entity characterized by an infiltration of bone marrow, spleen and liver tissues by neoplastic gammadelta (gd) -more rarely alphabeta (ab)- T cells. Its pathogenesis is poorly understood. Our purpose was to identify the molecular signature of HSTL and explore molecular pathways implicated in its pathogenesis.Methods: Gene expression profiling and array CGH analysis of 10 HSTL samples (7gd, 3ab), 1 HSTL cell line (DERL2), 2 normal gd samples together with 16 peripheral T-cell lymphoma not otherwise specified (PTCL,NOS) and 7 nasal NK/T cell lymphomas were performed.Results: By unsupervised analysis, ab and gdHSTL clustered together remarkably separated from other lymphoma entities. Compared to PTCL, NOS, HSTL overexpresed genes encoding NK-associated molecules, oncogenes (VAV3) and the Sphingosine-1-phosphatase receptor 5 involved in cell trafficking. Compared to normal gd cells, HSTL overexpressed genes encoding NK-cell and multi drug resistance-associated molecules, transcription factors (RHOB), oncogenes (MAFB, FOS, JUN, VAV3) and the tyrosine kinase SYK whereas genes encoding cytotoxic molecules and the tumor suppressor gene AIM1 were among the most downregulated. By immunohistochemistry, SYK was demonstrated on HSTL cells with expression of its phosphorylated form in DERL2 cells by Western blot. Functional studies using a SYK inhibitor revealed a dose dependent increase of apoptotic DERL2 cells suggesting that SYK could be a candidate target for pharmacologic inhibition. Downexpression of AIM1 was validated by qRT-PCR. Methylation analysis of DERL2 genomic DNA treated by bisulfite demonstrated highly methylated CpG islands of AIM1. Genomic profiles confirmed recurrent isochromosome 7q (n=6/9) without alterations at 9q22 and 6q21 containing SYK and AIM1 genes, respectively.Conclusion: The current study identifies a distinct molecular signature for HSTL and highlights oncogenic pathways which offer rationale for exploring new therapeutic options such as SYK inhibitors. It supports the view of gd and ab HSTL as a single entity.
Resumo:
Given a set of images of scenes containing different object categories (e.g. grass, roads) our objective is to discover these objects in each image, and to use this object occurrences to perform a scene classification (e.g. beach scene, mountain scene). We achieve this by using a supervised learning algorithm able to learn with few images to facilitate the user task. We use a probabilistic model to recognise the objects and further we classify the scene based on their object occurrences. Experimental results are shown and evaluated to prove the validity of our proposal. Object recognition performance is compared to the approaches of He et al. (2004) and Marti et al. (2001) using their own datasets. Furthermore an unsupervised method is implemented in order to evaluate the advantages and disadvantages of our supervised classification approach versus an unsupervised one
Resumo:
Nowadays, the joint exploitation of images acquired daily by remote sensing instruments and of images available from archives allows a detailed monitoring of the transitions occurring at the surface of the Earth. These modifications of the land cover generate spectral discrepancies that can be detected via the analysis of remote sensing images. Independently from the origin of the images and of type of surface change, a correct processing of such data implies the adoption of flexible, robust and possibly nonlinear method, to correctly account for the complex statistical relationships characterizing the pixels of the images. This Thesis deals with the development and the application of advanced statistical methods for multi-temporal optical remote sensing image processing tasks. Three different families of machine learning models have been explored and fundamental solutions for change detection problems are provided. In the first part, change detection with user supervision has been considered. In a first application, a nonlinear classifier has been applied with the intent of precisely delineating flooded regions from a pair of images. In a second case study, the spatial context of each pixel has been injected into another nonlinear classifier to obtain a precise mapping of new urban structures. In both cases, the user provides the classifier with examples of what he believes has changed or not. In the second part, a completely automatic and unsupervised method for precise binary detection of changes has been proposed. The technique allows a very accurate mapping without any user intervention, resulting particularly useful when readiness and reaction times of the system are a crucial constraint. In the third, the problem of statistical distributions shifting between acquisitions is studied. Two approaches to transform the couple of bi-temporal images and reduce their differences unrelated to changes in land cover are studied. The methods align the distributions of the images, so that the pixel-wise comparison could be carried out with higher accuracy. Furthermore, the second method can deal with images from different sensors, no matter the dimensionality of the data nor the spectral information content. This opens the doors to possible solutions for a crucial problem in the field: detecting changes when the images have been acquired by two different sensors.
Resumo:
ABSTRACT Poor outcome for glioblastoma patients is largely due to resistance to chemoradiation therapy. While epigenetic inactivation of MGMT mediated DNA repair is highly predictive for benefit from the alkylating agent therapy Temozolomide, additional mechanisms for resistance associated with molecular alterations exist. Furthermore, new concepts in cancer suggest that resistance to treatment may be linked to cancer stem cells that escape therapy and act as source for tumour recurrence. We determined gene expression signatures associated with outcome in glioblastoma patients enrolled in a phase II and phase III clinical trial establishing the new combination therapy of radiation plus concomitant and adjuvant Temozolomide. Correlating stable gene clusters emerging from unsupervised analysis with survival of 42 treated patients identified a number of biological processes associated with outcome. Most prominent, a gene cluster dominated by HOX genes and comprising PROM1, was associated with resistance. PROM1 encodes CD133, a marker for a subpopulation of tumour cells enriched for glioblastoma stem- like cells. The core of this correlated HOX cluster was comprised in the top genes of a "self-renewal signature" defined in a mouse model for MLL-AF9 initiated leukaemia. The association of the HOX gene cluster with tumour resistance was confirmed in two external data sets of 146 malignant glioma As additional resistance factors we identified over-expression of the epidermal growth factor receptor gene, EGFR, while increased gene expression related to biological features of tumour host interaction, including markers for tumour vascular and cell adhesion, and innate immune response, were associated with better outcome. The "self-renewal" signature associated with resistance to the new combination chemoradiation therapy provides first clinical evidence that glioma stem like cells may implicated in resistance in a uniformly treated cohort of glioblastoma patients. This study underlines the need to target the tumour stem cell compartment, and provides some testable hypothesis for biological mechanisms relevant for malignant behaviour of glioblastoma that may be targeted in new treatment approaches. Résumé Le glioblastome, tumeur cérébrale primaire maligne la plus fréquente, est connue pour son mauvais pronostique. Des avancées chimiothérapeutiques récentes avec des agents alkylants comme le témozolomide (TMZ), ont permis une amélioration notable dans la survie de certains patients. Les bénéficiaires ont la caractéristique commune de présenter une particularité génétique, la methylation du MGMT (methylguanine methyltransferase). Néanmoins, d'autres mécanismes de résistance en fonction des aberrations moléculaires existent. Nous avons établi les profils d'expressions génétiques des patients traités par irradiation et TMZ dans des études cliniques de phase II et III. En combinant des méthodes non-supervisées et supervisées, de l'étude de la cohorte des patients traités nous avons découvert des groupes de gènes associés à la survie. Un ensemble de gènes contenant les gènes Hox semble lié au mécanisme de résistance au traitement. Récemment, les gènes Hox ont été décrits comme faisant partie d"une signature d'autorenouvellement (self-renewal) des cellules souches cancéreuses de la leucémie. L'autorenouvellement est un processus grâce auquel les cellules souches se maintiennent tout au long de la vie. Cette association à la résistance est confirmée dans deux autres études indépendantes. Un autre facteur de résistance au traitement est la surexpression du gène EGFR. D'autre part, deux groupes de gènes associés à la relation entre hôte-tumeur tels que les marqueurs des vaisseaux tumoraux et de la réponse immunitaire innée s'avèrent avoir un effet positif sur la survie des patients traités. La découverte de la signature d'autorenouvellement comme facteur de résistance à la nouvelle chimio-radiothérapie offre une preuve clinique que les cellules souches cancéreuses sont impliquées dans la résistance au traitement. If est donc logique de penser que le traitement ciblé contre des cellules souches cancéreuses va dans l'avenir permettre des thérapies anticancéreuses plus performantes.
Resumo:
Glioblastoma are rapidly proliferating brain tumors in which hypoxia is readily recognizable, as indicated by focal or extensive necrosis and vascular proliferation, two independent diagnostic criteria for glioblastoma. Gene expression profiling of glioblastoma revealed a gene expression signature associated with hypoxia-regulated genes. The correlated gene set emerging from unsupervised analysis comprised known hypoxia-inducible genes involved in angiogenesis and inflammation such as VEGF and BIRC3, respectively. The relationship between hypoxia-modulated angiogenic genes and inflammatory genes was associated with outcome in our cohort of glioblastoma patients treated within prospective clinical trials of combined chemoradiotherapy. The hypoxia regulation of several new genes comprised in this cluster including ZNF395, TNFAIP3, and TREM1 was experimentally confirmed in glioma cell lines and primary monocytes exposed to hypoxia in vitro. Interestingly, the cluster seems to characterize differential response of tumor cells, stromal cells and the macrophage/microglia compartment to hypoxic conditions. Most genes classically associated with the inflammatory compartment are part of the NF-kappaB signaling pathway including TNFAIP3 and BIRC3 that have been shown to be involved in resistance to chemotherapy.Our results associate hypoxia-driven tumor response with inflammation in glioblastoma, hence underlining the importance of tumor-host interaction involving the inflammatory compartment.
Resumo:
Mouse models are important tools to decipher the molecular mechanisms of mammary carcinogenesis and to mimic the respective human disease. Despite sharing common phenotypic and genetic features, the proper translation of murine models to human breast cancer remains a challenging task. In a previous study we showed that in the SV40 transgenic WAP-T mice an active Met-pathway and epithelial-mesenchymal characteristics distinguish low- and high-grade mammary carcinoma. To assign these murine tumors to corresponding human tumors we here incorporated the analysis of expression of transcription factor (TF) coding genes and show that thereby a more accurate interspecies translation can be achieved. We describe a novel cross-species translation procedure and demonstrate that expression of unsupervised selected TFs, such as ELF5, HOXA5 and TFCP2L1, can clearly distinguish between the human molecular breast cancer subtypes-or as, for example, expression of TFAP2B between yet unclassified subgroups. By integrating different levels of information like histology, gene set enrichment, expression of differentiation markers and TFs we conclude that tumors in WAP-T mice exhibit similarities to both, human basal-like and non-basal-like subtypes. We furthermore suggest that the low- and high-grade WAP-T tumor phenotypes might arise from distinct cells of tumor origin. Our results underscore the importance of TFs as common cross-species denominators in the regulatory networks underlying mammary carcinogenesis.
Resumo:
Counterfeit pharmaceutical products have become a widespread problem in the last decade. Various analytical techniques have been applied to discriminate between genuine and counterfeit products. Among these, Near-infrared (NIR) and Raman spectroscopy provided promising results.The present study offers a methodology allowing to provide more valuable information fororganisations engaged in the fight against counterfeiting of medicines.A database was established by analyzing counterfeits of a particular pharmaceutical product using Near-infrared (NIR) and Raman spectroscopy. Unsupervised chemometric techniques (i.e. principal component analysis - PCA and hierarchical cluster analysis - HCA) were implemented to identify the classes within the datasets. Gas Chromatography coupled to Mass Spectrometry (GC-MS) and Fourier Transform Infrared Spectroscopy (FT-IR) were used to determine the number of different chemical profiles within the counterfeits. A comparison with the classes established by NIR and Raman spectroscopy allowed to evaluate the discriminating power provided by these techniques. Supervised classifiers (i.e. k-Nearest Neighbors, Partial Least Squares Discriminant Analysis, Probabilistic Neural Networks and Counterpropagation Artificial Neural Networks) were applied on the acquired NIR and Raman spectra and the results were compared to the ones provided by the unsupervised classifiers.The retained strategy for routine applications, founded on the classes identified by NIR and Raman spectroscopy, uses a classification algorithm based on distance measures and Receiver Operating Characteristics (ROC) curves. The model is able to compare the spectrum of a new counterfeit with that of previously analyzed products and to determine if a new specimen belongs to one of the existing classes, consequently allowing to establish a link with other counterfeits of the database.
Resumo:
Segmenting ultrasound images is a challenging problemwhere standard unsupervised segmentation methods such asthe well-known Chan-Vese method fail. We propose in thispaper an efficient segmentation method for this class ofimages. Our proposed algorithm is based on asemi-supervised approach (user labels) and the use ofimage patches as data features. We also consider thePearson distance between patches, which has been shown tobe robust w.r.t speckle noise present in ultrasoundimages. Our results on phantom and clinical data show avery high similarity agreement with the ground truthprovided by a medical expert.
Resumo:
Pulse-wave velocity (PWV) is considered as the gold-standard method to assess arterial stiffness, an independent predictor of cardiovascular morbidity and mortality. Current available devices that measure PWV need to be operated by skilled medical staff, thus, reducing the potential use of PWV in the ambulatory setting. In this paper, we present a new technique allowing continuous, unsupervised measurements of pulse transit times (PTT) in central arteries by means of a chest sensor. This technique relies on measuring the propagation time of pressure pulses from their genesis in the left ventricle to their later arrival at the cutaneous vasculature on the sternum. Combined thoracic impedance cardiography and phonocardiography are used to detect the opening of the aortic valve, from which a pre-ejection period (PEP) value is estimated. Multichannel reflective photoplethysmography at the sternum is used to detect the distal pulse-arrival time (PAT). A PTT value is then calculated as PTT = PAT - PEP. After optimizing the parameters of the chest PTT calculation algorithm on a nine-subject cohort, a prospective validation study involving 31 normo- and hypertensive subjects was performed. 1/chest PTT correlated very well with the COMPLIOR carotid to femoral PWV (r = 0.88, p < 10 (-9)). Finally, an empirical method to map chest PTT values onto chest PWV values is explored.
Resumo:
In this report, the authors present two cases of accidental death in children of addicted parents. In the first case, the child was left unattended at home while the mother went out to buy cocaine. She was arrested and detained with no mention of the unsupervised child. The cause of death in this case was determined to be starvation and dehydration. In the second case, a child mistakenly received a methadone suppository by her father instead of an antipyretic suppository. Toxicological analysis of the femoral blood revealed methadone at a concentration of 1.2 mg/L. The cause of death was determined to be methadone intoxication. The literature is reviewed and discussed. We report these cases to illustrate the risk of harm to children from illicit drugs and prescription medications at home and because there is no mention of accidental death in children following a methadone suppository administration in the current literature.
Resumo:
In this paper we propose a novel unsupervised approach to learning domain-specific ontologies from large open-domain text collections. The method is based on the joint exploitation of Semantic Domains and Super Sense Tagging for Information Retrieval tasks. Our approach is able to retrieve domain specific terms and concepts while associating them with a set of high level ontological types, named supersenses, providing flat ontologies characterized by very high accuracy and pertinence to the domain.
Resumo:
La présente étude est à la fois une évaluation du processus de la mise en oeuvre et des impacts de la police de proximité dans les cinq plus grandes zones urbaines de Suisse - Bâle, Berne, Genève, Lausanne et Zurich. La police de proximité (community policing) est à la fois une philosophie et une stratégie organisationnelle qui favorise un partenariat renouvelé entre la police et les communautés locales dans le but de résoudre les problèmes relatifs à la sécurité et à l'ordre public. L'évaluation de processus a analysé des données relatives aux réformes internes de la police qui ont été obtenues par l'intermédiaire d'entretiens semi-structurés avec des administrateurs clés des cinq départements de police, ainsi que dans des documents écrits de la police et d'autres sources publiques. L'évaluation des impacts, quant à elle, s'est basée sur des variables contextuelles telles que des statistiques policières et des données de recensement, ainsi que sur des indicateurs d'impacts construit à partir des données du Swiss Crime Survey (SCS) relatives au sentiment d'insécurité, à la perception du désordre public et à la satisfaction de la population à l'égard de la police. Le SCS est un sondage régulier qui a permis d'interroger des habitants des cinq grandes zones urbaines à plusieurs reprises depuis le milieu des années 1980. L'évaluation de processus a abouti à un « Calendrier des activités » visant à créer des données de panel permettant de mesurer les progrès réalisés dans la mise en oeuvre de la police de proximité à l'aide d'une grille d'évaluation à six dimensions à des intervalles de cinq ans entre 1990 et 2010. L'évaluation des impacts, effectuée ex post facto, a utilisé un concept de recherche non-expérimental (observational design) dans le but d'analyser les impacts de différents modèles de police de proximité dans des zones comparables à travers les cinq villes étudiées. Les quartiers urbains, délimités par zone de code postal, ont ainsi été regroupés par l'intermédiaire d'une typologie réalisée à l'aide d'algorithmes d'apprentissage automatique (machine learning). Des algorithmes supervisés et non supervisés ont été utilisés sur les données à haute dimensionnalité relatives à la criminalité, à la structure socio-économique et démographique et au cadre bâti dans le but de regrouper les quartiers urbains les plus similaires dans des clusters. D'abord, les cartes auto-organisatrices (self-organizing maps) ont été utilisées dans le but de réduire la variance intra-cluster des variables contextuelles et de maximiser simultanément la variance inter-cluster des réponses au sondage. Ensuite, l'algorithme des forêts d'arbres décisionnels (random forests) a permis à la fois d'évaluer la pertinence de la typologie de quartier élaborée et de sélectionner les variables contextuelles clés afin de construire un modèle parcimonieux faisant un minimum d'erreurs de classification. Enfin, pour l'analyse des impacts, la méthode des appariements des coefficients de propension (propensity score matching) a été utilisée pour équilibrer les échantillons prétest-posttest en termes d'âge, de sexe et de niveau d'éducation des répondants au sein de chaque type de quartier ainsi identifié dans chacune des villes, avant d'effectuer un test statistique de la différence observée dans les indicateurs d'impacts. De plus, tous les résultats statistiquement significatifs ont été soumis à une analyse de sensibilité (sensitivity analysis) afin d'évaluer leur robustesse face à un biais potentiel dû à des covariables non observées. L'étude relève qu'au cours des quinze dernières années, les cinq services de police ont entamé des réformes majeures de leur organisation ainsi que de leurs stratégies opérationnelles et qu'ils ont noué des partenariats stratégiques afin de mettre en oeuvre la police de proximité. La typologie de quartier développée a abouti à une réduction de la variance intra-cluster des variables contextuelles et permet d'expliquer une partie significative de la variance inter-cluster des indicateurs d'impacts avant la mise en oeuvre du traitement. Ceci semble suggérer que les méthodes de géocomputation aident à équilibrer les covariables observées et donc à réduire les menaces relatives à la validité interne d'un concept de recherche non-expérimental. Enfin, l'analyse des impacts a révélé que le sentiment d'insécurité a diminué de manière significative pendant la période 2000-2005 dans les quartiers se trouvant à l'intérieur et autour des centres-villes de Berne et de Zurich. Ces améliorations sont assez robustes face à des biais dus à des covariables inobservées et covarient dans le temps et l'espace avec la mise en oeuvre de la police de proximité. L'hypothèse alternative envisageant que les diminutions observées dans le sentiment d'insécurité soient, partiellement, un résultat des interventions policières de proximité semble donc être aussi plausible que l'hypothèse nulle considérant l'absence absolue d'effet. Ceci, même si le concept de recherche non-expérimental mis en oeuvre ne peut pas complètement exclure la sélection et la régression à la moyenne comme explications alternatives. The current research project is both a process and impact evaluation of community policing in Switzerland's five major urban areas - Basel, Bern, Geneva, Lausanne, and Zurich. Community policing is both a philosophy and an organizational strategy that promotes a renewed partnership between the police and the community to solve problems of crime and disorder. The process evaluation data on police internal reforms were obtained through semi-structured interviews with key administrators from the five police departments as well as from police internal documents and additional public sources. The impact evaluation uses official crime records and census statistics as contextual variables as well as Swiss Crime Survey (SCS) data on fear of crime, perceptions of disorder, and public attitudes towards the police as outcome measures. The SCS is a standing survey instrument that has polled residents of the five urban areas repeatedly since the mid-1980s. The process evaluation produced a "Calendar of Action" to create panel data to measure community policing implementation progress over six evaluative dimensions in intervals of five years between 1990 and 2010. The impact evaluation, carried out ex post facto, uses an observational design that analyzes the impact of the different community policing models between matched comparison areas across the five cities. Using ZIP code districts as proxies for urban neighborhoods, geospatial data mining algorithms serve to develop a neighborhood typology in order to match the comparison areas. To this end, both unsupervised and supervised algorithms are used to analyze high-dimensional data on crime, the socio-economic and demographic structure, and the built environment in order to classify urban neighborhoods into clusters of similar type. In a first step, self-organizing maps serve as tools to develop a clustering algorithm that reduces the within-cluster variance in the contextual variables and simultaneously maximizes the between-cluster variance in survey responses. The random forests algorithm then serves to assess the appropriateness of the resulting neighborhood typology and to select the key contextual variables in order to build a parsimonious model that makes a minimum of classification errors. Finally, for the impact analysis, propensity score matching methods are used to match the survey respondents of the pretest and posttest samples on age, gender, and their level of education for each neighborhood type identified within each city, before conducting a statistical test of the observed difference in the outcome measures. Moreover, all significant results were subjected to a sensitivity analysis to assess the robustness of these findings in the face of potential bias due to some unobserved covariates. The study finds that over the last fifteen years, all five police departments have undertaken major reforms of their internal organization and operating strategies and forged strategic partnerships in order to implement community policing. The resulting neighborhood typology reduced the within-cluster variance of the contextual variables and accounted for a significant share of the between-cluster variance in the outcome measures prior to treatment, suggesting that geocomputational methods help to balance the observed covariates and hence to reduce threats to the internal validity of an observational design. Finally, the impact analysis revealed that fear of crime dropped significantly over the 2000-2005 period in the neighborhoods in and around the urban centers of Bern and Zurich. These improvements are fairly robust in the face of bias due to some unobserved covariate and covary temporally and spatially with the implementation of community policing. The alternative hypothesis that the observed reductions in fear of crime were at least in part a result of community policing interventions thus appears at least as plausible as the null hypothesis of absolutely no effect, even if the observational design cannot completely rule out selection and regression to the mean as alternative explanations.
Resumo:
Colorectal cancer (CRC) is a major cause of cancer mortality. Whereas some patients respond well to therapy, others do not, and thus more precise, individualized treatment strategies are needed. To that end, we analyzed gene expression profiles from 1,290 CRC tumors using consensus-based unsupervised clustering. The resultant clusters were then associated with therapeutic response data to the epidermal growth factor receptor-targeted drug cetuximab in 80 patients. The results of these studies define six clinically relevant CRC subtypes. Each subtype shares similarities to distinct cell types within the normal colon crypt and shows differing degrees of 'stemness' and Wnt signaling. Subtype-specific gene signatures are proposed to identify these subtypes. Three subtypes have markedly better disease-free survival (DFS) after surgical resection, suggesting these patients might be spared from the adverse effects of chemotherapy when they have localized disease. One of these three subtypes, identified by filamin A expression, does not respond to cetuximab but may respond to cMET receptor tyrosine kinase inhibitors in the metastatic setting. Two other subtypes, with poor and intermediate DFS, associate with improved response to the chemotherapy regimen FOLFIRI in adjuvant or metastatic settings. Development of clinically deployable assays for these subtypes and of subtype-specific therapies may contribute to more effective management of this challenging disease.