892 resultados para Two-stage classification
Resumo:
Colorectal cancer (CRC) is a major cause of cancer mortality. Whereas some patients respond well to therapy, others do not, and thus more precise, individualized treatment strategies are needed. To that end, we analyzed gene expression profiles from 1,290 CRC tumors using consensus-based unsupervised clustering. The resultant clusters were then associated with therapeutic response data to the epidermal growth factor receptor-targeted drug cetuximab in 80 patients. The results of these studies define six clinically relevant CRC subtypes. Each subtype shares similarities to distinct cell types within the normal colon crypt and shows differing degrees of 'stemness' and Wnt signaling. Subtype-specific gene signatures are proposed to identify these subtypes. Three subtypes have markedly better disease-free survival (DFS) after surgical resection, suggesting these patients might be spared from the adverse effects of chemotherapy when they have localized disease. One of these three subtypes, identified by filamin A expression, does not respond to cetuximab but may respond to cMET receptor tyrosine kinase inhibitors in the metastatic setting. Two other subtypes, with poor and intermediate DFS, associate with improved response to the chemotherapy regimen FOLFIRI in adjuvant or metastatic settings. Development of clinically deployable assays for these subtypes and of subtype-specific therapies may contribute to more effective management of this challenging disease.
Resumo:
Although the influence of clay mineralogy on soil physical properties has been widely studied, spatial relationships between these features in Alfisols have rarely been examined. The purpose of this work was to relate the clay minerals and physical properties of an Alfisol of sandstone origin in two slope curvatures. The crystallographic properties such as mean crystallite size (MCS) and width at half height (WHH) of hematite, goethite, kaolinite and gibbsite; contents of hematite and goethite; aluminium substitution (AS) and specific surface area (SSA) of hematite and goethite; the goethite/(goethite+hematite) and kaolinite/(kaolinite+gibbsite) ratios; and the citrate/bicarbonate/dithionite extractable Fe (Fe d) were correlated with the soil physical properties through Pearson correlation coefficients and cross-semivariograms. The correlations found between aluminium substitution in goethite and the soil physical properties suggest that the degree of crystallinity of this mineral influences soil properties used as soil quality indicators. Thus, goethite with a high aluminium substitution resulted in large aggregate sizes and a high porosity, and also in a low bulk density and soil penetration resistance. The presence of highly crystalline gibbsite resulted in a high density and micropore content, as well as in smaller aggregates. Interpretation of the cross-semivariogram and classification of landscape compartments in terms of the spatial dependence pattern for the relief-dependent physical and mineralogical properties of the soil proved an effective supplementary method for assessing Pearson correlations between the soil physical and mineralogical properties.
Resumo:
Soil surveys are the main source of spatial information on soils and have a range of different applications, mainly in agriculture. The continuity of this activity has however been severely compromised, mainly due to a lack of governmental funding. The purpose of this study was to evaluate the feasibility of two different classifiers (artificial neural networks and a maximum likelihood algorithm) in the prediction of soil classes in the northwest of the state of Rio de Janeiro. Terrain attributes such as elevation, slope, aspect, plan curvature and compound topographic index (CTI) and indices of clay minerals, iron oxide and Normalized Difference Vegetation Index (NDVI), derived from Landsat 7 ETM+ sensor imagery, were used as discriminating variables. The two classifiers were trained and validated for each soil class using 300 and 150 samples respectively, representing the characteristics of these classes in terms of the discriminating variables. According to the statistical tests, the accuracy of the classifier based on artificial neural networks (ANNs) was greater than of the classic Maximum Likelihood Classifier (MLC). Comparing the results with 126 points of reference showed that the resulting ANN map (73.81 %) was superior to the MLC map (57.94 %). The main errors when using the two classifiers were caused by: a) the geological heterogeneity of the area coupled with problems related to the geological map; b) the depth of lithic contact and/or rock exposure, and c) problems with the environmental correlation model used due to the polygenetic nature of the soils. This study confirms that the use of terrain attributes together with remote sensing data by an ANN approach can be a tool to facilitate soil mapping in Brazil, primarily due to the availability of low-cost remote sensing data and the ease by which terrain attributes can be obtained.
Resumo:
Plasmodium sporozoites make a remarkable journey from the mosquito midgut to the mammalian liver. The sporozoite's major surface protein, circumsporozoite protein (CSP), is a multifunctional protein required for sporozoite development and likely mediates several steps of this journey. In this study, we show that CSP has two conformational states, an adhesive conformation in which the C-terminal cell-adhesive domain is exposed and a nonadhesive conformation in which the N terminus masks this domain. We demonstrate that the cell-adhesive domain functions in sporozoite development and hepatocyte invasion. Between these two events, the sporozoite must travel from the mosquito midgut to the mammalian liver, and N-terminal masking of the cell-adhesive domain maintains the sporozoite in a migratory state. In the mammalian host, proteolytic cleavage of CSP regulates the switch to an adhesive conformation, and the highly conserved region I plays a critical role in this process. If the CSP domain architecture is altered such that the cell-adhesive domain is constitutively exposed, the majority of sporozoites do not reach their target organs, and in the mammalian host, they initiate a blood stage infection directly from the inoculation site. These data provide structure-function information relevant to malaria vaccine development.
Resumo:
Local adaptation of populations requires some degree of spatio-temporal isolation. Previous studies of the two dung fly species Scathophaga stercoraria and Sepsis cynipsea have revealed low levels of geographic and altitudinal genetic differentiation in quantitative life history and morphological traits, but instead high degrees of phenotypic plasticity. These patterns suggest that gene flow is extensive despite considerable geographic barriers and large spatio-temporal variation in selection on body size and related traits. In this study we addressed this hypothesis by investigating genetic differentiation of dung fly populations throughout Switzerland based on the same 10 electrophoretic loci in each species. Overall, we found no significant geographic differentiation of populations for either species. This is inconsistent with the higher rates of gene flow expected due to better flying capacity of the larger S. stercoraria. However, heterozygote deficiencies within populations indicated structuring on a finer scale, seen for several loci in S. cynipsea, and for the locus PGM (Phosphoglucomutase) in S. stercoraria. Additionally, S. cynipsea showed a tendency towards a greater gene diversity at higher altitudes, mediated primarily by the locus MDH (malate dehydrogenase), at which a second allele was only present in populations above 1000 m. This may be caused by increased environmental stress at higher altitudes in this warm-adapted species. MDH might thus be a candidate locus subject to thermal selection in this species, but this remains to be corroborated by direct evidence. In S. stercoraria, no altitudinal variation was found.
Resumo:
BACKGROUND: Pathological complete response (pCR) following chemotherapy is strongly associated with both breast cancer subtype and long-term survival. Within a phase III neoadjuvant chemotherapy trial, we sought to determine whether the prognostic implications of pCR, TP53 status and treatment arm (taxane versus non-taxane) differed between intrinsic subtypes. PATIENTS AND METHODS: Patients were randomized to receive either six cycles of anthracycline-based chemotherapy or three cycles of docetaxel then three cycles of eprirubicin/docetaxel (T-ET). pCR was defined as no evidence of residual invasive cancer (or very few scattered tumour cells) in primary tumour and lymph nodes. We used a simplified intrinsic subtypes classification, as suggested by the 2011 St Gallen consensus. Interactions between pCR, TP53 status, treatment arm and intrinsic subtype on event-free survival (EFS), distant metastasis-free survival (DMFS) and overall survival (OS) were studied using a landmark and a two-step approach multivariate analyses. RESULTS: Sufficient data for pCR analyses were available in 1212 (65%) of 1856 patients randomized. pCR occurred in 222 of 1212 (18%) patients: 37 of 496 (7.5%) luminal A, 22 of 147 (15%) luminal B/HER2 negative, 51 of 230 (22%) luminal B/HER2 positive, 43 of 118 (36%) HER2 positive/non-luminal, 69 of 221(31%) triple negative (TN). The prognostic effect of pCR on EFS did not differ between subtypes and was an independent predictor for better EFS [hazard ratio (HR) = 0.40, P < 0.001 in favour of pCR], DMFS (HR = 0.32, P < 0.001) and OS (HR = 0.32, P < 0.001). Chemotherapy arm was an independent predictor only for EFS (HR = 0.73, P = 0.004 in favour of T-ET). The interaction between TP53, intrinsic subtypes and survival outcomes only approached statistical significance for EFS (P = 0.1). CONCLUSIONS: pCR is an independent predictor of favourable clinical outcomes in all molecular subtypes in a two-step multivariate analysis. CLINICALTRIALSGOV: EORTC 10994/BIG 1-00 Trial registration number NCT00017095.
Resumo:
Résumé Suite aux recentes avancées technologiques, les archives d'images digitales ont connu une croissance qualitative et quantitative sans précédent. Malgré les énormes possibilités qu'elles offrent, ces avancées posent de nouvelles questions quant au traitement des masses de données saisies. Cette question est à la base de cette Thèse: les problèmes de traitement d'information digitale à très haute résolution spatiale et/ou spectrale y sont considérés en recourant à des approches d'apprentissage statistique, les méthodes à noyau. Cette Thèse étudie des problèmes de classification d'images, c'est à dire de catégorisation de pixels en un nombre réduit de classes refletant les propriétés spectrales et contextuelles des objets qu'elles représentent. L'accent est mis sur l'efficience des algorithmes, ainsi que sur leur simplicité, de manière à augmenter leur potentiel d'implementation pour les utilisateurs. De plus, le défi de cette Thèse est de rester proche des problèmes concrets des utilisateurs d'images satellite sans pour autant perdre de vue l'intéret des méthodes proposées pour le milieu du machine learning dont elles sont issues. En ce sens, ce travail joue la carte de la transdisciplinarité en maintenant un lien fort entre les deux sciences dans tous les développements proposés. Quatre modèles sont proposés: le premier répond au problème de la haute dimensionalité et de la redondance des données par un modèle optimisant les performances en classification en s'adaptant aux particularités de l'image. Ceci est rendu possible par un système de ranking des variables (les bandes) qui est optimisé en même temps que le modèle de base: ce faisant, seules les variables importantes pour résoudre le problème sont utilisées par le classifieur. Le manque d'information étiquétée et l'incertitude quant à sa pertinence pour le problème sont à la source des deux modèles suivants, basés respectivement sur l'apprentissage actif et les méthodes semi-supervisées: le premier permet d'améliorer la qualité d'un ensemble d'entraînement par interaction directe entre l'utilisateur et la machine, alors que le deuxième utilise les pixels non étiquetés pour améliorer la description des données disponibles et la robustesse du modèle. Enfin, le dernier modèle proposé considère la question plus théorique de la structure entre les outputs: l'intègration de cette source d'information, jusqu'à présent jamais considérée en télédétection, ouvre des nouveaux défis de recherche. Advanced kernel methods for remote sensing image classification Devis Tuia Institut de Géomatique et d'Analyse du Risque September 2009 Abstract The technical developments in recent years have brought the quantity and quality of digital information to an unprecedented level, as enormous archives of satellite images are available to the users. However, even if these advances open more and more possibilities in the use of digital imagery, they also rise several problems of storage and treatment. The latter is considered in this Thesis: the processing of very high spatial and spectral resolution images is treated with approaches based on data-driven algorithms relying on kernel methods. In particular, the problem of image classification, i.e. the categorization of the image's pixels into a reduced number of classes reflecting spectral and contextual properties, is studied through the different models presented. The accent is put on algorithmic efficiency and the simplicity of the approaches proposed, to avoid too complex models that would not be used by users. The major challenge of the Thesis is to remain close to concrete remote sensing problems, without losing the methodological interest from the machine learning viewpoint: in this sense, this work aims at building a bridge between the machine learning and remote sensing communities and all the models proposed have been developed keeping in mind the need for such a synergy. Four models are proposed: first, an adaptive model learning the relevant image features has been proposed to solve the problem of high dimensionality and collinearity of the image features. This model provides automatically an accurate classifier and a ranking of the relevance of the single features. The scarcity and unreliability of labeled. information were the common root of the second and third models proposed: when confronted to such problems, the user can either construct the labeled set iteratively by direct interaction with the machine or use the unlabeled data to increase robustness and quality of the description of data. Both solutions have been explored resulting into two methodological contributions, based respectively on active learning and semisupervised learning. Finally, the more theoretical issue of structured outputs has been considered in the last model, which, by integrating outputs similarity into a model, opens new challenges and opportunities for remote sensing image processing.
Resumo:
BACKGROUND: To compare the prognostic relevance of Masaoka and Müller-Hermelink classifications. METHODS: We treated 71 patients with thymic tumors at our institution between 1980 and 1997. Complete follow-up was achieved in 69 patients (97%) with a mean follow up-time of 8.3 years (range, 9 months to 17 years). RESULTS: Masaoka stage I was found in 31 patients (44.9%), stage II in 17 (24.6%), stage III in 19 (27.6%), and stage IV in 2 (2.9%). The 10-year overall survival rate was 83.5% for stage I, 100% for stage IIa, 58% for stage IIb, 44% for stage III, and 0% for stage IV. The disease-free survival rates were 100%, 70%, 40%, 38%, and 0%, respectively. Histologic classification according to Müller-Hermelink found medullary tumors in 7 patients (10.1%), mixed in 18 (26.1%), organoid in 14 (20.3%), cortical in 11 (15.9%), well-differentiated thymic carcinoma in 14 (20.3%), and endocrine carcinoma in 5 (7.3%), with 10-year overall survival rates of 100%, 75%, 92%, 87.5%, 30%, and 0%, respectively, and 10-year disease-free survival rates of 100%, 100%, 77%, 75%, 37%, and 0%, respectively. Medullary, mixed, and well-differentiated organoid tumors were correlated with stage I and II, and well-differentiated thymic carcinoma and endocrine carcinoma with stage III and IV (p < 0.001). Multivariate analysis showed age, gender, myasthenia gravis, and postoperative adjuvant therapy not to be significant predictors of overall and disease-free survival after complete resection, whereas the Müller-Hermelink and Masaoka classifications were independent significant predictors for overall (p < 0.05) and disease-free survival (p < 0.004; p < 0.0001). CONCLUSIONS: The consideration of staging and histology in thymic tumors has the potential to improve recurrence prediction and patient selection for combined treatment modalities.
Resumo:
ABSTRACT Preservation of mangroves, a very significant ecosystem from a social, economic, and environmental viewpoint, requires knowledge on soil composition, genesis, morphology, and classification. These aspects are of paramount importance to understand the dynamics of sustainability and preservation of this natural resource. In this study mangrove soils in the Subaé river basin were described and classified and inorganic waste concentrations evaluated. Seven pedons of mangrove soil were chosen, five under fluvial influence and two under marine influence and analyzed for morphology. Samples of horizons and layers were collected for physical and chemical analyses, including heavy metals (Pb, Cd, Mn, Zn, and Fe). The moist soils were suboxidic, with Eh values below 350 mV. The pH level of the pedons under fluvial influence ranged from moderately acid to alkaline, while the pH in pedons under marine influence was around 7.0 throughout the profile. The concentration of cations in the sorting complex for all pedons, independent of fluvial or marine influence, indicated the following order: Na+>Mg2+>Ca2+>K+. Mangrove soils from the Subaé river basin under fluvial and marine influence had different morphological, physical, and chemical characteristics. The highest Pb and Cd concentrations were found in the pedons under fluvial influence, perhaps due to their closeness to the mining company Plumbum, while the concentrations in pedon P7 were lowest, due to greater distance from the factory. For containing at least one metal above the reference levels established by the National Oceanic and Atmospheric Administration (United States Environmental Protection Agency), the pedons were classified as potentially toxic. The soils were classified as Gleissolos Tiomórficos Órticos (sálicos) sódico neofluvissólico in according to the Brazilian Soil Classification System, indicating potential toxicity and very poor drainage, except for pedon P7, which was classified in the same subgroup as the others, but different in that the metal concentrations met acceptable standards.
Resumo:
The molecular mechanisms controlling the progression of melanoma from a localized tumor to an invasive and metastatic disease are poorly understood. In the attempt to start defining a functional protein profile of melanoma progression, we have analyzed by LC-MS/MS the proteins associated with detergent resistant membranes (DRMs), which are enriched in cholesterol/sphingolipids-containing membrane rafts, of melanoma cell lines derived from tumors at different stages of progression. Since membrane rafts are involved in several biological processes, including signal transduction and protein trafficking, we hypothesized that the association of proteins with rafts can be regulated during melanoma development and affect protein function and disease progression. We have identified a total of 177 proteins in the DRMs of the cell lines examined. Among these, we have found groups of proteins preferentially associated with DRMs of either less malignant radial growth phase/vertical growth phase (VGP) cells, or aggressive VGP and metastatic cells suggesting that melanoma cells with different degrees of malignancy have different DRM profiles. Moreover, some proteins were found in DRMs of only some cell lines despite being expressed at similar levels in all the cell lines examined, suggesting the existence of mechanisms controlling their association with DRMs. We expect that understanding the mechanisms regulating DRM targeting and the activity of the proteins differentially associated with DRMs in relation to cell malignancy will help identify new molecular determinants of melanoma progression.
Resumo:
PURPOSE: To objectively characterize different heart tissues from functional and viability images provided by composite-strain-encoding (C-SENC) MRI. MATERIALS AND METHODS: C-SENC is a new MRI technique for simultaneously acquiring cardiac functional and viability images. In this work, an unsupervised multi-stage fuzzy clustering method is proposed to identify different heart tissues in the C-SENC images. The method is based on sequential application of the fuzzy c-means (FCM) and iterative self-organizing data (ISODATA) clustering algorithms. The proposed method is tested on simulated heart images and on images from nine patients with and without myocardial infarction (MI). The resulting clustered images are compared with MRI delayed-enhancement (DE) viability images for determining MI. Also, Bland-Altman analysis is conducted between the two methods. RESULTS: Normal myocardium, infarcted myocardium, and blood are correctly identified using the proposed method. The clustered images correctly identified 90 +/- 4% of the pixels defined as infarct in the DE images. In addition, 89 +/- 5% of the pixels defined as infarct in the clustered images were also defined as infarct in DE images. The Bland-Altman results show no bias between the two methods in identifying MI. CONCLUSION: The proposed technique allows for objectively identifying divergent heart tissues, which would be potentially important for clinical decision-making in patients with MI.
Resumo:
OBJECTIVE: To develop and compare two new technologies for diagnosing a contiguous gene syndrome, the Williams-Beuren syndrome (WBS). METHODS: The first proposed method, named paralogous sequence quantification (PSQ), is based on the use of paralogous sequences located on different chromosomes and quantification of specific mismatches present at these loci using pyrosequencing technology. The second exploits quantitative real time polymerase chain reaction (QPCR) to assess the relative quantity of an analysed locus. RESULTS: A correct and unambiguous diagnosis was obtained for 100% of the analysed samples with either technique (n = 165 and n = 155, respectively). These methods allowed the identification of two patients with atypical deletions in a cohort of 182 WBS patients. Both patients presented with mild facial anomalies, mild mental retardation with impaired visuospatial cognition, supravalvar aortic stenosis, and normal growth indices. These observations are consistent with the involvement of GTF2IRD1 or GTF2I in some of the WBS facial features. CONCLUSIONS: Both PSQ and QPCR are robust, easy to interpret, and simple to set up. They represent a competitive alternative for the diagnosis of segmental aneuploidies in clinical laboratories. They have advantages over fluorescence in situ hybridisation or microsatellites/SNP genotyping for detecting short segmental aneuploidies as the former is costly and labour intensive while the latter depends on the informativeness of the polymorphisms.
Resumo:
The methylation status of the O(6)-methylguanine-DNA methyltransferase (MGMT) gene is an important predictive biomarker for benefit from alkylating agent therapy in glioblastoma. Recent studies in anaplastic glioma suggest a prognostic value for MGMT methylation. Investigation of pathogenetic and epigenetic features of this intriguingly distinct behavior requires accurate MGMT classification to assess high throughput molecular databases. Promoter methylation-mediated gene silencing is strongly dependent on the location of the methylated CpGs, complicating classification. Using the HumanMethylation450 (HM-450K) BeadChip interrogating 176 CpGs annotated for the MGMT gene, with 14 located in the promoter, two distinct regions in the CpG island of the promoter were identified with high importance for gene silencing and outcome prediction. A logistic regression model (MGMT-STP27) comprising probes cg1243587 and cg12981137 provided good classification properties and prognostic value (kappa = 0.85; log-rank p < 0.001) using a training-set of 63 glioblastomas from homogenously treated patients, for whom MGMT methylation was previously shown to be predictive for outcome based on classification by methylation-specific PCR. MGMT-STP27 was successfully validated in an independent cohort of chemo-radiotherapy-treated glioblastoma patients (n = 50; kappa = 0.88; outcome, log-rank p < 0.001). Lower prevalence of MGMT methylation among CpG island methylator phenotype (CIMP) positive tumors was found in glioblastomas from The Cancer Genome Atlas than in low grade and anaplastic glioma cohorts, while in CIMP-negative gliomas MGMT was classified as methylated in approximately 50 % regardless of tumor grade. The proposed MGMT-STP27 prediction model allows mining of datasets derived on the HM-450K or HM-27K BeadChip to explore effects of distinct epigenetic context of MGMT methylation suspected to modulate treatment resistance in different tumor types.
Resumo:
Ants provide remarkable examples of equivalent genotypes developing into divergent and discrete phenotypes. Diploid eggs can develop either into queens, which specialize in reproduction, or workers, which participate in cooperative tasks such as building the nest, collecting food, and rearing the young. In contrast, the differentiation between males and females generally depends upon whether eggs are fertilized, with fertilized (diploid) eggs giving rise to females and unfertilized (haploid) eggs giving rise to males. To obtain a comprehensive picture of the relative contributions of gender (sex), caste, developmental stage, and species divergence to gene expression evolution, we investigated gene expression patterns in pupal and adult queens, workers, and males of two species of fire ants, Solenopsis invicta and S. richteri. Microarray hybridizations revealed that variation in gene expression profiles is influenced more by developmental stage than by caste membership, sex, or species identity. The second major contributor to variation in gene expression was the combination of sex and caste. Although workers and queens share equivalent diploid nuclear genomes, they have highly distinctive patterns of gene expression in both the pupal and the adult stages, as might be expected given their extraordinary level of phenotypic differentiation. Overall, the difference in the proportion of differentially expressed genes was greater between workers and males than between workers and queens or queens and males, consistent with the fact that workers and males share neither gender nor reproductive capability. Moreover, between-species comparisons revealed that the greatest difference in gene expression patterns occurred in adult workers, a finding consistent with the fact that adult workers most directly experience the distinct external environments characterizing the different habitats occupied by the two species. Thus, much of the evolution of gene expression in ants may occur in the worker caste, despite the fact that these individuals are largely or completely sterile. Analyses of gene expression evolution revealed a combination of positive selection and relaxation of stabilizing selection as important factors driving the evolution of such genes.
Resumo:
Stream channel erosion in the deep loess soils region of western Iowa causes severe damage along hundreds of miles of streams in twenty-two counties. The goal of this project was to develop information, systems, and procedures for use in making resource allocation decisions related to the protection of transportation facilities and farmland from damages caused by stream channel erosion. Section one of this report provides an introduction. Section two presents an assessment of stream channel conditions from aerial and field reconnaissance conducted in 1993 and 1994 and a classification of the streams based on a six stage model of stream channel evolution. A Geographic Information System is discussed that has been developed to store and analyze data on the stream conditions and affected infrastructure and assist in the planning of stabilization measures. Section three presents an evaluation of two methods for predicting the extent of channel degradation. Section four presents an estimate of costs associated with damages from stream channel erosion since the time of channelization until 1992. Damage to highway bridges represent the highest costs associated with channel erosion, followed by railroad bridges and right-of-way; loss of agricultural land represents the third highest cost. An estimate of costs associated with future channel erosion on western Iowa streams is also presented in section four. Section four also presents a procedure to estimate the benefits and costs of implementing stream stabilization measures. The final section of this report, section five, presents information on the development of the organizational structure and administrative procedures which are being used to plan, coordinate, and implement stream stabilization projects and programs in western Iowa.