923 resultados para classification aided by clustering
Resumo:
PURPOSE: From February 2001 to February 2002, 946 patients with advanced GI stromal tumors (GISTs) treated with imatinib were included in a controlled EORTC/ISG/AGITG (European Organisation for Research and Treatment of Cancer/Italian Sarcoma Group/Australasian Gastro-Intestinal Trials Group) trial. This analysis investigates whether the response classification assessed by RECIST (Response Evaluation Criteria in Solid Tumors), predicts for time to progression (TTP) and overall survival (OS). PATIENTS AND METHODS: Per protocol, the first three disease assessments were done at 2, 4, and 6 months. For the purpose of the analysis (landmark method), disease response was subclassified in six categories: partial response (PR; > 30% size reduction), minor response (MR; 10% to 30% reduction), no change (NC) as either NC- (0% to 10% reduction) or NC+ (0% to 20% size increase), progressive disease (PD; > 20% increase/new lesions), and subjective PD (clinical progression). RESULTS: A total of 906 patients had measurable disease at entry. At all measurement time points, complete response (CR), PR, and MR resulted in similar TTP and OS; this was also true for NC- and NC+, and for PD and subjective PD. Patients were subsequently classified as responders (CR/PR/MR), NC (NC+/NC-), or PD. This three-class response categorization was found to be highly predictive of further progression or survival for the first two measurement points. After 6 months of imatinib, responders (CR/PR/MR) had the same survival prognosis as patients classified as NC. CONCLUSION: RECIST perfectly enables early discrimination between patients who benefited long term from imatinib and those who did not. After 6 months of imatinib, if the patient is not experiencing PD, the pattern of radiologic response by tumor size criteria has no prognostic value for further outcome. Imatinib needs to be continued as long as there is no progression according to RECIST.
Resumo:
Predictive groundwater modeling requires accurate information about aquifer characteristics. Geophysical imaging is a powerful tool for delineating aquifer properties at an appropriate scale and resolution, but it suffers from problems of ambiguity. One way to overcome such limitations is to adopt a simultaneous multitechnique inversion strategy. We have developed a methodology for aquifer characterization based on structural joint inversion of multiple geophysical data sets followed by clustering to form zones and subsequent inversion for zonal parameters. Joint inversions based on cross-gradient structural constraints require less restrictive assumptions than, say, applying predefined petro-physical relationships and generally yield superior results. This approach has, for the first time, been applied to three geophysical data types in three dimensions. A classification scheme using maximum likelihood estimation is used to determine the parameters of a Gaussian mixture model that defines zonal geometries from joint-inversion tomograms. The resulting zones are used to estimate representative geophysical parameters of each zone, which are then used for field-scale petrophysical analysis. A synthetic study demonstrated how joint inversion of seismic and radar traveltimes and electrical resistance tomography (ERT) data greatly reduces misclassification of zones (down from 21.3% to 3.7%) and improves the accuracy of retrieved zonal parameters (from 1.8% to 0.3%) compared to individual inversions. We applied our scheme to a data set collected in northeastern Switzerland to delineate lithologic subunits within a gravel aquifer. The inversion models resolve three principal subhorizontal units along with some important 3D heterogeneity. Petro-physical analysis of the zonal parameters indicated approximately 30% variation in porosity within the gravel aquifer and an increasing fraction of finer sediments with depth.
Resumo:
The paper presents a novel method for monitoring network optimisation, based on a recent machine learning technique known as support vector machine. It is problem-oriented in the sense that it directly answers the question of whether the advised spatial location is important for the classification model. The method can be used to increase the accuracy of classification models by taking a small number of additional measurements. Traditionally, network optimisation is performed by means of the analysis of the kriging variances. The comparison of the method with the traditional approach is presented on a real case study with climate data.
Resumo:
The so-called "enchondromatoses" are skeletal disorders defined by the presence of ectopic cartilaginous tissue within bone tissue. The clinical and radiographic features of the different enchondromatoses are distinct, and grouping them does not reflect a common pathogenesis but simply a similar radiographic appearance and thus the need for a differential diagnosis. Recent advances in the understanding of their molecular and cellular bases confirm the heterogeneous nature of the different enchondromatoses. Some, like Ollier disease, Maffucci disease, metaphyseal chondromatosis with hydroxyglutaric aciduria, and metachondromatosis are produced by a dysregulation of chondrocyte proliferation, while others (such as spondyloenchondrodysplasia or dysspondyloenchondromatosis) are caused by defects in structure or metabolism of cartilage or bone matrix. In other forms (e.g., the dominantly inherited genochondromatoses), the basic defect remains to be determined. The classification, proposed by Spranger and associates in 1978 and tentatively revised twice, was based on the radiographic appearance, the anatomic sites involved, and the mode of inheritance. The new classification proposed here integrates the molecular genetic advances and delineates phenotypic families based on the molecular defects. Reference radiographs are provided to help in the diagnosis of the well-defined forms. In spite of advances, many cases remain difficult to diagnose and classify, implying that more variants remain to be defined at both the clinical and molecular levels. © 2012 Wiley Periodicals, Inc.
Resumo:
Ferrofluids belonging to the series NixFe1 xFe2O4 were synthesised by two different procedures—one by standard co-precipitation techniques, the other by co-precipitation for synthesis of particles and dispersion aided by high-energy ball milling with a view to understand the effect of strain and size anisotropy on the magneto-optical properties of ferrofluids. The birefringence measurements were carried out using a standard ellipsometer. The birefringence signal obtained for chemically synthesised samples was satisfactorily fitted to the standard second Langevin function. The ball-milled ferrofluids showed a deviation and their birefringence was enhanced by an order. This large enhancement in the birefringence value cannot be attributed to the increase in grain size of the samples, considering that the grain sizes of sample synthesised by both modes are comparable; instead, it can be attributed to the lattice strain-induced shape anisotropy(oblation) arising from the high-energy ball-milling process. Thus magnetic-optical (MO) signals can be tuned by ball-milling process, which can find potential applications.
Resumo:
Superparamagnetic nanocomposites based on Y-Fe2O3 and sulphonated polystyrene were synthesised by ion-exchange process and the structural characterisation has been carried out using X-ray diffraction technique. Doping of cobalt in to the Y-Fe2O3 lattice was effected in situ and the doping was varied in the atomic percentage range 1–10. The optical absorption studies show a band gap of 2.84 eV, which is blue shifted by 0.64 eV when compared to the reported values for the bulk samples (2.2 eV). This is explained on the basis of weak quantum confinement. Further size reduction can result in a strong confinement, which can yield transparent magnetic nanocomposites because of further blue shifting. The band gap gets red shifted further with the addition of cobalt in the lattice and this red shift increases with the increase in doping. The observed red shift can be attributed to the strain in the lattice caused by the anisotropy induced by the addition of cobalt. Thus, tuning of bandgap and blue shifting is aided by weak exciton confinement and further red shifting of the bandgap is assisted by cobalt doping.
Resumo:
In recent years there is an apparent shift in research from content based image retrieval (CBIR) to automatic image annotation in order to bridge the gap between low level features and high level semantics of images. Automatic Image Annotation (AIA) techniques facilitate extraction of high level semantic concepts from images by machine learning techniques. Many AIA techniques use feature analysis as the first step to identify the objects in the image. However, the high dimensional image features make the performance of the system worse. This paper describes and evaluates an automatic image annotation framework which uses SURF descriptors to select right number of features and right features for annotation. The proposed framework uses a hybrid approach in which k-means clustering is used in the training phase and fuzzy K-NN classification in the annotation phase. The performance of the system is evaluated using standard metrics.
Resumo:
Ferrofluids belonging to the series NixFe1 xFe2O4 were synthesised by two different procedures—one by standard co-precipitation techniques, the other by co-precipitation for synthesis of particles and dispersion aided by high-energy ball milling with a view to understand the effect of strain and size anisotropy on the magneto-optical properties of ferrofluids. The birefringence measurements were carried out using a standard ellipsometer. The birefringence signal obtained for chemically synthesised samples was satisfactorily fitted to the standard second Langevin function. The ball-milled ferrofluids showed a deviation and their birefringence was enhanced by an order. This large enhancement in the birefringence value cannot be attributed to the increase in grain size of the samples, considering that the grain sizes of sample synthesised by both modes are comparable; instead, it can be attributed to the lattice strain-induced shape anisotropy(oblation) arising from the high-energy ball-milling process. Thus magnetic-optical (MO) signals can be tuned by ball-milling process, which can find potential applications
Resumo:
Knowledge discovery in databases is the non-trivial process of identifying valid, novel potentially useful and ultimately understandable patterns from data. The term Data mining refers to the process which does the exploratory analysis on the data and builds some model on the data. To infer patterns from data, data mining involves different approaches like association rule mining, classification techniques or clustering techniques. Among the many data mining techniques, clustering plays a major role, since it helps to group the related data for assessing properties and drawing conclusions. Most of the clustering algorithms act on a dataset with uniform format, since the similarity or dissimilarity between the data points is a significant factor in finding out the clusters. If a dataset consists of mixed attributes, i.e. a combination of numerical and categorical variables, a preferred approach is to convert different formats into a uniform format. The research study explores the various techniques to convert the mixed data sets to a numerical equivalent, so as to make it equipped for applying the statistical and similar algorithms. The results of clustering mixed category data after conversion to numeric data type have been demonstrated using a crime data set. The thesis also proposes an extension to the well known algorithm for handling mixed data types, to deal with data sets having only categorical data. The proposed conversion has been validated on a data set corresponding to breast cancer. Moreover, another issue with the clustering process is the visualization of output. Different geometric techniques like scatter plot, or projection plots are available, but none of the techniques display the result projecting the whole database but rather demonstrate attribute-pair wise analysis
Resumo:
Short video on laser classification produced by the National Physical Laboratory
Resumo:
L'increment de bases de dades que cada vegada contenen imatges més difícils i amb un nombre més elevat de categories, està forçant el desenvolupament de tècniques de representació d'imatges que siguin discriminatives quan es vol treballar amb múltiples classes i d'algorismes que siguin eficients en l'aprenentatge i classificació. Aquesta tesi explora el problema de classificar les imatges segons l'objecte que contenen quan es disposa d'un gran nombre de categories. Primerament s'investiga com un sistema híbrid format per un model generatiu i un model discriminatiu pot beneficiar la tasca de classificació d'imatges on el nivell d'anotació humà sigui mínim. Per aquesta tasca introduïm un nou vocabulari utilitzant una representació densa de descriptors color-SIFT, i desprès s'investiga com els diferents paràmetres afecten la classificació final. Tot seguit es proposa un mètode par tal d'incorporar informació espacial amb el sistema híbrid, mostrant que la informació de context es de gran ajuda per la classificació d'imatges. Desprès introduïm un nou descriptor de forma que representa la imatge segons la seva forma local i la seva forma espacial, tot junt amb un kernel que incorpora aquesta informació espacial en forma piramidal. La forma es representada per un vector compacte obtenint un descriptor molt adequat per ésser utilitzat amb algorismes d'aprenentatge amb kernels. Els experiments realitzats postren que aquesta informació de forma te uns resultats semblants (i a vegades millors) als descriptors basats en aparença. També s'investiga com diferents característiques es poden combinar per ésser utilitzades en la classificació d'imatges i es mostra com el descriptor de forma proposat juntament amb un descriptor d'aparença millora substancialment la classificació. Finalment es descriu un algoritme que detecta les regions d'interès automàticament durant l'entrenament i la classificació. Això proporciona un mètode per inhibir el fons de la imatge i afegeix invariança a la posició dels objectes dins les imatges. S'ensenya que la forma i l'aparença sobre aquesta regió d'interès i utilitzant els classificadors random forests millora la classificació i el temps computacional. Es comparen els postres resultats amb resultats de la literatura utilitzant les mateixes bases de dades que els autors Aixa com els mateixos protocols d'aprenentatge i classificació. Es veu com totes les innovacions introduïdes incrementen la classificació final de les imatges.
Resumo:
This contribution proposes a powerful technique for two-class imbalanced classification problems by combining the synthetic minority over-sampling technique (SMOTE) and the particle swarm optimisation (PSO) aided radial basis function (RBF) classifier. In order to enhance the significance of the small and specific region belonging to the positive class in the decision region, the SMOTE is applied to generate synthetic instances for the positive class to balance the training data set. Based on the over-sampled training data, the RBF classifier is constructed by applying the orthogonal forward selection procedure, in which the classifier's structure and the parameters of RBF kernels are determined using a PSO algorithm based on the criterion of minimising the leave-one-out misclassification rate. The experimental results obtained on a simulated imbalanced data set and three real imbalanced data sets are presented to demonstrate the effectiveness of our proposed algorithm.
Resumo:
Airborne lidar provides accurate height information of objects on the earth and has been recognized as a reliable and accurate surveying tool in many applications. In particular, lidar data offer vital and significant features for urban land-cover classification, which is an important task in urban land-use studies. In this article, we present an effective approach in which lidar data fused with its co-registered images (i.e. aerial colour images containing red, green and blue (RGB) bands and near-infrared (NIR) images) and other derived features are used effectively for accurate urban land-cover classification. The proposed approach begins with an initial classification performed by the Dempster–Shafer theory of evidence with a specifically designed basic probability assignment function. It outputs two results, i.e. the initial classification and pseudo-training samples, which are selected automatically according to the combined probability masses. Second, a support vector machine (SVM)-based probability estimator is adopted to compute the class conditional probability (CCP) for each pixel from the pseudo-training samples. Finally, a Markov random field (MRF) model is established to combine spatial contextual information into the classification. In this stage, the initial classification result and the CCP are exploited. An efficient belief propagation (EBP) algorithm is developed to search for the global minimum-energy solution for the maximum a posteriori (MAP)-MRF framework in which three techniques are developed to speed up the standard belief propagation (BP) algorithm. Lidar and its co-registered data acquired by Toposys Falcon II are used in performance tests. The experimental results prove that fusing the height data and optical images is particularly suited for urban land-cover classification. There is no training sample needed in the proposed approach, and the computational cost is relatively low. An average classification accuracy of 93.63% is achieved.
Resumo:
Studying joint noise is an important parameter for diagnosing temporomandibular dysfunction. In this study, eight groups (n=9) were formed according to joint dysfunction classification, provided by employing vibration analysis equipment. Parameters for analyzing joint noise were: total vibration energy, peak amplitude, and peak frequency. Mouth opening range was also analyzed. Statistical analysis results for each parameter were significant at 1 %. Each analyzed group presented different noise characteristics. This allowed for inclusion of the groups within a determined value category. The patient group with normal condyle/disk relationship always presented the lowest values. The type of joint noise was characterized by analyzing total integral noise, peak amplitude, peak frequency, and mouth opening. Analyzing joint noise using electrovibratography suggests the type of joint dysfunction and may help to establish a diagnosis, as well as a treatment plan.
Resumo:
Data mining is a relatively new field of research that its objective is to acquire knowledge from large amounts of data. In medical and health care areas, due to regulations and due to the availability of computers, a large amount of data is becoming available [27]. On the one hand, practitioners are expected to use all this data in their work but, at the same time, such a large amount of data cannot be processed by humans in a short time to make diagnosis, prognosis and treatment schedules. A major objective of this thesis is to evaluate data mining tools in medical and health care applications to develop a tool that can help make rather accurate decisions. In this thesis, the goal is finding a pattern among patients who got pneumonia by clustering of lab data values which have been recorded every day. By this pattern we can generalize it to the patients who did not have been diagnosed by this disease whose lab values shows the same trend as pneumonia patients does. There are 10 tables which have been extracted from a big data base of a hospital in Jena for my work .In ICU (intensive care unit), COPRA system which is a patient management system has been used. All the tables and data stored in German Language database.