791 resultados para Rule-Based Classification
Resumo:
In this paper, we propose two active learning algorithms for semiautomatic definition of training samples in remote sensing image classification. Based on predefined heuristics, the classifier ranks the unlabeled pixels and automatically chooses those that are considered the most valuable for its improvement. Once the pixels have been selected, the analyst labels them manually and the process is iterated. Starting with a small and nonoptimal training set, the model itself builds the optimal set of samples which minimizes the classification error. We have applied the proposed algorithms to a variety of remote sensing data, including very high resolution and hyperspectral images, using support vector machines. Experimental results confirm the consistency of the methods. The required number of training samples can be reduced to 10% using the methods proposed, reaching the same level of accuracy as larger data sets. A comparison with a state-of-the-art active learning method, margin sampling, is provided, highlighting advantages of the methods proposed. The effect of spatial resolution and separability of the classes on the quality of the selection of pixels is also discussed.
Resumo:
Several features that can be extracted from digital images of the sky and that can be useful for cloud-type classification of such images are presented. Some features are statistical measurements of image texture, some are based on the Fourier transform of the image and, finally, others are computed from the image where cloudy pixels are distinguished from clear-sky pixels. The use of the most suitable features in an automatic classification algorithm is also shown and discussed. Both the features and the classifier are developed over images taken by two different camera devices, namely, a total sky imager (TSI) and a whole sky imager (WSC), which are placed in two different areas of the world (Toowoomba, Australia; and Girona, Spain, respectively). The performance of the classifier is assessed by comparing its image classification with an a priori classification carried out by visual inspection of more than 200 images from each camera. The index of agreement is 76% when five different sky conditions are considered: clear, low cumuliform clouds, stratiform clouds (overcast), cirriform clouds, and mottled clouds (altocumulus, cirrocumulus). Discussion on the future directions of this research is also presented, regarding both the use of other features and the use of other classification techniques
Resumo:
BACKGROUND: The nuclear receptors are a large family of eukaryotic transcription factors that constitute major pharmacological targets. They exert their combinatorial control through homotypic heterodimerisation. Elucidation of this dimerisation network is vital in order to understand the complex dynamics and potential cross-talk involved. RESULTS: Phylogeny, protein-protein interactions, protein-DNA interactions and gene expression data have been integrated to provide a comprehensive and up-to-date description of the topology and properties of the nuclear receptor interaction network in humans. We discriminate between DNA-binding and non-DNA-binding dimers, and provide a comprehensive interaction map, that identifies potential cross-talk between the various pathways of nuclear receptors. CONCLUSION: We infer that the topology of this network is hub-based, and much more connected than previously thought. The hub-based topology of the network and the wide tissue expression pattern of NRs create a highly competitive environment for the common heterodimerising partners. Furthermore, a significant number of negative feedback loops is present, with the hub protein SHP [NR0B2] playing a major role. We also compare the evolution, topology and properties of the nuclear receptor network with the hub-based dimerisation network of the bHLH transcription factors in order to identify both unique themes and ubiquitous properties in gene regulation. In terms of methodology, we conclude that such a comprehensive picture can only be assembled by semi-automated text-mining, manual curation and integration of data from various sources.
Resumo:
Introduction : L'équipe mobile de soins palliatifs intra hospitalière (EMSP) du Centre Hospitalier Universitaire Vaudois (CHUV) a été mise en place en 1996. Il s'agit d'une des premières équipes interdisciplinaire de consultants mise à disposition d'un hôpital tertiaire. Le CHUV est l'hôpital de proximité de la ville de Lausanne (850 lits) mais aussi un hôpital de référence pour le reste du canton. En 2007, il y a eu 38'359 patients hospitalisés au CHUV. Les facteurs d'évaluation du taux d'utilisation d'une équipe mobile de soins palliatifs consultantes sont variés et complexes. Plusieurs méthodes sont décrites dans la littérature pour tenter de répondre à cette problématique. Avant de pouvoir évaluer l'utilisation de notre équipe mobile consultante de soins palliatifs intra hospitalière, il nous est apparu nécessaire de mieux décrire et définir la population qui meurt dans notre institution. McNamara et collègues ont proposé des critères qui classifient une population palliative comme « minimale », « intermédiaire » ou « maximale ». L'objectif de cette étude est de déterminer le taux de patients décédés au CHUV sur une période de 4 mois (Γ1 février au 31 mai 2007) suivie par notre EMSP en utilisant la méthode de classification «minimal » et « maximal ». Méthode : les archives médicales du CHUV ont été analysées pour chaque patient adulte décédé pendant la période sélectionnée. Les populations « maximal » et « minimal » de ces patients ont été ensuite déterminées selon des critères basés sur les codes diagnostiques figurants sur les certificats de décès. De ces deux populations, nous avons identifié à partir de notre base de données, les patients qui ont été suivie par notre EMSP. Le CHUV utilise les mêmes codes diagnostiques (International Classification of Disease, ICD) que ceux utilisés dans la classification de McNamara. Une recherche pilote effectuée dans les archives médicales du CHUV manuellement en analysant en profondeur l'ensemble du dossier médical a révélé que la classification de la population « minimal » pouvait être biaisée notamment en raison d'une confusion entre la cause directe du décès (complication d'une maladie) et la maladie de base. Nous avons estimé le pourcentage d'erreur de codification en analysé un échantillon randomisé de patients qui remplissait les critères « minimal ». Résultats : sur un total de 294 décès, 263 (89%) remplissaient initialement les critères « maximal » et 83 (28%) les critères «minimal», l'analyse de l'échantillon randomisé de 56 dossiers de patients sur les 180 qui ne remplissaient pas les critères « minimal » ont révélé que 21 (38%) auraient dus être inclus dans la population « minimal ». L'EMSP a vu 67/263 (25.5%) de la population palliative « maximal » et 56/151 (37.1%) de la population palliative « minimal ». Conclusion : cette étude souligne l'utilité de la méthode proposée par McNamara pour déterminer la population de patients palliatifs. Cependant, notre travail illustre aussi une limite importante de l'estimation de la population « minima » en lien avec l'imprécision des causes de décès figurant sur les certificats de décès de notre institution. Nos résultats mettent aussi en lumière que l'EMSP de notre institution est clairement sous- utilisée. Nous prévoyons une étude prospective de plus large envergure utilisant la même méthodologie afin d'approfondir les résultats de cette étude pilote.
Resumo:
ABSTRACT: BACKGROUND: Serologic testing algorithms for recent HIV seroconversion (STARHS) provide important information for HIV surveillance. We have shown that a patient's antibody reaction in a confirmatory line immunoassay (INNO-LIATM HIV I/II Score, Innogenetics) provides information on the duration of infection. Here, we sought to further investigate the diagnostic specificity of various Inno-Lia algorithms and to identify factors affecting it. METHODS: Plasma samples of 714 selected patients of the Swiss HIV Cohort Study infected for longer than 12 months and representing all viral clades and stages of chronic HIV-1 infection were tested blindly by Inno-Lia and classified as either incident (up to 12 m) or older infection by 24 different algorithms. Of the total, 524 patients received HAART, 308 had HIV-1 RNA below 50 copies/mL, and 620 were infected by a HIV-1 non-B clade. Using logistic regression analysis we evaluated factors that might affect the specificity of these algorithms. RESULTS: HIV-1 RNA <50 copies/mL was associated with significantly lower reactivity to all five HIV-1 antigens of the Inno-Lia and impaired specificity of most algorithms. Among 412 patients either untreated or with HIV-1 RNA ≥50 copies/mL despite HAART, the median specificity of the algorithms was 96.5% (range 92.0-100%). The only factor that significantly promoted false-incident results in this group was age, with false-incident results increasing by a few percent per additional year. HIV-1 clade, HIV-1 RNA, CD4 percentage, sex, disease stage, and testing modalities exhibited no significance. Results were similar among 190 untreated patients. CONCLUSIONS: The specificity of most Inno-Lia algorithms was high and not affected by HIV-1 variability, advanced disease and other factors promoting false-recent results in other STARHS. Specificity should be good in any group of untreated HIV-1 patients.
Resumo:
CONTEXT AND OBJECTIVES: A multicentric study was set up to assess the feasibility for Swiss cancer registries of actively retrieving 3 additional variables of epidemiological and a etiological relevance for melanoma, and of potential use for the evaluation of prevention campaigns. MATERIAL AND METHODS: The skin type, family history of melanoma and precise anatomical site were retrieved for melanoma cases registered in 5 Swiss cantons (Neuchâtel, St-Gall and Appenzell, Vaud and Wallis) over 3 to 6 consecutive years (1995-2002). Data were obtained via a short questionnaire administered by the physicians - mostly dermatologists - who originally excised the lesions. As the detailed body site was routinely collected in Ticino, data from this Cancer Registry were included in the body site analysis. Relative melanoma density (RMD) was computed by the ratio of observed to expected numbers of melanomas allowing for body site surface areas, and further adjusted for site-specific melanocyte density. RESULTS: Of the 1,645 questionnaires sent, 1,420 (86.3%) were returned. The detailed cutaneous site and skin type were reliably obtained for 84.7% and 78.7% of questionnaires, and family history was known in 76% of instances. Prevalence of sun-sensitive subjects and patients with melanoma affected first-degree relatives, two target groups for early detection and surveillance campaigns were 54.1% and 3.4%, respectively. After translation into the 4th digit of the International Classification of Diseases for Oncology, the anatomical site codes from printed (original information) and pictorial support (body chart from the questionnaire) concurred for 94.6% of lesions. Discrepancies occurred mostly for lesions on the upper, outer part of the shoulder for which the clinician's textual description was "shoulder blade". This differential misclassification suggests under-estimation by about 10% of melanomas of the upper limbs and an over-estimation of 5% for truncal melanomas. Sites of highest melanoma risk were the face, the shoulder and the upper arm for sexes, the back for men and the leg for women. Three major features of this series were: (1) an unexpectedly high RMD for the face in women (6.2 vs 4.2 in men), (2) the absence of a male predominance for melanomas on the ears, and (3) for the upper limbs, a steady gradient of increasing melanoma density with increasing proximity to the trunk, regardless of sex. DISCUSSION AND CONCLUSION: The feasibility of retrieving the skin type, the precise anatomical location and family history of melanoma in a reliable manner was demonstrated thanks to the collaboration of Swiss dermatologists. Use of a schematic body drawing improves the quality of the anatomical site data and facilitate the reporting task of doctors. Age and sex patterns of RMD paralleled general indicators of sun exposure and behaviour, except for the hand (RMD=0.2). These Swiss results support some site or sun exposure specificity in the aetiology of melanoma.
Resumo:
BACKGROUND: Several studies have established Glioblastoma Multiforme (GBM) prognostic and predictive models based on age and Karnofsky Performance Status (KPS), while very few studies evaluated the prognostic and predictive significance of preoperative MR-imaging. However, to date, there is no simple preoperative GBM classification that also correlates with a highly prognostic genomic signature. Thus, we present for the first time a biologically relevant, and clinically applicable tumor Volume, patient Age, and KPS (VAK) GBM classification that can easily and non-invasively be determined upon patient admission. METHODS: We quantitatively analyzed the volumes of 78 GBM patient MRIs present in The Cancer Imaging Archive (TCIA) corresponding to patients in The Cancer Genome Atlas (TCGA) with VAK annotation. The variables were then combined using a simple 3-point scoring system to form the VAK classification. A validation set (N = 64) from both the TCGA and Rembrandt databases was used to confirm the classification. Transcription factor and genomic correlations were performed using the gene pattern suite and Ingenuity Pathway Analysis. RESULTS: VAK-A and VAK-B classes showed significant median survival differences in discovery (P = 0.007) and validation sets (P = 0.008). VAK-A is significantly associated with P53 activation, while VAK-B shows significant P53 inhibition. Furthermore, a molecular gene signature comprised of a total of 25 genes and microRNAs was significantly associated with the classes and predicted survival in an independent validation set (P = 0.001). A favorable MGMT promoter methylation status resulted in a 10.5 months additional survival benefit for VAK-A compared to VAK-B patients. CONCLUSIONS: The non-invasively determined VAK classification with its implication of VAK-specific molecular regulatory networks, can serve as a very robust initial prognostic tool, clinical trial selection criteria, and important step toward the refinement of genomics-based personalized therapy for GBM patients.
Resumo:
The paper deals with the development and application of the generic methodology for automatic processing (mapping and classification) of environmental data. General Regression Neural Network (GRNN) is considered in detail and is proposed as an efficient tool to solve the problem of spatial data mapping (regression). The Probabilistic Neural Network (PNN) is considered as an automatic tool for spatial classifications. The automatic tuning of isotropic and anisotropic GRNN/PNN models using cross-validation procedure is presented. Results are compared with the k-Nearest-Neighbours (k-NN) interpolation algorithm using independent validation data set. Real case studies are based on decision-oriented mapping and classification of radioactively contaminated territories.
Resumo:
Colorectal cancer (CRC) is a major cause of cancer mortality. Whereas some patients respond well to therapy, others do not, and thus more precise, individualized treatment strategies are needed. To that end, we analyzed gene expression profiles from 1,290 CRC tumors using consensus-based unsupervised clustering. The resultant clusters were then associated with therapeutic response data to the epidermal growth factor receptor-targeted drug cetuximab in 80 patients. The results of these studies define six clinically relevant CRC subtypes. Each subtype shares similarities to distinct cell types within the normal colon crypt and shows differing degrees of 'stemness' and Wnt signaling. Subtype-specific gene signatures are proposed to identify these subtypes. Three subtypes have markedly better disease-free survival (DFS) after surgical resection, suggesting these patients might be spared from the adverse effects of chemotherapy when they have localized disease. One of these three subtypes, identified by filamin A expression, does not respond to cetuximab but may respond to cMET receptor tyrosine kinase inhibitors in the metastatic setting. Two other subtypes, with poor and intermediate DFS, associate with improved response to the chemotherapy regimen FOLFIRI in adjuvant or metastatic settings. Development of clinically deployable assays for these subtypes and of subtype-specific therapies may contribute to more effective management of this challenging disease.
Resumo:
Soil surveys are the main source of spatial information on soils and have a range of different applications, mainly in agriculture. The continuity of this activity has however been severely compromised, mainly due to a lack of governmental funding. The purpose of this study was to evaluate the feasibility of two different classifiers (artificial neural networks and a maximum likelihood algorithm) in the prediction of soil classes in the northwest of the state of Rio de Janeiro. Terrain attributes such as elevation, slope, aspect, plan curvature and compound topographic index (CTI) and indices of clay minerals, iron oxide and Normalized Difference Vegetation Index (NDVI), derived from Landsat 7 ETM+ sensor imagery, were used as discriminating variables. The two classifiers were trained and validated for each soil class using 300 and 150 samples respectively, representing the characteristics of these classes in terms of the discriminating variables. According to the statistical tests, the accuracy of the classifier based on artificial neural networks (ANNs) was greater than of the classic Maximum Likelihood Classifier (MLC). Comparing the results with 126 points of reference showed that the resulting ANN map (73.81 %) was superior to the MLC map (57.94 %). The main errors when using the two classifiers were caused by: a) the geological heterogeneity of the area coupled with problems related to the geological map; b) the depth of lithic contact and/or rock exposure, and c) problems with the environmental correlation model used due to the polygenetic nature of the soils. This study confirms that the use of terrain attributes together with remote sensing data by an ANN approach can be a tool to facilitate soil mapping in Brazil, primarily due to the availability of low-cost remote sensing data and the ease by which terrain attributes can be obtained.
Resumo:
Résumé Suite aux recentes avancées technologiques, les archives d'images digitales ont connu une croissance qualitative et quantitative sans précédent. Malgré les énormes possibilités qu'elles offrent, ces avancées posent de nouvelles questions quant au traitement des masses de données saisies. Cette question est à la base de cette Thèse: les problèmes de traitement d'information digitale à très haute résolution spatiale et/ou spectrale y sont considérés en recourant à des approches d'apprentissage statistique, les méthodes à noyau. Cette Thèse étudie des problèmes de classification d'images, c'est à dire de catégorisation de pixels en un nombre réduit de classes refletant les propriétés spectrales et contextuelles des objets qu'elles représentent. L'accent est mis sur l'efficience des algorithmes, ainsi que sur leur simplicité, de manière à augmenter leur potentiel d'implementation pour les utilisateurs. De plus, le défi de cette Thèse est de rester proche des problèmes concrets des utilisateurs d'images satellite sans pour autant perdre de vue l'intéret des méthodes proposées pour le milieu du machine learning dont elles sont issues. En ce sens, ce travail joue la carte de la transdisciplinarité en maintenant un lien fort entre les deux sciences dans tous les développements proposés. Quatre modèles sont proposés: le premier répond au problème de la haute dimensionalité et de la redondance des données par un modèle optimisant les performances en classification en s'adaptant aux particularités de l'image. Ceci est rendu possible par un système de ranking des variables (les bandes) qui est optimisé en même temps que le modèle de base: ce faisant, seules les variables importantes pour résoudre le problème sont utilisées par le classifieur. Le manque d'information étiquétée et l'incertitude quant à sa pertinence pour le problème sont à la source des deux modèles suivants, basés respectivement sur l'apprentissage actif et les méthodes semi-supervisées: le premier permet d'améliorer la qualité d'un ensemble d'entraînement par interaction directe entre l'utilisateur et la machine, alors que le deuxième utilise les pixels non étiquetés pour améliorer la description des données disponibles et la robustesse du modèle. Enfin, le dernier modèle proposé considère la question plus théorique de la structure entre les outputs: l'intègration de cette source d'information, jusqu'à présent jamais considérée en télédétection, ouvre des nouveaux défis de recherche. Advanced kernel methods for remote sensing image classification Devis Tuia Institut de Géomatique et d'Analyse du Risque September 2009 Abstract The technical developments in recent years have brought the quantity and quality of digital information to an unprecedented level, as enormous archives of satellite images are available to the users. However, even if these advances open more and more possibilities in the use of digital imagery, they also rise several problems of storage and treatment. The latter is considered in this Thesis: the processing of very high spatial and spectral resolution images is treated with approaches based on data-driven algorithms relying on kernel methods. In particular, the problem of image classification, i.e. the categorization of the image's pixels into a reduced number of classes reflecting spectral and contextual properties, is studied through the different models presented. The accent is put on algorithmic efficiency and the simplicity of the approaches proposed, to avoid too complex models that would not be used by users. The major challenge of the Thesis is to remain close to concrete remote sensing problems, without losing the methodological interest from the machine learning viewpoint: in this sense, this work aims at building a bridge between the machine learning and remote sensing communities and all the models proposed have been developed keeping in mind the need for such a synergy. Four models are proposed: first, an adaptive model learning the relevant image features has been proposed to solve the problem of high dimensionality and collinearity of the image features. This model provides automatically an accurate classifier and a ranking of the relevance of the single features. The scarcity and unreliability of labeled. information were the common root of the second and third models proposed: when confronted to such problems, the user can either construct the labeled set iteratively by direct interaction with the machine or use the unlabeled data to increase robustness and quality of the description of data. Both solutions have been explored resulting into two methodological contributions, based respectively on active learning and semisupervised learning. Finally, the more theoretical issue of structured outputs has been considered in the last model, which, by integrating outputs similarity into a model, opens new challenges and opportunities for remote sensing image processing.
Resumo:
More than 60% of neuroendocrine tumours, also called carcinoids, are localised within the gastrointestinal tract. Small bowel neuroendocrine tumours have been diagnosed with increasing frequency over the past 35 years, being the second most frequent tumours of the small intestine. Ileal neuroendocrine tumours diagnosis is late because patients have non-specific symptoms. We have proposed to illustrate as an example the case of a patient, and on its basis, to make a brief review of the literature on small bowel neuroendocrine tumours, resuming several recent changes in the field, concerning classification criteria of these tumours and new recommendations and current advances in diagnosis and treatment. This patient came to our emergency department with a complete bowel obstruction, along with a 2-year history of peristaltic abdominal pain, vomits and diarrhoea episodes. During emergency laparotomy, an ileal stricture was observed, that showed to be a neuroendocrine tumour of the small bowel.
Resumo:
Geographic information systems (GIS) and artificial intelligence (AI) techniques were used to develop an intelligent snow removal asset management system (SRAMS). The system has been evaluated through a case study examining snow removal from the roads in Black Hawk County, Iowa, for which the Iowa Department of Transportation (Iowa DOT) is responsible. The SRAMS is comprised of an expert system that contains the logical rules and expertise of the Iowa DOT’s snow removal experts in Black Hawk County, and a geographic information system to access and manage road data. The system is implemented on a mid-range PC by integrating MapObjects 2.1 (a GIS package), Visual Rule Studio 2.2 (an AI shell), and Visual Basic 6.0 (a programming tool). The system could efficiently be used to generate prioritized snowplowing routes in visual format, to optimize the allocation of assets for plowing, and to track materials (e.g., salt and sand). A test of the system reveals an improvement in snowplowing time by 1.9 percent for moderate snowfall and 9.7 percent for snowstorm conditions over the current manual system.
Resumo:
To analyse the association between chondrocalcinosis and osteoarthritis (OA) of the hands and knees in an unselected elderly rural population. METHODS--A community based cross sectional study was performed in individuals randomly selected from a previous epidemiological survey on the prevalence of chondrocalcinosis in people older than 60 years from Osona county, Catalonia, northeastern Spain. Radiological OA (grade 2 or more of Kellgren's classification) was evaluated in 26 individuals with chondrocalcinosis and in 104 controls. A total of 18 articular areas of both knees (medial and lateral tibiofemoral compartments) and hands (first, second and third metacarpophalangeal (MCP), first carpometacarpal, trapezium-scaphoid, radiocarpal and distal radioulnar joints) were studied. RESULTS--Radiological changes of OA in the knees were more common in subjects with chondrocalcinosis than in those without it, with an odds ratio adjusted for age and gender (aOR) of 4.3 (95% confidence interval (CI) 1.6 to 11.8, p = 0.005). OA was also more frequent in almost all areas of the hands in individuals with chondrocalcinosis, though the difference reached statistical significance only in the MCP joints (aOR 3.1; 95% CI 1.1 to 8.8; p = 0.033). However, taking into account the side and the different joint compartments analysed, the association between chondrocalcinosis and OA was significant only in the lateral tibiofemoral compartment and the left MCP joints. CONCLUSIONS--In an elderly population unselected for their rheumatic complaints, there was a real association between OA and chondrocalcinosis. This association was particularly relevant in the lateral tibiofemoral compartment of the knee and in the first three left MCP joints.
Resumo:
For several years, the lack of consensus on definition, nomenclature, natural history, and biology of serrated polyps (SPs) of the colon has created considerable confusion among pathologists. According to the latest WHO classification, the family of SPs comprises hyperplastic polyps (HPs), sessile serrated adenomas/polyps (SSA/Ps), and traditional serrated adenomas (TSAs). The term SSA/P with dysplasia has replaced the category of mixed hyperplastic/adenomatous polyps (MPs). The present study aimed to evaluate the reproducibility of the diagnosis of SPs based on currently available diagnostic criteria and interactive consensus development. In an initial round, H&E slides of 70 cases of SPs were circulated among participating pathologists across Europe. This round was followed by a consensus discussion on diagnostic criteria. A second round was performed on the same 70 cases using the revised criteria and definitions according to the recent WHO classification. Data were evaluated for inter-observer agreement using Kappa statistics. In the initial round, for the total of 70 cases, a fair overall kappa value of 0.318 was reached, while in the second round overall kappa value improved to moderate (kappa = 0.557; p < 0.001). Overall kappa values for each diagnostic category also significantly improved in the final round, reaching 0.977 for HP, 0.912 for SSA/P, and 0.845 for TSA (p < 0.001). The diagnostic reproducibility of SPs improves when strictly defined, standardized diagnostic criteria adopted by consensus are applied.