205 resultados para Rainfall event classification
Resumo:
Introduction: As part of the MicroArray Quality Control (MAQC)-II project, this analysis examines how the choice of univariate feature-selection methods and classification algorithms may influence the performance of genomic predictors under varying degrees of prediction difficulty represented by three clinically relevant endpoints. Methods: We used gene-expression data from 230 breast cancers (grouped into training and independent validation sets), and we examined 40 predictors (five univariate feature-selection methods combined with eight different classifiers) for each of the three endpoints. Their classification performance was estimated on the training set by using two different resampling methods and compared with the accuracy observed in the independent validation set. Results: A ranking of the three classification problems was obtained, and the performance of 120 models was estimated and assessed on an independent validation set. The bootstrapping estimates were closer to the validation performance than were the cross-validation estimates. The required sample size for each endpoint was estimated, and both gene-level and pathway-level analyses were performed on the obtained models. Conclusions: We showed that genomic predictor accuracy is determined largely by an interplay between sample size and classification difficulty. Variations on univariate feature-selection methods and choice of classification algorithm have only a modest impact on predictor performance, and several statistically equally good predictors can be developed for any given classification problem.
Resumo:
Very little is known about early molecular events triggering epithelial cell differentiation. We have examined the possible role of tyrosine phosphorylation in this process, as observed in cultures of primary mouse keratinocytes after exposure to calcium or 12-O-tetradecanoylphorbol-13-acetate (TPA). Immunoblotting with phosphotyrosine-specific antibodies as well as direct phosphoamino acid analysis revealed that induction of tyrosine phosphorylation occurs as a very early and specific event in keratinocyte differentiation. Very little or no induction of tyrosine phosphorylation was observed in a keratinocyte cell line resistant to the differentiating effects of calcium. Treatment of cells with tyrosine kinase inhibitors prevented induction of tyrosine phosphorylation by calcium and TPA and interfered with the differentiative effects of these agents. These results suggest that specific activation of tyrosine kinase(s) may play an important regulatory role in keratinocyte differentiation.
Resumo:
To compare the impact of meeting specific classification criteria [modified New York (mNY), European Spondyloarthropathy Study Group (ESSG), and Assessment of SpondyloArthritis international Society (ASAS) criteria] on anti-tumor necrosis factor (anti-TNF) drug retention, and to determine predictive factors of better drug survival. All patients fulfilling the ESSG criteria for axial spondyloarthritis (SpA) with available data on the axial ASAS and mNY criteria, and who had received at least one anti-TNF treatment were retrospectively retrieved in a single academic institution in Switzerland. Drug retention was computed using survival analysis (Kaplan-Meier), adjusted for potential confounders. Of the 137 patients classified as having axial SpA using the ESSG criteria, 112 also met the ASAS axial SpA criteria, and 77 fulfilled the mNY criteria. Drug retention rates at 12 and 24 months for the first biologic therapy were not significantly different between the diagnostic groups. Only the small ASAS non-classified axial SpA group (25 patients) showed a nonsignificant trend toward shorter drug survival. Elevated CRP level, but not the presence of bone marrow edema on magnetic resonance imaging (MRI) scans, was associated with significantly better drug retention (OR 7.9, ICR 4-14). In this cohort, anti-TNF drug survival was independent of the classification criteria. Elevated CRP level, but not positive MRI, was associated with better drug retention.
Resumo:
Defining an efficient training set is one of the most delicate phases for the success of remote sensing image classification routines. The complexity of the problem, the limited temporal and financial resources, as well as the high intraclass variance can make an algorithm fail if it is trained with a suboptimal dataset. Active learning aims at building efficient training sets by iteratively improving the model performance through sampling. A user-defined heuristic ranks the unlabeled pixels according to a function of the uncertainty of their class membership and then the user is asked to provide labels for the most uncertain pixels. This paper reviews and tests the main families of active learning algorithms: committee, large margin, and posterior probability-based. For each of them, the most recent advances in the remote sensing community are discussed and some heuristics are detailed and tested. Several challenging remote sensing scenarios are considered, including very high spatial resolution and hyperspectral image classification. Finally, guidelines for choosing the good architecture are provided for new and/or unexperienced user.
Resumo:
Background Individual signs and symptoms are of limited value for the diagnosis of influenza. Objective To develop a decision tree for the diagnosis of influenza based on a classification and regression tree (CART) analysis. Methods Data from two previous similar cohort studies were assembled into a single dataset. The data were randomly divided into a development set (70%) and a validation set (30%). We used CART analysis to develop three models that maximize the number of patients who do not require diagnostic testing prior to treatment decisions. The validation set was used to evaluate overfitting of the model to the training set. Results Model 1 has seven terminal nodes based on temperature, the onset of symptoms and the presence of chills, cough and myalgia. Model 2 was a simpler tree with only two splits based on temperature and the presence of chills. Model 3 was developed with temperature as a dichotomous variable (≥38°C) and had only two splits based on the presence of fever and myalgia. The area under the receiver operating characteristic curves (AUROCC) for the development and validation sets, respectively, were 0.82 and 0.80 for Model 1, 0.75 and 0.76 for Model 2 and 0.76 and 0.77 for Model 3. Model 2 classified 67% of patients in the validation group into a high- or low-risk group compared with only 38% for Model 1 and 54% for Model 3. Conclusions A simple decision tree (Model 2) classified two-thirds of patients as low or high risk and had an AUROCC of 0.76. After further validation in an independent population, this CART model could support clinical decision making regarding influenza, with low-risk patients requiring no further evaluation for influenza and high-risk patients being candidates for empiric symptomatic or drug therapy.
Resumo:
Fanon, Senghor, and Ela took a radical stance in criticising the structures and mechanisms of power in hegemonic situations and relations between colonial subjects and colonial masters. They aimed to liberate African societies by decolonising the mind, culture and religion of colonial subjects. In this respect, we are concerned with the continuities and ruptures of the colonial encounter and its unequal relationships. Switzerland does not have an official colonial history and yet, Swiss companies and migrants were and are part of the world's colonies. In our contribution, we question what makes an event postcolonial : in other words, how are postcolonial relations negotiated in Switzerland? We discuss this question by analysing two annual sacred journeys in Switzerland that have been invented for and by African Christians (clerics and laity) together with the leaders of the Swiss Catholic church : one to the relics of African saints in St. Maurice, canton Valais and the other to the Black Madonna, the Virgin Mary of Einsiedeln, in the canton Schwyz. These events are empowered by the performance of African choirs - their music, dance, and costumes - but to which end and in which way?
Resumo:
BACKGROUND: Sudden cardiac death (SCD) among the young is a rare and devastating event, but its exact incidence in many countries remains unknown. An autopsy is recommended in every case because some of the cardiac pathologies may have a genetic origin, which can have an impact on the living family members. The aims of this retrospective study completed in the canton of Vaud, Switzerland were to determine both the incidence of SCD and the autopsy rate for individuals from 5 to 39 years of age. METHODS: The study was conducted from 2000 to 2007 on the basis of official statistics and analysis of the International Classification of Diseases codes for potential SCDs and other deaths that might have been due to cardiac disease. RESULTS: During the 8 year study period there was an average of 292'546 persons aged 5-39 and there were a total of 1122 deaths, certified as potential SCDs in 3.6% of cases. The calculated incidence is 1.71/100'000 person-years (2.73 for men and 0.69 for women). If all possible cases of SCD (unexplained deaths, drowning, traffic accidents, etc.) are included, the incidence increases to 13.67/100'000 person-years. However, the quality of the officially available data was insufficient to provide an accurate incidence of SCD as well as autopsy rates. The presumed autopsy rate of sudden deaths classified as diseases of the circulatory system is 47.5%. For deaths of unknown cause (11.1% of the deaths), the autopsy was conducted in 13.7% of the cases according to codified data. CONCLUSIONS: The incidence of presumed SCD in the canton of Vaud, Switzerland, is comparable to the data published in the literature for other geographic regions but may be underestimated as it does not take into account other potential SCDs, as unexplained deaths. Increasing the autopsy rate of SCD in the young, better management of information obtained from autopsies as well developing of structured registry could improve the reliability of the statistical data, optimize the diagnostic procedures, and the preventive measures for the family members.
Resumo:
This study presents a classification criteria for two-class Cannabis seedlings. As the cultivation of drug type cannabis is forbidden in Switzerland, law enforcement authorities regularly ask laboratories to determine cannabis plant's chemotype from seized material in order to ascertain that the plantation is legal or not. In this study, the classification analysis is based on data obtained from the relative proportion of three major leaf compounds measured by gas-chromatography interfaced with mass spectrometry (GC-MS). The aim is to discriminate between drug type (illegal) and fiber type (legal) cannabis at an early stage of the growth. A Bayesian procedure is proposed: a Bayes factor is computed and classification is performed on the basis of the decision maker specifications (i.e. prior probability distributions on cannabis type and consequences of classification measured by losses). Classification rates are computed with two statistical models and results are compared. Sensitivity analysis is then performed to analyze the robustness of classification criteria.
Resumo:
Résumé de la thèse L'évolution des systèmes policiers donne une place prépondérante à l'information et au renseignement. Cette transformation implique de développer et de maintenir un ensemble de processus permanent d'analyse de la criminalité, en particulier pour traiter des événements répétitifs ou graves. Dans une organisation aux ressources limitées, le temps consacré au recueil des données, à leur codification et intégration, diminue le temps disponible pour l'analyse et la diffusion de renseignements. Les phases de collecte et d'intégration restent néanmoins indispensables, l'analyse n'étant pas possible sur des données volumineuses n'ayant aucune structure. Jusqu'à présent, ces problématiques d'analyse ont été abordées par des approches essentiellement spécialisées (calculs de hot-sports, data mining, ...) ou dirigées par un seul axe (par exemple, les sciences comportementales). Cette recherche s'inscrit sous un angle différent, une démarche interdisciplinaire a été adoptée. L'augmentation continuelle de la quantité de données à analyser tend à diminuer la capacité d'analyse des informations à disposition. Un bon découpage (classification) des problèmes rencontrés permet de délimiter les analyses sur des données pertinentes. Ces classes sont essentielles pour structurer la mémoire du système d'analyse. Les statistiques policières de la criminalité devraient déjà avoir répondu à ces questions de découpage de la délinquance (classification juridique). Cette décomposition a été comparée aux besoins d'un système de suivi permanent dans la criminalité. La recherche confirme que nos efforts pour comprendre la nature et la répartition du crime se butent à un obstacle, à savoir que la définition juridique des formes de criminalité n'est pas adaptée à son analyse, à son étude. Depuis près de vingt ans, les corps de police de Suisse romande utilisent et développent un système de classification basé sur l'expérience policière (découpage par phénomène). Cette recherche propose d'interpréter ce système dans le cadre des approches situationnelles (approche théorique) et de le confronter aux données « statistiques » disponibles pour vérifier sa capacité à distinguer les formes de criminalité. La recherche se limite aux cambriolages d'habitations, un délit répétitif fréquent. La théorie des opportunités soutien qu'il faut réunir dans le temps et dans l'espace au minimum les trois facteurs suivants : un délinquant potentiel, une cible intéressante et l'absence de gardien capable de prévenir ou d'empêcher le passage à l'acte. Ainsi, le délit n'est possible que dans certaines circonstances, c'est-à-dire dans un contexte bien précis. Identifier ces contextes permet catégoriser la criminalité. Chaque cas est unique, mais un groupe de cas montre des similitudes. Par exemple, certaines conditions avec certains environnements attirent certains types de cambrioleurs. Deux hypothèses ont été testées. La première est que les cambriolages d'habitations ne se répartissent pas uniformément dans les classes formées par des « paramètres situationnels » ; la deuxième que des niches apparaissent en recoupant les différents paramètres et qu'elles correspondent à la classification mise en place par la coordination judiciaire vaudoise et le CICOP. La base de données vaudoise des cambriolages enregistrés entre 1997 et 2006 par la police a été utilisée (25'369 cas). Des situations spécifiques ont été mises en évidence, elles correspondent aux classes définies empiriquement. Dans une deuxième phase, le lien entre une situation spécifique et d'activité d'un auteur au sein d'une même situation a été vérifié. Les observations réalisées dans cette recherche indiquent que les auteurs de cambriolages sont actifs dans des niches. Plusieurs auteurs sériels ont commis des délits qui ne sont pas dans leur niche, mais le nombre de ces infractions est faible par rapport au nombre de cas commis dans la niche. Un système de classification qui correspond à des réalités criminelles permet de décomposer les événements et de mettre en place un système d'alerte et de suivi « intelligent ». Une nouvelle série dans un phénomène sera détectée par une augmentation du nombre de cas de ce phénomène, en particulier dans une région et à une période donnée. Cette nouvelle série, mélangée parmi l'ensemble des délits, ne serait pas forcément détectable, en particulier si elle se déplace. Finalement, la coopération entre les structures de renseignement criminel opérationnel en Suisse romande a été améliorée par le développement d'une plateforme d'information commune et le système de classification y a été entièrement intégré.
Resumo:
Lipids available in fingermark residue represent important targets for enhancement and dating techniques. While it is well known that lipid composition varies among fingermarks of the same donor (intra-variability) and between fingermarks of different donors (inter-variability), the extent of this variability remains uncharacterised. Thus, this worked aimed at studying qualitatively and quantitatively the initial lipid composition of fingermark residue of 25 different donors. Among the 104 detected lipids, 43 were reported for the first time in the literature. Furthermore, palmitic acid, squalene, cholesterol, myristyl myristate and myristyl myristoleate were quantified and their correlation within fingermark residue was highlighted. Ten compounds were then selected and further studied as potential targets for dating or enhancement techniques. It was shown that their relative standard deviation was significantly lower for the intra-variability than for the inter-variability. Moreover, the use of data pretreatments could significantly reduce this variability. Based on these observations, an objective donor classification model was proposed. Hierarchical cluster analysis was conducted on the pre-treated data and the fingermarks of the 25 donors were classified into two main groups, corresponding to "poor" and "rich" lipid donors. The robustness of this classification was tested using fingermark replicates of selected donors. 86% of these replicates were correctly classified, showing the potential of such a donor classification model for research purposes in order to select representative donors based on compounds of interest.
Resumo:
In the recent years, kernel methods have revealed very powerful tools in many application domains in general and in remote sensing image classification in particular. The special characteristics of remote sensing images (high dimension, few labeled samples and different noise sources) are efficiently dealt with kernel machines. In this paper, we propose the use of structured output learning to improve remote sensing image classification based on kernels. Structured output learning is concerned with the design of machine learning algorithms that not only implement input-output mapping, but also take into account the relations between output labels, thus generalizing unstructured kernel methods. We analyze the framework and introduce it to the remote sensing community. Output similarity is here encoded into SVM classifiers by modifying the model loss function and the kernel function either independently or jointly. Experiments on a very high resolution (VHR) image classification problem shows promising results and opens a wide field of research with structured output kernel methods.