899 resultados para binary to multi-class classifiers
Resumo:
Objectives: To assess the potential source of variation that surgeon may add to patient outcome in a clinical trial of surgical procedures. Methods: Two large (n = 1380) parallel multicentre randomized surgical trials were undertaken to compare laparoscopically assisted hysterectomy with conventional methods of abdominal and vaginal hysterectomy; involving 43 surgeons. The primary end point of the trial was the occurrence of at least one major complication. Patients were nested within surgeons giving the data set a hierarchical structure. A total of 10% of patients had at least one major complication, that is, a sparse binary outcome variable. A linear mixed logistic regression model (with logit link function) was used to model the probability of a major complication, with surgeon fitted as a random effect. Models were fitted using the method of maximum likelihood in SAS((R)). Results: There were many convergence problems. These were resolved using a variety of approaches including; treating all effects as fixed for the initial model building; modelling the variance of a parameter on a logarithmic scale and centring of continuous covariates. The initial model building process indicated no significant 'type of operation' across surgeon interaction effect in either trial, the 'type of operation' term was highly significant in the abdominal trial, and the 'surgeon' term was not significant in either trial. Conclusions: The analysis did not find a surgeon effect but it is difficult to conclude that there was not a difference between surgeons. The statistical test may have lacked sufficient power, the variance estimates were small with large standard errors, indicating that the precision of the variance estimates may be questionable.
Resumo:
Several real problems involve the classification of data into categories or classes. Given a data set containing data whose classes are known, Machine Learning algorithms can be employed for the induction of a classifier able to predict the class of new data from the same domain, performing the desired discrimination. Some learning techniques are originally conceived for the solution of problems with only two classes, also named binary classification problems. However, many problems require the discrimination of examples into more than two categories or classes. This paper presents a survey on the main strategies for the generalization of binary classifiers to problems with more than two classes, known as multiclass classification problems. The focus is on strategies that decompose the original multiclass problem into multiple binary subtasks, whose outputs are combined to obtain the final prediction.
Resumo:
In multi-label classification, examples can be associated with multiple labels simultaneously. The task of learning from multi-label data can be addressed by methods that transform the multi-label classification problem into several single-label classification problems. The binary relevance approach is one of these methods, where the multi-label learning task is decomposed into several independent binary classification problems, one for each label in the set of labels, and the final labels for each example are determined by aggregating the predictions from all binary classifiers. However, this approach fails to consider any dependency among the labels. Aiming to accurately predict label combinations, in this paper we propose a simple approach that enables the binary classifiers to discover existing label dependency by themselves. An experimental study using decision trees, a kernel method as well as Naive Bayes as base-learning techniques shows the potential of the proposed approach to improve the multi-label classification performance.
Resumo:
Context. The star HD 87643, exhibiting the ""B[e] phenomenon"", has one of the most extreme infrared excesses for this object class. It harbours a large amount of both hot and cold dust, and is surrounded by an extended reflection nebula. Aims. One of our major goals was to investigate the presence of a companion in HD87643. In addition, the presence of close dusty material was tested through a combination of multi-wavelength high spatial resolution observations. Methods. We observed HD 87643 with high spatial resolution techniques, using the near-IR AMBER/VLTI interferometer with baselines ranging from 60 m to 130 m and the mid-IR MIDI/VLTI interferometer with baselines ranging from 25 m to 65 m. These observations are complemented by NACO/VLT adaptive-optics-corrected images in the K and L-bands, and ESO-2.2m optical Wide-Field Imager large-scale images in the B, V and R-bands. Results. We report the direct detection of a companion to HD 87643 by means of image synthesis using the AMBER/VLTI instrument. The presence of the companion is confirmed by the MIDI and NACO data, although with a lower confidence. The companion is separated by similar to 34 mas with a roughly north-south orientation. The period must be large (several tens of years) and hence the orbital parameters are not determined yet. Binarity with high eccentricity might be the key to interpreting the extreme characteristics of this system, namely a dusty circumstellar envelope around the primary, a compact dust nebulosity around the binary system and a complex extended nebula suggesting past violent ejections.
Resumo:
Establishment of a treatment plan is based on efficacy and easy application by the clinician, and acceptance by the patient. Treatment of adult patients with Class III malocclusion might require orthognathic surgery, especially when the deformity is severe, with a significant impact on facial esthetics. Impacted teeth can remarkably influence treatment planning, which should be precise and concise to allow a reasonably short treatment time with low biologic cost. We report here the case of a 20-year-old man who had a skeletal Class III malocclusion and impaction of the maxillary right canine, leading to remarkable deviation of the maxillary midline; this was his chief complaint. Because of the severely deviated position of the impacted canine, treatment included extraction of the maxillary right canine and left first premolar for midline correction followed by leveling, alignment, correction of compensatory tooth positioning, and orthognathic surgery to correct the skeletal Class III malocclusion because of the severe maxillary deficiency. This treatment approach allowed correction of the maxillary dental midline discrepancy to the midsagittal plane and establishment of good occlusion and optimal esthetics. (Am J Orthod Dentofacial Orthop 2010;137:840-9)
Resumo:
Several studies have reported high performance of simple decision heuristics multi-attribute decision making. In this paper, we focus on situations where attributes are binary and analyze the performance of Deterministic-Elimination-By-Aspects (DEBA) and similar decision heuristics. We consider non-increasing weights and two probabilistic models for the attribute values: one where attribute values are independent Bernoulli randomvariables; the other one where they are binary random variables with inter-attribute positive correlations. Using these models, we show that good performance of DEBA is explained by the presence of cumulative as opposed to simple dominance. We therefore introduce the concepts of cumulative dominance compliance and fully cumulative dominance compliance and show that DEBA satisfies those properties. We derive a lower bound with which cumulative dominance compliant heuristics will choose a best alternative and show that, even with many attributes, this is not small. We also derive an upper bound for the expected loss of fully cumulative compliance heuristics and show that this is moderateeven when the number of attributes is large. Both bounds are independent of the values ofthe weights.
Resumo:
Among the types of remote sensing acquisitions, optical images are certainly one of the most widely relied upon data sources for Earth observation. They provide detailed measurements of the electromagnetic radiation reflected or emitted by each pixel in the scene. Through a process termed supervised land-cover classification, this allows to automatically yet accurately distinguish objects at the surface of our planet. In this respect, when producing a land-cover map of the surveyed area, the availability of training examples representative of each thematic class is crucial for the success of the classification procedure. However, in real applications, due to several constraints on the sample collection process, labeled pixels are usually scarce. When analyzing an image for which those key samples are unavailable, a viable solution consists in resorting to the ground truth data of other previously acquired images. This option is attractive but several factors such as atmospheric, ground and acquisition conditions can cause radiometric differences between the images, hindering therefore the transfer of knowledge from one image to another. The goal of this Thesis is to supply remote sensing image analysts with suitable processing techniques to ensure a robust portability of the classification models across different images. The ultimate purpose is to map the land-cover classes over large spatial and temporal extents with minimal ground information. To overcome, or simply quantify, the observed shifts in the statistical distribution of the spectra of the materials, we study four approaches issued from the field of machine learning. First, we propose a strategy to intelligently sample the image of interest to collect the labels only in correspondence of the most useful pixels. This iterative routine is based on a constant evaluation of the pertinence to the new image of the initial training data actually belonging to a different image. Second, an approach to reduce the radiometric differences among the images by projecting the respective pixels in a common new data space is presented. We analyze a kernel-based feature extraction framework suited for such problems, showing that, after this relative normalization, the cross-image generalization abilities of a classifier are highly increased. Third, we test a new data-driven measure of distance between probability distributions to assess the distortions caused by differences in the acquisition geometry affecting series of multi-angle images. Also, we gauge the portability of classification models through the sequences. In both exercises, the efficacy of classic physically- and statistically-based normalization methods is discussed. Finally, we explore a new family of approaches based on sparse representations of the samples to reciprocally convert the data space of two images. The projection function bridging the images allows a synthesis of new pixels with more similar characteristics ultimately facilitating the land-cover mapping across images.
Resumo:
This article presents an experimental study about the classification ability of several classifiers for multi-classclassification of cannabis seedlings. As the cultivation of drug type cannabis is forbidden in Switzerland lawenforcement authorities regularly ask forensic laboratories to determinate the chemotype of a seized cannabisplant and then to conclude if the plantation is legal or not. This classification is mainly performed when theplant is mature as required by the EU official protocol and then the classification of cannabis seedlings is a timeconsuming and costly procedure. A previous study made by the authors has investigated this problematic [1]and showed that it is possible to differentiate between drug type (illegal) and fibre type (legal) cannabis at anearly stage of growth using gas chromatography interfaced with mass spectrometry (GC-MS) based on therelative proportions of eight major leaf compounds. The aims of the present work are on one hand to continueformer work and to optimize the methodology for the discrimination of drug- and fibre type cannabisdeveloped in the previous study and on the other hand to investigate the possibility to predict illegal cannabisvarieties. Seven classifiers for differentiating between cannabis seedlings are evaluated in this paper, namelyLinear Discriminant Analysis (LDA), Partial Least Squares Discriminant Analysis (PLS-DA), Nearest NeighbourClassification (NNC), Learning Vector Quantization (LVQ), Radial Basis Function Support Vector Machines(RBF SVMs), Random Forest (RF) and Artificial Neural Networks (ANN). The performance of each method wasassessed using the same analytical dataset that consists of 861 samples split into drug- and fibre type cannabiswith drug type cannabis being made up of 12 varieties (i.e. 12 classes). The results show that linear classifiersare not able to manage the distribution of classes in which some overlap areas exist for both classificationproblems. Unlike linear classifiers, NNC and RBF SVMs best differentiate cannabis samples both for 2-class and12-class classifications with average classification results up to 99% and 98%, respectively. Furthermore, RBFSVMs correctly classified into drug type cannabis the independent validation set, which consists of cannabisplants coming from police seizures. In forensic case work this study shows that the discrimination betweencannabis samples at an early stage of growth is possible with fairly high classification performance fordiscriminating between cannabis chemotypes or between drug type cannabis varieties.
Resumo:
The objective of this work was to evaluate sampling density on the prediction accuracy of soil orders, with high spatial resolution, in a viticultural zone of Serra Gaúcha, Southern Brazil. A digital elevation model (DEM), a cartographic base, a conventional soil map, and the Idrisi software were used. Seven predictor variables were calculated and read along with soil classes in randomly distributed points, with sampling densities of 0.5, 1, 1.5, 2, and 4 points per hectare. Data were used to train a decision tree (Gini) and three artificial neural networks: adaptive resonance theory, fuzzy ARTMap; self‑organizing map, SOM; and multi‑layer perceptron, MLP. Estimated maps were compared with the conventional soil map to calculate omission and commission errors, overall accuracy, and quantity and allocation disagreement. The decision tree was less sensitive to sampling density and had the highest accuracy and consistence. The SOM was the less sensitive and most consistent network. The MLP had a critical minimum and showed high inconsistency, whereas fuzzy ARTMap was more sensitive and less accurate. Results indicate that sampling densities used in conventional soil surveys can serve as a reference to predict soil orders in Serra Gaúcha.
Resumo:
This paper deals with second-generation, one-and-a-half generation and ‘‘prolonged sojourner” Trinidadian transnational migrants, who have decided to ‘return’ to the birthplace of their parents. Based on 40 in-depth interviews, the paper considers both the positive and critical things that these youthful transnational migrants report about returning to, and living in, this multi-ethnic plural society and the salience of racial and colour-class stratification as part of their return migration experiences. Our qualitative analysis is based on the narratives provided by these youthful returnees, as relayed ‘‘in their own words”, presenting critical reflections on racism, racial identities and experiences as transnational Trinidadians. It is clear that it is contexts such as contemporary working environments, family and community that act as the reference points for the adaptation ‘‘back home” of this strongly middle-class cohort. We accordingly encounter a diverse, sometimes contesting set of racial issues that emerge as salient concerns for these returnees. The consensus is that matters racial remain as formidable legacies in the hierarchical stratification of Trinidadian society for a sizeable number. Many of our respondents reported the positive aspects of racial affirmation on return. But for another sub-set, the fact that multi-ethnic and multi-cultural mixing are proudly embraced in Trinidad meant that it was felt that return experiences were not overly hindered, or blighted by obstacles of race and colour-class. For these returnees, Trinidad and Tobago is seen as representing a 21st century ‘‘Melting Pot”. But for others the continued existence of racial divisions within society – between ethnic groups and among those of different skin shades – was lamented. In the views of these respondents, too much racial power is still ascribed to ‘near-whiteness’. But for the most part, the returnees felt that where race played a part in their new lives, this generally served to advantage them. However, although the situation in Trinidad appears to have been moderated by assumptions that it remains a racial ‘Melting Pot’, the analysis strongly suggests that the colour-class system of stratification is still playing an essential role, along with racial stereotyping in society at large.
Resumo:
Although some individual techniques of supervised Machine Learning (ML), also known as classifiers, or algorithms of classification, to supply solutions that, most of the time, are considered efficient, have experimental results gotten with the use of large sets of pattern and/or that they have a expressive amount of irrelevant data or incomplete characteristic, that show a decrease in the efficiency of the precision of these techniques. In other words, such techniques can t do an recognition of patterns of an efficient form in complex problems. With the intention to get better performance and efficiency of these ML techniques, were thought about the idea to using some types of LM algorithms work jointly, thus origin to the term Multi-Classifier System (MCS). The MCS s presents, as component, different of LM algorithms, called of base classifiers, and realized a combination of results gotten for these algorithms to reach the final result. So that the MCS has a better performance that the base classifiers, the results gotten for each base classifier must present an certain diversity, in other words, a difference between the results gotten for each classifier that compose the system. It can be said that it does not make signification to have MCS s whose base classifiers have identical answers to the sames patterns. Although the MCS s present better results that the individually systems, has always the search to improve the results gotten for this type of system. Aim at this improvement and a better consistency in the results, as well as a larger diversity of the classifiers of a MCS, comes being recently searched methodologies that present as characteristic the use of weights, or confidence values. These weights can describe the importance that certain classifier supplied when associating with each pattern to a determined class. These weights still are used, in associate with the exits of the classifiers, during the process of recognition (use) of the MCS s. Exist different ways of calculating these weights and can be divided in two categories: the static weights and the dynamic weights. The first category of weights is characterizes for not having the modification of its values during the classification process, different it occurs with the second category, where the values suffers modifications during the classification process. In this work an analysis will be made to verify if the use of the weights, statics as much as dynamics, they can increase the perfomance of the MCS s in comparison with the individually systems. Moreover, will be made an analysis in the diversity gotten for the MCS s, for this mode verify if it has some relation between the use of the weights in the MCS s with different levels of diversity
Resumo:
The mortality caused by snakebites is more damaging than many tropical diseases, such as dengue haemorrhagic fever, cholera, leishmaniasis, schistosomiasis and Chagas disease. For this reason, snakebite envenoming adversely affects health services of tropical and subtropical countries and is recognized as a neglected disease by the World Health Organization. One of the main components of snake venoms is the Lys49-phospholipases A2, which is catalytically inactive but possesses other toxic and pharmacological activities. Preliminary studies with MjTX-I from Bothrops moojeni snake venom revealed intriguing new structural and functional characteristics compared to other bothropic Lys49-PLA2s. We present in this article a comprehensive study with MjTX-I using several techniques, including crystallography, small angle X-ray scattering, analytical size-exclusion chromatography, dynamic light scattering, myographic studies, bioinformatics and molecular phylogenetic analyses.Based in all these experiments we demonstrated that MjTX-I is probably a unique Lys49-PLA2, which may adopt different oligomeric forms depending on the physical-chemical environment. Furthermore, we showed that its myotoxic activity is dramatically low compared to other Lys49-PLA2s, probably due to the novel oligomeric conformations and important mutations in the C-terminal region of the protein. The phylogenetic analysis also showed that this toxin is clearly distinct from other bothropic Lys49-PLA2s, in conformity with the peculiar oligomeric characteristics of MjTX-I and possible emergence of new functionalities inresponse to environmental changes and adaptation to new preys. © 2013 Salvador et al.
Resumo:
Plant phenology is one of the most reliable indicators of species responses to global climate change, motivating the development of new technologies for phenological monitoring. Digital cameras or near remote systems have been efficiently applied as multi-channel imaging sensors, where leaf color information is extracted from the RGB (Red, Green, and Blue) color channels, and the changes in green levels are used to infer leafing patterns of plant species. In this scenario, texture information is a great ally for image analysis that has been little used in phenology studies. We monitored leaf-changing patterns of Cerrado savanna vegetation by taking daily digital images. We extract RGB channels from the digital images and correlate them with phenological changes. Additionally, we benefit from the inclusion of textural metrics for quantifying spatial heterogeneity. Our first goals are: (1) to test if color change information is able to characterize the phenological pattern of a group of species; (2) to test if the temporal variation in image texture is useful to distinguish plant species; and (3) to test if individuals from the same species may be automatically identified using digital images. In this paper, we present a machine learning approach based on multiscale classifiers to detect phenological patterns in the digital images. Our results indicate that: (1) extreme hours (morning and afternoon) are the best for identifying plant species; (2) different plant species present a different behavior with respect to the color change information; and (3) texture variation along temporal images is promising information for capturing phenological patterns. Based on those results, we suggest that individuals from the same species and functional group might be identified using digital images, and introduce a new tool to help phenology experts in the identification of new individuals from the same species in the image and their location on the ground. © 2013 Elsevier B.V. All rights reserved.
Resumo:
PURPOSE Survivin is a member of the inhibitor-of-apoptosis family. Essential for tumor cell survival and overexpressed in most cancers, survivin is a promising target for anti-cancer immunotherapy. Immunogenicity has been demonstrated in multiple cancers. Nonetheless, few clinical trials have demonstrated survivin-vaccine-induced immune responses. EXPERIMENTAL DESIGN This phase I trial was conducted to test whether vaccine EMD640744, a cocktail of five HLA class I-binding survivin peptides in Montanide(®) ISA 51 VG, promotes anti-survivin T-cell responses in patients with solid cancers. The primary objective was to compare immunologic efficacy of EMD640744 at doses of 30, 100, and 300 μg. Secondary objectives included safety, tolerability, and clinical efficacy. RESULTS In total, 49 patients who received ≥2 EMD640744 injections with available baseline- and ≥1 post-vaccination samples [immunologic-diagnostic (ID)-intention-to-treat] were analyzed by ELISpot- and peptide/MHC-multimer staining, revealing vaccine-activated peptide-specific T-cell responses in 31 patients (63 %). This cohort included the per study protocol relevant ID population for the primary objective, i.e., T-cell responses by ELISpot in 17 weeks following first vaccination, as well as subjects who discontinued the study before week 17 but showed responses to the treatment. No dose-dependent effects were observed. In the majority of patients (61 %), anti-survivin responses were detected only after vaccination, providing evidence for de novo induction. Best overall tumor response was stable disease (28 %). EMD640744 was well tolerated; local injection-site reactions constituted the most frequent adverse event. CONCLUSIONS Vaccination with EMD640744 elicited T-cell responses against survivin peptides in the majority of patients, demonstrating the immunologic efficacy of EMD640744.
Resumo:
The educational platform Virtual Science Hub (ViSH) has been developed as part of the GLOBAL excursion European project. ViSH (http://vishub.org/) is a portal where teachers and scientist interact to create virtual excursions to science infrastructures. The main motivation behind the project was to connect teachers - and in consequence their students - to scientific institutions and their wide amount of infrastructures and resources they are working with. Thus the idea of a hub was born that would allow the two worlds of scientists and teachers to connect and to innovate science teaching. The core of the ViSH?s concept design is based on virtual excursions, which allow for a number of pedagogical models to be applied. According to our internal definition a virtual excursion is a tour through some digital context by teachers and pupils on a given topic that is attractive and has an educational purpose. Inquiry-based learning, project-based and problem-based learning are the most prominent approaches that a virtual excursion may serve. The domain specific resources and scientific infrastructures currently available on the ViSH are focusing on life sciences, nano-technology, biotechnology, grid and volunteer computing. The virtual excursion approach allows an easy combination of these resources into interdisciplinary teaching scenarios. In addition, social networking features support the users in collaborating and communicating in relation to these excursions and thus create a community of interest for innovative science teaching. The design and development phases were performed following a participatory design approach. An important aspect in this process was to create design partnerships amongst all actors involved, researchers, developers, infrastructure providers, teachers, social scientists, and pedagogical experts early in the project. A joint sense of ownership was created and important changes during the conceptual phase were implemented in the ViSH due to early user feedback. Technology-wise the ViSH is based on the latest web technologies in order to make it cross-platform compatible so that it works on several operative systems such as Windows, Mac or Linux and multi-device accessible, such as desktop, tablet and mobile devices. The platform has been developed in HTML5, the latest standard for web development, assuring that it can run on any modern browser. In addition to social networking features a core element on the ViSH is the virtual excursions editor. It is a web tool that allows teachers and scientists to create rich mash-ups of learning resources provided by the e-Infrastructures (i.e. remote laboratories and live webcams). These rich mash-ups can be presented in either slides or flashcards format. Taking advantage of the web architecture supported, additional powerful components have been integrated like a recommendation engine to provide personalized suggestions about educational content or interesting users and a videoconference tool to enhance real-time collaboration like MashMeTV (http://www.mashme.tv/).