744 resultados para unsupervised classification
Resumo:
Recently, we have built a classification model that is capable of assigning a given sesquiterpene lactone (STL) into exactly one tribe of the plant family Asteraceae from which the STL has been isolated. Although many plant species are able to biosynthesize a set of peculiar compounds, the occurrence of the same secondary metabolites in more than one tribe of Asteraceae is frequent. Building on our previous work, in this paper, we explore the possibility of assigning an STL to more than one tribe (class) simultaneously. When an object may belong to more than one class simultaneously, it is called multilabeled. In this work, we present a general overview of the techniques available to examine multilabeled data. The problem of evaluating the performance of a multilabeled classifier is discussed. Two particular multilabeled classification methods-cross-training with support vector machines (ct-SVM) and multilabeled k-nearest neighbors (M-L-kNN)were applied to the classification of the STLs into seven tribes from the plant family Asteraceae. The results are compared to a single-label classification and are analyzed from a chemotaxonomic point of view. The multilabeled approach allowed us to (1) model the reality as closely as possible, (2) improve our understanding of the relationship between the secondary metabolite profiles of different Asteraceae tribes, and (3) significantly decrease the number of plant sources to be considered for finding a certain STL. The presented classification models are useful for the targeted collection of plants with the objective of finding plant sources of natural compounds that are biologically active or possess other specific properties of interest.
Resumo:
Developing a unified classification system to replace four of the systems currently used in disability athletics (i.e., track and field) has been widely advocated. The diverse impairments to be included in a unified system require severed assessment methods, results of which cannot be meaningfully compared. Therefore, the taxonomic basis of current classification systems is invalid in a unified system. Biomechanical analysis establishes that force, a vector described in terms of magnitude and direction, is a key determinant of success in all athletic disciplines. It is posited that all impairments to be included in a unified system may be classified as either force magnitude impairments (FMI) or force control impairments (FCI). This framework would provide a valid taxonomic basis for a unified system, creating the opportunity to decrease the number of classes and enhance the viability of disability athletics.
Resumo:
Data mining is the process to identify valid, implicit, previously unknown, potentially useful and understandable information from large databases. It is an important step in the process of knowledge discovery in databases, (Olaru & Wehenkel, 1999). In a data mining process, input data can be structured, seme-structured, or unstructured. Data can be in text, categorical or numerical values. One of the important characteristics of data mining is its ability to deal data with large volume, distributed, time variant, noisy, and high dimensionality. A large number of data mining algorithms have been developed for different applications. For example, association rules mining can be useful for market basket problems, clustering algorithms can be used to discover trends in unsupervised learning problems, classification algorithms can be applied in decision-making problems, and sequential and time series mining algorithms can be used in predicting events, fault detection, and other supervised learning problems (Vapnik, 1999). Classification is among the most important tasks in the data mining, particularly for data mining applications into engineering fields. Together with regression, classification is mainly for predictive modelling. So far, there have been a number of classification algorithms in practice. According to (Sebastiani, 2002), the main classification algorithms can be categorized as: decision tree and rule based approach such as C4.5 (Quinlan, 1996); probability methods such as Bayesian classifier (Lewis, 1998); on-line methods such as Winnow (Littlestone, 1988) and CVFDT (Hulten 2001), neural networks methods (Rumelhart, Hinton & Wiliams, 1986); example-based methods such as k-nearest neighbors (Duda & Hart, 1973), and SVM (Cortes & Vapnik, 1995). Other important techniques for classification tasks include Associative Classification (Liu et al, 1998) and Ensemble Classification (Tumer, 1996).
Resumo:
The task of segmenting cell nuclei from cytoplasm in conventional Papanicolaou (Pap) stained cervical cell images is a classical image analysis problem which may prove to be crucial to the development of successful systems which automate the analysis of Pap smears for detection of cancer of the cervix. Although simple thresholding techniques will extract the nucleus in some cases, accurate unsupervised segmentation of very large image databases is elusive. Conventional active contour models as introduced by Kass, Witkin and Terzopoulos (1988) offer a number of advantages in this application, but suffer from the well-known drawbacks of initialisation and minimisation. Here we show that a Viterbi search-based dual active contour algorithm is able to overcome many of these problems and achieve over 99% accurate segmentation on a database of 20 130 Pap stained cell images. (C) 1998 Elsevier Science B.V. All rights reserved.
Resumo:
Strain-dependent hydraulic conductivities are uniquely defined by an environmental factor, representing applied normal and shear strains, combined with intrinsic material parameters representing mass and component deformation moduli, initial conductivities, and mass structure. The components representing mass moduli and structure are defined in terms of RQD (rock quality designation) and RMR (rock mass rating) to represent the response of a whole spectrum of rock masses, varying from highly fractured (crushed) rock to intact rock. These two empirical parameters determine the hydraulic response of a fractured medium to the induced-deformations The constitutive relations are verified against available published data and applied to study one-dimensional, strain-dependent fluid flow. Analytical results indicate that both normal and shear strains exert a significant influence on the processes of fluid flow and that the magnitude of this influence is regulated by the values of RQD and RMR.
Resumo:
CysView is a web-based application tool that identifies and classifies proteins according to their disulfide connectivity patterns. It accepts a dataset of annotated protein sequences in various formats and returns a graphical representation of cysteine pairing patterns. CysView displays cysteine patterns for those records in the data with disulfide annotations. It allows the viewing of records grouped by connectivity patterns. CysView's utility as an analysis tool was demonstrated by the rapid and correct classification of scorpion toxin entries from GenPept on the basis of their disulfide pairing patterns. It has proved useful for rapid detection of irrelevant and partial records, or those with incomplete annotations. CysView can be used to support distant homology between proteins. CysView is publicly available at http://research.i2r.a-star.edu.sg/CysView/.
Resumo:
Gasteruptiinae is the largest Gasteruptiidae subfamily, with circa 400 species that have been grouped into the worldwide Gasteruption Latreille. Based on a cladistic analysis with 43 morphological characters, 40 ingroup taxa representing all biogeographic regions, and seven outgroups (four Hyptiogastrinae, two Aulacidae and one Evaniidae), I confirm the monophyly of Gasteruptiinae and Gasteruption and recognize three exclusively Neotropical small genera: Plutofoenus Kieffer (revalidated) (southern South America), Spinolafoenus Macedo n. gen. (Chile) and Trilobitofoenus Macedo n. gen. (Central and South America). Gasteruption, supported by four synapomorphies, remains the most speciose genus in the subfamily. The four Gasteruptiinae genera are keyed and described. Seven species are keyed and described or redescribed: Plutofoenus chaeturus (Schletterer) n. comb., P. edwardsi Turner, P. paraguayensis (Schrottky), Spinolafoenus ruficornis (Spinola) n. comb., Trilobitofoenus alvarengai Macedo n. sp., T. plaumanni Macedo n. sp. and T. sericeus (Cameron) n. comb. (lectotype designated).
Resumo:
Objectives To validate the previously proposed classification criteria for Henoch-Schonlein purpura (HSP), childhood polyarteritis nodosa (c-PAN), c-Wegener granulomatosis (c-WG) and c-Takayasu arteritis (c-TA). Methods Step 1: retrospective/prospective webdata collection for children with HSP, c-PAN, c-WG and c-TA with age at diagnosis <= 18 years. Step 2: blinded classification by consensus panel of a representative sample of 280 cases. Step 3: statistical (sensitivity, specificity, area under the curve and.-agreement) and nominal group technique consensus evaluations. Results 827 patients with HSP, 150 with c-PAN, 60 with c-WG, 87 with c-TA and 52 with c-other were compared with each other. A patient was classified as HSP in the presence of purpura or petechiae (mandatory) with lower limb predominance plus one of four criteria: (1) abdominal pain; (2) histopathology (IgA); (3) arthritis or arthralgia; (4) renal involvement. Classification of c-PAN required a systemic inflammatory disease with evidence of necrotising vasculitis OR angiographic abnormalities of medium-/small-sized arteries (mandatory criterion) plus one of five criteria: (1) skin involvement; (2) myalgia/muscle tenderness; (3) hypertension; (4) peripheral neuropathy; (5) renal involvement. Classification of c-WG required three of six criteria: (1) histopathological evidence of granulomatous inflammation; (2) upper airway involvement; (3) laryngo-tracheo-bronchial involvement; (4) pulmonary involvement (x-ray/CT); (5) antineutrophilic cytoplasmic antibody positivity; (6) renal involvement. Classification of c-TA required typical angiographic abnormalities of the aorta or its main branches and pulmonary arteries (mandatory criterion) plus one of five criteria: (1) pulse deficit or claudication; (2) blood pressure discrepancy in any limb; (3) bruits; (4) hypertension; (5) elevated acute phase reactant. Conclusion European League Against Rheumatism/Paediatric Rheumatology International Trials Organisation/Paediatric Rheumatology European Society propose validated classification criteria for HSP, c-PAN, c-WG and c-TA with high sensitivity/specificity.
Resumo:
Background Schizophrenia has been associated with semantic memory impairment and previous studies report a difficulty in accessing semantic category exemplars (Moelter et al. 2005 Schizophr Res 78:209–217). The anterior temporal cortex (ATC) has been implicated in the representation of semantic knowledge (Rogers et al. 2004 Psychol Rev 111(1):205–235). We conducted a high-field (4T) fMRI study with the Category Judgment and Substitution Task (CJAST), an analogue of the Hayling test. We hypothesised that differential activation of the temporal lobe would be observed in schizophrenia patients versus controls. Methods Eight schizophrenia patients (7M : 1F) and eight matched controls performed the CJAST, involving a randomised series of 55 common nouns (from five semantic categories) across three conditions: semantic categorisation, anomalous categorisation and word reading. High-resolution 3D T1-weighted images and GE EPI with BOLD contrast and sparse temporal sampling were acquired on a 4T Bruker MedSpec system. Image processing and analyses were performed with SPM2. Results Differential activation in the left ATC was found for anomalous categorisation relative to category judgment, in patients versus controls. Conclusions We examined semantic memory deficits in schizophrenia using a novel fMRI task. Since the ATC corresponds to an area involved in accessing abstract semantic representations (Moelter et al. 2005), these results suggest schizophrenia patients utilise the same neural network as healthy controls, however it is compromised in the patients and the different ATC activity might be attributable to weakening of category-to-category associations.
Resumo:
In studies assessing the trends in coronary events, such as the World Health Organization (WHO) MONICA Project (multinational MONItoring of trends and determinants of CArdiovascular disease), the main emphasis has been on coronary deaths and non-fatal definite myocardial infarctions (MI). It is, however, possible that the proportion of milder MIs may be increasing because of improvements in treatment and reductions in levels of risk factors. We used the MI register data of the WHO MONICA Project to investigate several definitions for mild non-fatal MIs that would be applicable in various settings and could be used to assess trends in milder coronary events. Of 38 populations participating in the WHO MONICA MI register study, more than half registered a sufficiently wide spectrum of events that it was possible to identify subsets of milder cases. The event rates and case fatality rates of MI are clearly dependent on the spectrum of non-fatal MIs, which are included. On clinical grounds we propose that the original MONICA category ''non-fatal possible MI'' could bt:divided into two groups: ''non fatal probable MI'' and ''prolonged chest pain.'' Non-fatal probable MIs are cases, which in addition to ''typical symptoms'' have electrocardiogram (EGG) or enzyme changes suggesting cardiac ischemia, but not severe enough to fulfil the criteria for non-fatal definite MI In more than half of the MONICA Collaborating Centers, the registration of MI covers these milder events reasonably well. Proportions of non-fatal probable MIs vary less between populations than do proportions of non fatal possible MIs. Also rates of non-fatal probable MI are somewhat more highly correlated with rates of fatal events and non-fatal definite MI. These findings support the validity of the category of non-fatal probable MI. In each center the increase in event rates and the decrease in case-fatality due to the inclusion of non-fatal probable MI was lar er for women than men. For the WHO MONICA Project and other epidemiological studies the proposed category of non-fatal probable MIs can be used for assessing trends in rates of milder MI. Copyright (C) 1997 Elsevier Science Inc.
Resumo:
The aim of a clinical classification of pulmonary hypertension (PH) is to group together different manifestations of disease sharing similarities in pathophysiologic mechanisms, clinical presentation, and therapeutic approaches. In 2003, during the 3rd World Symposium on Pulmonary Hypertension, the clinical classification of PH initially adopted in 1998 during the 2nd World Symposium was slightly modified. During the 4th World Symposium held in 2008, it was decided to maintain the general architecture and philosophy of the previous clinical classifications. The modifications adopted during this meeting principally concern Group 1, pulmonary arterial hypertension (PAH). This subgroup includes patients with PAH with a family history or patients with idiopathic PAH with germline mutations (e. g., bone morphogenetic protein receptor-2, activin receptor-like kinase type 1, and endoglin). In the new classification, schistosomiasis and chronic hemolytic anemia appear as separate entities in the subgroup of PAH associated with identified diseases. Finally, it was decided to place pulmonary venoocclusive disease and pulmonary capillary hemangiomatosis in a separate group, distinct from but very close to Group 1 (now called Group 1`). Thus, Group 1 of PAH is now more homogeneous. (J Am Coll Cardiol 2009;54:S43-54) (C) 2009 by the American College of Cardiology Foundation