989 resultados para multivariate classification
Resumo:
Multivariate classification methods were used to evaluate data on the concentrations of eight metals in human senile lenses measured by atomic absorption spectrometry. Principal components analysis and hierarchical clustering separated senile cataract lenses, nuclei from cataract lenses, and normal lenses into three classes on the basis of the eight elements. Stepwise discriminant analysis was applied to give discriminant functions with five selected variables. Results provided by the linear learning machine method were also satisfactory; the k-nearest neighbour method was less useful.
Resumo:
Site-specific management requires accurate knowledge of the spatial variation in a range of soil properties within fields. This involves considerable sampling effort, which is costly. Ancillary data, such as crop yield, elevation and apparent electrical conductivity (ECa) of the soil, can provide insight into the spatial variation of some soil properties. A multivariate classification with spatial constraint imposed by the variogram was used to classify data from two arable crop fields. The yield data comprised 5 years of crop yield, and the ancillary data 3 years of yield data, elevation and ECa. Information on soil chemical and physical properties was provided by intensive surveys of the soil. Multivariate variograms computed from these data were used to constrain sites spatially within classes to increase their contiguity. The constrained classifications resulted in coherent classes, and those based on the ancillary data were similar to those from the soil properties. The ancillary data seemed to identify areas in the field where the soil is reasonably homogeneous. The results of targeted sampling showed that these classes could be used as a basis for management and to guide future sampling of the soil.
Resumo:
Multivariate classification techniques have proven to be powerful tools for distinguishing experimental conditions in single sessions of functional magnetic resonance imaging (fMRI) data. But they are vulnerable to a considerable penalty in classification accuracy when applied across sessions or participants, calling into question the degree to which fine-grained encodings are shared across subjects. Here, we introduce joint learning techniques, where feature selection is carried out using a held-out subset of a target dataset, before training a linear classifier on a source dataset. Single trials of functional MRI data from a covert property generation task are classified with regularized regression techniques to predict the semantic class of stimuli. With our selection techniques (joint ranking feature selection (JRFS) and disjoint feature selection (DJFS)), classification performance during cross-session prediction improved greatly, relative to feature selection on the source session data only. Compared with JRFS, DJFS showed significant improvements for cross-participant classification. And when using a groupwise training, DJFS approached the accuracies seen for prediction across different sessions from the same participant. Comparing several feature selection strategies, we found that a simple univariate ANOVA selection technique or a minimal searchlight (one voxel in size) is appropriate, compared with larger searchlights.
Resumo:
The Representative Soil Sampling Scheme of England and Wales has recorded information on the soil of agricultural land in England and Wales since 1969. It is a valuable source of information about the soil in the context of monitoring for sustainable agricultural development. Changes in soil nutrient status and pH were examined over the period 1971-2001. Several methods of statistical analysis were applied to data from the surveys during this period. The main focus here is on the data for 1971, 1981, 1991 and 2001. The results of examining change over time in general show that levels of potassium in the soil have increased, those of magnesium have remained fairly constant, those of phosphorus have declined and pH has changed little. Future sampling needs have been assessed in the context of monitoring, to determine the mean at a given level of confidence and tolerable error and to detect change in the mean over time at these same levels over periods of 5 and 10 years. The results of a non-hierarchical multivariate classification suggest that England and Wales could be stratified to optimize future sampling and analysis. To monitor soil quality and health more generally than for agriculture, more of the country should be sampled and a wider range of properties recorded.
Resumo:
Autism Spectrum Disorder (ASD) is diagnosed on the basis of behavioral symptoms, but cognitive abilities may also be useful in characterizing individuals with ASD. One hundred seventy-eight high-functioning male adults, half with ASD and half without, completed tasks assessing IQ, a broad range of cognitive skills, and autistic and comorbid symptomatology. The aims of the study were, first, to determine whether significant differences existed between cases and controls on cognitive tasks, and whether cognitive profiles, derived using a multivariate classification method with data from multiple cognitive tasks, could distinguish between the two groups. Second, to establish whether cognitive skill level was correlated with degree of autistic symptom severity, and third, whether cognitive skill level was correlated with degree of comorbid psychopathology. Fourth, cognitive characteristics of individuals with Asperger Syndrome (AS) and high-functioning autism (HFA) were compared. After controlling for IQ, ASD and control groups scored significantly differently on tasks of social cognition, motor performance, and executive function (P's < 0.05). To investigate cognitive profiles, 12 variables were entered into a support vector machine (SVM), which achieved good classification accuracy (81%) at a level significantly better than chance (P < 0.0001). After correcting for multiple correlations, there were no significant associations between cognitive performance and severity of either autistic or comorbid symptomatology. There were no significant differences between AS and HFA groups on the cognitive tasks. Cognitive classification models could be a useful aid to the diagnostic process when used in conjunction with other data sources-including clinical history.
Resumo:
A series of quali- and quantitative analyses were conducted to evaluate the variability of spinner dolphin whistles from the Fernando de Noronha Archipelago off Brazil. Nine variables were extracted from each whistle contour, and the whistle contours shapes were classified into the seven categories described in Driscoll (1995). The analysis showed mean beginning and ending frequencies values of 10.78 and 12.74 kHz, respectively. on average, whistle duration was relatively short, with mean values around 0.495 s (N=702). Comparative analyses were also conducted to investigate the relationship between the obtained results and those presented in previous studies. When comparing averages, the results of the study of Oswald et al. (2003) in the Tropical Eastern Pacific (TEP) presented less significant differences in relation to this study; only whistle duration differed significantly between both works. The results of multivariate classification tests also pointed TEP population as the closest related to the population studied here. The similarities between such disjunct populations might be attributed to a more recent isolation event (the closing of the Panama Isthmus) than the divergence that has driven North and South Atlantic populations apart. (c) 2006 Acoustical Society of America.
Resumo:
This paper presents a method for multidimensional classification (Cluster Analysis), as well as a 8 bits microcomputer program in BASIC that automates the process. A practical application of the model is also presented. -English summary
Resumo:
ABSTRACT: In this study, the mineral composition of leaves and teas of medicinal plants was evaluated. Ca, Cu, Fe, Mg, Mn e Zn were determined in the samples using flame atomic absorption spectrometry. Principal component analysis was applied to discriminate the samples studied. The samples were divided within the 2 groups according to their mineral composition. Copper and iron were the variables that contributed most to the separation of the samples followed by Ca, Mg, Mn and Zn. The information in the principal component analysis was confirmed by the dendrogram obtained by hierarchical cluster analysis.
Resumo:
A series of quali- and quantitative analyses were conducted to evaluate the variability of spinner dolphin whistles from the Fernando de Noronha Archipelago off Brazil. Nine variables were extracted from each whistle contour, and the whistle contours shapes were classified into the seven categories described in Driscoll (1995). The analysis showed mean beginning and ending frequencies values of 10.78 and 12.74 kHz, respectively. on average, whistle duration was relatively short, with mean values around 0.495 s (N=702). Comparative analyses were also conducted to investigate the relationship between the obtained results and those presented in previous studies. When comparing averages, the results of the study of Oswald et al. (2003) in the Tropical Eastern Pacific (TEP) presented less significant differences in relation to this study; only whistle duration differed significantly between both works. The results of multivariate classification tests also pointed TEP population as the closest related to the population studied here. The similarities between such disjunct populations might be attributed to a more recent isolation event (the closing of the Panama Isthmus) than the divergence that has driven North and South Atlantic populations apart. (c) 2006 Acoustical Society of America.
Resumo:
Méthodologie: Simulation; Analyse discriminante linéaire et logistique; Arbres de classification; Réseaux de neurones en base radiale
Resumo:
The use of near infrared (NIR) hyperspectral imaging and hyperspectral image analysis for distinguishing between hard, intermediate and soft maize kernels from inbred lines was evaluated. NIR hyperspectral images of two sets (12 and 24 kernels) of whole maize kernels were acquired using a Spectral Dimensions MatrixNIR camera with a spectral range of 960-1662 nm and a sisuChema SWIR (short wave infrared) hyperspectral pushbroom imaging system with a spectral range of 1000-2498 nm. Exploratory principal component analysis (PCA) was used on absorbance images to remove background, bad pixels and shading. On the cleaned images. PCA could be used effectively to find histological classes including glassy (hard) and floury (soft) endosperm. PCA illustrated a distinct difference between glassy and floury endosperm along principal component (PC) three on the MatrixNIR and PC two on the sisuChema with two distinguishable clusters. Subsequently partial least squares discriminant analysis (PLS-DA) was applied to build a classification model. The PLS-DA model from the MatrixNIR image (12 kernels) resulted in root mean square error of prediction (RMSEP) value of 0.18. This was repeated on the MatrixNIR image of the 24 kernels which resulted in RMSEP of 0.18. The sisuChema image yielded RMSEP value of 0.29. The reproducible results obtained with the different data sets indicate that the method proposed in this paper has a real potential for future classification uses.
Resumo:
Brazilian sugarcane spirits were analyzed to elucidate similarities and dissimilarities by principal component analysis. Nine aldehydes, six alcohols, and six metal cations were identified and quantified. Isobutanol (LD 202.9 mu gL-1), butiraldehyde (0.08-0.5 mu gL-1), ethanol (39-47% v/v), and copper (371-6068 mu gL-1) showed marked similarities, but the concentration levels of n-butanol (1.6-7.3 mu gL-1), sec-butanol (LD 89 mu gL-1), formaldehyde (0.1-0.74 mu gL-1), valeraldehyde (0.04-0.31 mu gL-1), iron (8.6-139.1 mu gL-1), and magnesium (LD 1149 mu gL-1) exhibited differences from samples.
Resumo:
In order to differentiate and characterize Madeira wines according to main grape varieties, the volatile composition (higher alcohols, fatty acids, ethyl esters and carbonyl compounds) was determined for 36 monovarietal Madeira wine samples elaborated from Boal, Malvazia, Sercial and Verdelho white grape varieties. The study was carried out by headspace solid-phase microextraction technique (HS-SPME), in dynamic mode, coupled with gas chromatography–mass spectrometry (GC–MS). Corrected peak area data for 42 analytes from the above mentioned chemical groups was used for statistical purposes. Principal component analysis (PCA) was applied in order to determine the main sources of variability present in the data sets and to establish the relation between samples (objects) and volatile compounds (variables). The data obtained by GC–MS shows that the most important contributions to the differentiation of Boal wines are benzyl alcohol and (E)-hex-3-en-1-ol. Ethyl octadecanoate, (Z)-hex-3-en-1-ol and benzoic acid are the major contributions in Malvazia wines and 2-methylpropan-1-ol is associated to Sercial wines. Verdelho wines are most correlated with 5-(ethoxymethyl)-furfural, nonanone and cis-9-ethyldecenoate. A 96.4% of prediction ability was obtained by the application of stepwise linear discriminant analysis (SLDA) using the 19 variables that maximise the variance of the initial data set.
Resumo:
Zones of mixing between shallow groundwaters of different composition were unravelled by two-way regionalized classification, a technique based on correspondence analysis (CA), cluster analysis (ClA) and discriminant analysis (DA), aided by gridding, map-overlay and contouring tools. The shallow groundwaters are from a granitoid plutonite in the Funda o region (central Portugal). Correspondence analysis detected three natural clusters in the working dataset: 1, weathering; 2, domestic effluents; 3, fertilizers. Cluster analysis set an alternative distribution of the samples by the three clusters. Group memberships obtained by correspondence analysis and by cluster analysis were optimized by discriminant analysis, gridded memberships as follows: codes 1, 2 or 3 were used when classification by correspondence analysis and cluster analysis produced the same results; code 0 when the grid node was first assigned to cluster 1 and then to cluster 2 or vice versa (mixing between weathering and effluents); code 4 in the other cases (mixing between agriculture and the other influences). Code-3 areas were systematically surrounded by code-4 areas, an observation attributed to hydrodynamic dispersion. Accordingly, the extent of code-4 areas in two orthogonal directions was assumed proportional to the longitudinal and transverse dispersivities of local soils. The results (0.7-16.8 and 0.4-4.3 m, respectively) are acceptable at the macroscopic scale. The ratios between longitudinal and transverse dispersivities (1.2-11.1) are also in agreement with results obtained by other studies.