936 results for Nonparametric discriminant analysis


Relevance:

90.00%

Publisher:

Abstract:

A compositional multivariate approach is used to analyse regional-scale soil geochemical data obtained as part of the Tellus Project generated by the Geological Survey of Northern Ireland (GSNI). The multi-element total concentration data presented comprise XRF analyses of 6862 rural soil samples collected at 20 cm depth on a non-aligned grid at one site per 2 km². Censored data were imputed using published detection limits. Using these imputed values for 46 elements (including LOI), each soil sample site was assigned to the regional geology map provided by GSNI, initially using the dominant lithology for the map polygon. Northern Ireland includes a diversity of geology representing a stratigraphic record from the Mesoproterozoic up to and including the Palaeogene. However, the advance of ice sheets and their meltwaters over the last 100,000 years has left at least 80% of the bedrock covered by superficial deposits, including glacial till and post-glacial alluvium and peat. The question is to what extent the soil geochemistry reflects the underlying geology or the superficial deposits. To address this, the geochemical data were transformed using centered log ratios (clr) to observe the requirements of compositional data analysis and avoid closure issues. Following this, compositional multivariate techniques, including compositional Principal Component Analysis (PCA) and minimum/maximum autocorrelation factor (MAF) analysis, were used to determine the influence of underlying geology on the soil geochemistry signature. PCA showed that 72% of the variation was determined by the first four principal components (PCs), implying “significant” structure in the data. Analysis of variance showed that only 10 PCs were necessary to classify the soil geochemical data. To consider an improvement over PCA that uses the spatial relationships of the data, a classification based on MAF analysis was undertaken using the first 6 dominant factors.
Understanding the relationship between soil geochemistry and superficial deposits is important for environmental monitoring of fragile ecosystems such as peat. To explore whether peat cover could be predicted from the classification, the lithology designation was adapted to include the presence of peat, based on GSNI superficial deposit polygons, and linear discriminant analysis (LDA) was undertaken. Prediction accuracy for LDA classification improved from 60.98% based on PCA using 10 principal components to 64.73% using MAF based on the 6 most dominant factors. The misclassification of peat may reflect degradation of peat-covered areas since the superficial deposit classification was created. Further work will examine the influence of underlying lithologies on elemental concentrations in peat composition and the effect of this in classification analysis.
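
The centered log-ratio step mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard clr definition (log of each part minus the mean of the logs); the function name `clr` and the three-part sample are hypothetical and are not taken from the Tellus data or the authors' code.

```python
import math

def clr(composition):
    """Centered log-ratio transform of one strictly positive composition."""
    if any(part <= 0 for part in composition):
        # Zeros or censored values must be imputed before taking logs.
        raise ValueError("clr needs strictly positive parts")
    logs = [math.log(part) for part in composition]
    mean_log = sum(logs) / len(logs)  # log of the geometric mean
    return [value - mean_log for value in logs]

# Hypothetical three-part composition in percent (illustrative only).
sample = [70.0, 20.0, 10.0]
clr_sample = clr(sample)

# Rescaling the composition (percent to fractions) leaves the clr values unchanged,
# which is why the transform sidesteps the closure (constant-sum) problem.
clr_rescaled = clr([part / 100.0 for part in sample])
```

The transformed values sum to zero by construction, so PCA or LDA applied afterwards works on relative rather than absolute concentrations.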

Relevance:

90.00%

Publisher:

Abstract:

Carbon and nitrogen stable isotope values were determined in Pacific white shrimp (Litopenaeus vannamei) with the objective of discriminating animals produced through aquaculture practices from those extracted from the wild. Farmed animals were collected at semi-intensive shrimp farms in Mexico and Ecuador. Fisheries-derived shrimps were caught in different fishing areas representing two estuarine systems and four open sea locations in Mexico and Ecuador. Carbon and nitrogen stable isotope values (δ¹³C_VPDB and δ¹⁵N_AIR) allowed clear differentiation of wild from farmed animals. δ¹³C_VPDB and δ¹⁵N_AIR values in shrimps collected in the open sea were isotopically enriched (−16.99‰ and 11.57‰, respectively), indicating that these organisms belong to higher trophic levels than farmed animals. δ¹³C_VPDB and δ¹⁵N_AIR values of farmed animals (−19.72‰ and 7.85‰, respectively) partially overlapped with values measured in animals collected in estuaries (−18.46‰ and 5.38‰, respectively). Canonical discriminant analysis showed that, whether used separately or in conjunction, δ¹³C_VPDB and δ¹⁵N_AIR values were powerful discriminatory variables, demonstrating the viability of isotopic evaluations to distinguish wild-caught shrimps from aquaculture shrimps. Methodological improvements will define a verification tool to support shrimp traceability protocols.

Relevance:

90.00%

Publisher:

Abstract:

The elemental analysis of soil is useful in forensic and environmental sciences. Methods were developed and optimized for two laser-based multi-element analysis techniques: laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) and laser-induced breakdown spectroscopy (LIBS). This work represents the first use of a 266 nm laser for forensic soil analysis by LIBS. Sample preparation methods were developed and optimized for a variety of sample types, including pellets for large bulk soil specimens (470 mg) and sediment-laden filters (47 mg), and tape-mounting for small transfer evidence specimens (10 mg). Analytical performance for sediment filter pellets and tape-mounted soils was similar to that achieved with bulk pellets. An inter-laboratory comparison exercise was designed to evaluate the performance of the LA-ICP-MS and LIBS methods, as well as micro X-ray fluorescence (µXRF), across multiple laboratories. Limits of detection (LODs) were 0.01-23 ppm for LA-ICP-MS, 0.25-574 ppm for LIBS, and 16-4400 ppm for µXRF, well below the levels normally seen in soils. Good intra-laboratory precision (≤ 6 % relative standard deviation (RSD) for LA-ICP-MS; ≤ 8 % for µXRF; ≤ 17 % for LIBS) and inter-laboratory precision (≤ 19 % for LA-ICP-MS; ≤ 25 % for µXRF) were achieved for most elements, which is encouraging for a first inter-laboratory exercise. While LIBS generally has higher LODs and RSDs than LA-ICP-MS, both were capable of generating good-quality multi-element data sufficient for discrimination purposes. Multivariate methods using principal components analysis (PCA) and linear discriminant analysis (LDA) were developed for discriminating soils from different sources. Specimens from different sites that were indistinguishable by color alone were discriminated by elemental analysis. Correct classification rates of 94.5 % or better were achieved in a simulated forensic discrimination of three similar sites for both LIBS and LA-ICP-MS.
Results for tape-mounted specimens were nearly identical to those achieved with pellets. Methods were tested on soils from the USA, Canada and Tanzania. Within-site heterogeneity was site-specific. Elemental differences were greatest for specimens separated by large distances, even within the same lithology. Elemental profiles can be used to discriminate soils from different locations and narrow down locations even when mineralogy is similar.
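
The precision figures quoted in this abstract are relative standard deviations. A minimal sketch of that calculation follows, assuming the usual definition (sample standard deviation divided by the mean, as a percentage); the replicate values are hypothetical and not from the study.

```python
import statistics

def relative_standard_deviation(values):
    """Percent RSD: sample standard deviation divided by the mean, times 100."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("RSD is undefined when the mean is zero")
    return 100.0 * statistics.stdev(values) / mean

# Hypothetical replicate measurements of one element, in ppm (illustrative only).
replicates_ppm = [98.0, 102.0, 100.0, 101.0, 99.0]
rsd_percent = relative_standard_deviation(replicates_ppm)
```

Intra-laboratory precision would use replicates from one laboratory, while inter-laboratory precision would apply the same formula to the per-laboratory means.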

Relevance:

90.00%

Publisher:

Abstract:

Blast is a major disease of rice in Brazil, the largest rice-producing country outside Asia. This study aimed to assess the genetic structure and mating-type frequency in a contemporary Pyricularia oryzae population, which caused widespread epidemics during the 2012/13 season in the Brazilian lowland subtropical region. Symptomatic leaves and panicles were sampled at flooded rice fields in the states of Rio Grande do Sul (RS, 34 fields) and Santa Catarina (SC, 21 fields). The polymorphism at ten simple sequence repeat (SSR or microsatellite) loci and the presence of MAT1-1 or MAT1-2 idiomorphs were assessed in a population comprised of 187 isolates. Only the MAT1-2 idiomorph was found, and 162 genotypes were identified by the SSR analysis. A discriminant analysis of principal components (DAPC) of SSR data resolved four genetic groups, which were strongly associated with the cultivar of origin of the isolates. There was a high level of genotypic diversity and a moderate level of gene diversity regardless of whether isolates were grouped in subpopulations based on geographic region, cultivar host or cultivar within region. While regional subpopulations were weakly differentiated, high genetic differentiation was found among subpopulations comprised of isolates from different cultivars. The data suggest that the rice blast pathogen population in southern Brazil is comprised of clonal lineages that are adapting to specific cultivar hosts. Farmers should avoid the use of susceptible cultivars over large areas, and breeders should focus on broadening the genetic basis of new cultivars.

Relevance:

80.00%

Publisher:

Abstract:

Frankfurters are widely consumed all over the world, and their production requires a wide range of meat and non-meat ingredients. Because of these characteristics, frankfurters can easily be adulterated with lower-value meats and undeclared species. Adulterations are often difficult to detect because the adulterant components are usually very similar to the authentic product. In this work, FT-Raman spectroscopy was employed as a rapid technique for assessing the quality of frankfurters. Based on information provided by the Raman spectra, a multivariate classification model was developed to identify the frankfurter type. The aim was to study three types of frankfurters (chicken, turkey and mixed meat) according to their Raman spectra, based on the fatty vibrational bands. The classification model was built using partial least squares discriminant analysis (PLS-DA), and model performance was evaluated in terms of sensitivity, specificity, accuracy, efficiency and Matthews's correlation coefficient. The PLS-DA models gave sensitivity and specificity values on the test set in the range of 88%-100%, showing good performance of the classification models. The work shows that Raman spectroscopy with chemometric tools can be used as an analytical tool in the quality control of frankfurters.
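
Several of the performance figures named in this abstract can be derived from a one-versus-rest confusion matrix. The sketch below assumes the textbook definitions of sensitivity, specificity, accuracy and the Matthews correlation coefficient; the counts are hypothetical, and "efficiency" is left out because the abstract does not define it.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Standard figures of merit for one class treated as 'positive'."""
    sensitivity = tp / (tp + fn)   # true positive rate
    specificity = tn / (tn + fp)   # true negative rate
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }

# Hypothetical test-set counts for one frankfurter class (illustrative only).
metrics = binary_metrics(tp=22, fp=1, tn=49, fn=3)
```

For a three-class problem such as this one, the same function would be called once per class, each time pooling the other two classes as negatives.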

Relevance:

80.00%

Publisher:

Abstract:

A method using the ring-oven technique for pre-concentration in filter paper discs and near infrared hyperspectral imaging is proposed to identify four detergent and dispersant additives, and to determine their concentration in gasoline. Different approaches were used to select the best image data processing in order to gather the relevant spectral information. This was attained by selecting the pixels of the region of interest (ROI), using a pre-calculated threshold value of the PCA scores arranged as histograms, to select the spectra set; summing up the selected spectra to achieve representativeness; and compensating for the superimposed filter paper spectral information, also supported by scores histograms for each individual sample. The best classification model was achieved using linear discriminant analysis and genetic algorithm (LDA/GA), whose correct classification rate in the external validation set was 92%. Previous classification of the type of additive present in the gasoline is necessary to define the PLS model required for its quantitative determination. Considering that two of the additives studied present high spectral similarity, a PLS regression model was constructed to predict their content in gasoline, while two additional models were used for the remaining additives. The results for the external validation of these regression models showed a mean percentage error of prediction varying from 5 to 15%.

Relevance:

80.00%

Publisher:

Abstract:

OBJECTIVE: The aim of this study was to translate the Structured Clinical Interview for Mood Spectrum into Brazilian Portuguese, measure its reliability and validity, and define scores for bipolar disorders. METHOD: The questionnaire was translated into Brazilian Portuguese and back-translated into English. The sample consisted of 47 subjects with bipolar disorder, 47 with major depressive disorder (MDD), 18 with schizophrenia and 22 controls. Inter-rater reliability was tested in 20 subjects with bipolar disorder and MDD. Internal consistency was measured using the Kuder-Richardson formula. Forward stepwise discriminant analysis was performed. Scores were compared between groups; manic (M), depressive (D) and total (T) threshold scores were calculated through receiver operating characteristic (ROC) curves. RESULTS: Kuder-Richardson coefficients were between 0.86 and 0.94. The intraclass correlation coefficient was 0.96 (95% CI 0.93-0.97). Subjects with bipolar disorder had higher M and T scores, and similar D scores, when compared to those with major depressive disorder (ANOVA, p < 0.001). The sub-domains that best discriminated unipolar and bipolar subjects were manic energy and manic mood. M had the best area under the curve (0.909), and values of M equal to or greater than 30 yielded 91.5% sensitivity and 74.5% specificity. CONCLUSION: The Structured Clinical Interview for Mood Spectrum has good reliability and validity. A cut-off score of 30 or higher in the mania sub-domain best distinguishes subjects with bipolar disorder from those with unipolar depression.
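
How a threshold score is read off a ROC analysis can be sketched as follows. The abstract does not say which criterion was used to pick the cut-off, so maximizing Youden's J is an assumption here, and the scores are hypothetical toy values on a different scale from the questionnaire.

```python
def sensitivity_specificity(case_scores, control_scores, cutoff):
    """Scores at or above the cutoff are called positive."""
    true_positives = sum(score >= cutoff for score in case_scores)
    true_negatives = sum(score < cutoff for score in control_scores)
    return true_positives / len(case_scores), true_negatives / len(control_scores)

def best_cutoff(case_scores, control_scores):
    """Cutoff maximizing Youden's J = sensitivity + specificity - 1 (assumed criterion)."""
    candidates = sorted(set(case_scores) | set(control_scores))

    def youden(cutoff):
        sens, spec = sensitivity_specificity(case_scores, control_scores, cutoff)
        return sens + spec - 1.0

    return max(candidates, key=youden)

# Hypothetical scores (illustrative only, not study data).
cases = [7, 9, 6, 5, 10, 8]
controls = [1, 3, 4, 2, 3, 6]

cutoff = best_cutoff(cases, controls)
sens, spec = sensitivity_specificity(cases, controls, cutoff)
```

Each candidate cutoff gives one point on the ROC curve, and the chosen cutoff is the point that best trades sensitivity against specificity under the stated criterion.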

Relevance:

80.00%

Publisher:

Abstract:

The concentrations of 15 polycyclic aromatic hydrocarbons (PAHs) in 57 samples of distillates (cachaça, rum, whiskey, and alcohol fuel) were determined by HPLC with fluorescence detection. The quantitative analytical profile of PAHs treated by Partial Least Squares Discriminant Analysis (PLS-DA) provided a good classification of the studied spirits based on their PAH content. Additionally, the sugar cane derivatives were classified according to harvest practice by treating the analytical data with Linear Discriminant Analysis (LDA), using naphthalene, acenaphthene, fluorene, phenanthrene, anthracene, fluoranthene, pyrene, benz[a]anthracene, benz[b]fluoranthene, and benz[g,h,i]perylene as chemical descriptors.

Relevance:

80.00%

Publisher:

Abstract:

One hundred fifteen cachaça samples derived from distillation in copper stills (73) or in stainless steel stills (42) were analyzed for thirty-five items by chromatography and inductively coupled plasma optical emission spectrometry. The analytical data were treated through Factor Analysis (FA), Partial Least Squares Discriminant Analysis (PLS-DA) and Quadratic Discriminant Analysis (QDA). The FA explained 66.0% of the database variance. PLS-DA showed that it is possible to distinguish between the two groups of cachaças with 52.8% of the database variance. QDA was used to build a classification model using acetaldehyde, ethyl carbamate, isobutyl alcohol, benzaldehyde, acetic acid and formaldehyde as chemical descriptors. The model showed 91.7% accuracy in predicting the apparatus in which unknown samples were distilled.

Relevance:

80.00%

Publisher:

Abstract:

This work proposes a new approach using a committee machine of artificial neural networks to classify masses found in mammograms as benign or malignant. Three shape factors, three edge-sharpness measures, and 14 texture measures are used for the classification of 20 regions of interest (ROIs) related to malignant tumors and 37 ROIs related to benign masses. A group of multilayer perceptrons (MLPs) is employed as a committee machine of neural network classifiers. The classification results are reached by combining the responses of the individual classifiers. Experiments involving changes in the learning algorithm of the committee machine are conducted. The classification accuracy is evaluated using the area A_z under the receiver operating characteristic (ROC) curve. The A_z result for the committee machine is compared with the A_z results obtained using MLPs and single-layer perceptrons (SLPs), as well as a linear discriminant analysis (LDA) classifier. Tests are carried out using the Student's t-distribution. The committee machine classifier outperforms the MLP, SLP, and LDA classifiers in the following cases: with the shape measure of spiculation index, the A_z values of the four methods are, in order, 0.93, 0.84, 0.75, and 0.76; and with the edge-sharpness measure of acutance, the values are 0.79, 0.70, 0.69, and 0.74. Although the features with which improvement is obtained with the committee machines are not the same as those that provided the maximal value of A_z (A_z = 0.99 with some shape features, with or without the committee machine), they correspond to features that are not critically dependent on the accuracy of the boundaries of the masses, which is an important result. (c) 2008 SPIE and IS&T.
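
The idea of combining classifier responses and scoring the result by the area under the ROC curve can be sketched as below. Two assumptions are made: the committee output is a plain average of the member outputs (the abstract does not state the combination rule), and the area is estimated with the Mann-Whitney rank statistic as a stand-in for whatever A_z estimator the authors used. All output values are hypothetical.

```python
def auc(positive_scores, negative_scores):
    """Area under the ROC curve via the Mann-Whitney statistic (ties count as half)."""
    wins = 0.0
    for positive in positive_scores:
        for negative in negative_scores:
            if positive > negative:
                wins += 1.0
            elif positive == negative:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))

def committee_average(member_outputs):
    """Average the per-case outputs of several classifiers (assumed combination rule)."""
    n_members = len(member_outputs)
    return [sum(case_outputs) / n_members for case_outputs in zip(*member_outputs)]

# Hypothetical outputs of three classifiers for six cases:
# the first three cases are malignant, the last three benign.
member_a = [0.9, 0.4, 0.8, 0.3, 0.5, 0.2]
member_b = [0.6, 0.7, 0.5, 0.4, 0.1, 0.6]
member_c = [0.8, 0.6, 0.3, 0.2, 0.4, 0.5]

combined = committee_average([member_a, member_b, member_c])
auc_members = [auc(m[:3], m[3:]) for m in (member_a, member_b, member_c)]
auc_committee = auc(combined[:3], combined[3:])
```

In this toy case the averaged output separates the two groups better than any single member, because the members make their errors on different cases.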

Relevance:

80.00%

Publisher:

Abstract:

Medium density fiberboard (MDF) is an engineered wood product formed by breaking down selected lignocellulosic material residuals into fibers, combining them with wax and a resin binder, and then forming panels by applying high temperature and pressure. Because the raw material in the industrial process is ever-changing, the panel industry requires methods for monitoring the composition of its products. The aim of this study was to estimate the ratio of sugarcane (SC) bagasse to Eucalyptus wood in MDF panels using near infrared (NIR) spectroscopy. Principal component analysis (PCA) and partial least squares (PLS) regressions were performed. MDF panels having different bagasse contents were easily distinguished from each other by the PCA of their NIR spectra, with clearly different patterns of response. The PLS-R models for SC content of these MDF samples presented a strong coefficient of determination (0.96) between the NIR-predicted and lab-determined values and a low standard error of prediction (approximately 1.5%) in the cross-validations. A key role of resins (adhesives), cellulose, and lignin in such PLS-R calibrations was shown. The PLS-DA model correctly classified 94% of MDF samples in cross-validation and 98% of the panels in an independent test set. These NIR-based models can be useful to quickly estimate the sugarcane bagasse to Eucalyptus wood content ratio in unknown MDF samples and to verify the quality of these engineered wood products in an online process.

Relevance:

80.00%

Publisher:

Abstract:

Online music databases have increased significantly as a consequence of the rapid growth of the Internet and digital audio, requiring the development of faster and more efficient tools for music content analysis. Musical genres are widely used to organize music collections. In this paper, the problem of automatic single and multi-label music genre classification is addressed by exploring rhythm-based features obtained from a respective complex network representation. A Markov model is built in order to analyse the temporal sequence of rhythmic notation events. Feature analysis is performed by using two multi-variate statistical approaches: principal components analysis (unsupervised) and linear discriminant analysis (supervised). Similarly, two classifiers are applied in order to identify the category of rhythms: parametric Bayesian classifier under the Gaussian hypothesis (supervised) and agglomerative hierarchical clustering (unsupervised). Qualitative results obtained by using the kappa coefficient and the obtained clusters corroborated the effectiveness of the proposed method.

Relevance:

80.00%

Publisher:

Abstract:

Quality control of toys to avoid children's exposure to potentially toxic elements is of utmost relevance, and it is a common requirement in national and/or international norms for health and safety reasons. Laser-induced breakdown spectroscopy (LIBS) was recently evaluated at the authors' laboratory for direct analysis of plastic toys, and one of the main difficulties in the determination of Cd, Cr and Pb was the variety of mixtures and types of polymers. As most norms rely on migration (lixiviation) protocols, chemometric classification models from LIBS spectra were tested for sampling toys that present a potential risk of Cd, Cr and Pb contamination. The classification models were generated from the emission spectra of 51 polymeric toys using Partial Least Squares Discriminant Analysis (PLS-DA), Soft Independent Modeling of Class Analogy (SIMCA) and K-Nearest Neighbor (KNN). The classification models and validations were carried out with 40 and 11 test samples, respectively. The best results were obtained when KNN was used, with correct predictions varying from 95% for Cd to 100% for Cr and Pb. (C) 2011 Elsevier B.V. All rights reserved.
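
The K-Nearest Neighbor rule that performed best in this abstract can be illustrated with a minimal sketch. It assumes Euclidean distance and a simple majority vote; the two-dimensional feature vectors and the "low"/"high" labels are hypothetical stand-ins for real LIBS emission spectra and contamination classes.

```python
import math
from collections import Counter

def knn_predict(train_points, train_labels, query, k=3):
    """Label a query point by majority vote among its k nearest training points."""
    neighbours = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    votes = Counter(label for _, label in neighbours[:k])
    return votes.most_common(1)[0][0]

# Hypothetical two-feature training set (illustrative only).
train_points = [
    (1.0, 1.1), (0.9, 1.0), (1.2, 0.8),   # low-risk examples
    (3.0, 3.2), (3.1, 2.9), (2.8, 3.0),   # high-risk examples
]
train_labels = ["low", "low", "low", "high", "high", "high"]

prediction_near_low = knn_predict(train_points, train_labels, (1.0, 0.9))
prediction_near_high = knn_predict(train_points, train_labels, (3.0, 3.0))
```

With real spectra, each point would hold many emission intensities, and k would normally be chosen by cross-validation on the calibration samples.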

Relevance:

80.00%

Publisher:

Abstract:

This study analyzed inter-individual variability of the temporal structure applied in basketball throwing. Ten male athletes experienced in basketball throwing were filmed, and a number of kinematic movement parameters were analyzed. A biomechanical model provided the relative timing of the shoulder, elbow and wrist joint movements. Inter-individual variability was analyzed using the sequencing and relative timing of ten phases of the throw. To compare the variability of the movement phases between subjects, a discriminant analysis and an ANOVA were applied. The Tukey test was applied to determine where differences occurred. The significance level was p = 0.05. Inter-individual variability was explained by three concomitant factors: (a) a precision control strategy, (b) a velocity control strategy and (c) intrinsic characteristics of the subjects. Therefore, despite the fact that some actions are common to the basketball throwing pattern, each performer demonstrated particular and individual characteristics.

Relevance:

80.00%

Publisher:

Abstract:

Solid waste from the automobile industry containing large amounts of heavy metals might affect the emission of greenhouse gases (GHG) when applied to the soil. Accumulation of inorganic chemical elements in the environment generally occurs due to human activity (industry, agriculture, mining and waste landfills). Residues from human activities may release heavy metals to the soil solution, causing toxicity to plants and other soil organisms. Heavy metals may also be adsorbed to clay minerals and/or complexed by the soil organic matter, becoming a potential source of pollutants. Not much is known about the behavior of solid wastes in tropical soil as a source of GHG. The emission of GHG (CO₂, CH₄ and N₂O) was evaluated in incubated soil samples collected in an area contaminated with a solid residue from an automobile industry. Samples were randomly collected at 0 to 0.2 m (a mix of soil and residue), 0.2 to 0.4 m (only residue) and 0.4 to 0.6 m (only soil). A contiguous uncontaminated area, cultivated with sugarcane, was also sampled following the same protocol. Canonical Discriminant Analysis and Principal Component Analysis were applied to the data to evaluate the GHG emission rates. Emission rates of GHG were greater in the samples from the contaminated area than in those from the sugarcane area, and were particularly high during the first days of incubation. CO₂ emissions were greater in samples collected from the upper layer in both areas, while CH₄ and N₂O emissions were similar in all samples. The emission rates of CH₄ were the most efficient variables to differentiate contaminated and uncontaminated areas.