945 resultados para Stepwise Discriminant Analysis
Resumo:
A microwave-assisted extraction (MAE) procedure to isolate phenolic compounds from almond skin byproducts was optimized. A three-level, three-factor Box–Behnken design was used to evaluate the effect of almond skin weight, microwave power, and irradiation time on total phenolic content (TPC) and antioxidant activity (DPPH). Almond skin weight was the most important parameter in the studied responses. The best extraction was achieved using 4 g, 60 s, 100 W, and 60 mL of 70% (v/v) ethanol. TPC, antioxidant activity (DPPH, FRAP), and chemical composition (HPLC-DAD-ESI-MS/MS) were determined by using the optimized method from seven different almond cultivars. Successful discrimination was obtained for all cultivars by using multivariate linear discriminant analysis (LDA), suggesting the influence of cultivar type on polyphenol content and antioxidant activity. The results show the potential of almond skin as a natural source of phenolics and the effectiveness of MAE for the reutilization of these byproducts.
Resumo:
This research evaluates pattern recognition techniques on a subclass of big data where the dimensionality of the input space (p) is much larger than the number of observations (n). Specifically, we evaluate massive gene expression microarray cancer data where the ratio κ is less than one. We explore the statistical and computational challenges inherent in these high dimensional low sample size (HDLSS) problems and present statistical machine learning methods used to tackle and circumvent these difficulties. Regularization and kernel algorithms were explored in this research using seven datasets where κ < 1. These techniques require special attention to tuning necessitating several extensions of cross-validation to be investigated to support better predictive performance. While no single algorithm was universally the best predictor, the regularization technique produced lower test errors in five of the seven datasets studied.
Resumo:
The article investigates the division between member states of the European Union considering the aspect of their level of information and communication technology (ICT) development focusing on e-learning. With the help of discriminant analysis the countries are categorized into groups based on their ICT maturity and e-learning literacy level of development. Making a comparison with a benchmarking tool, the ITU (International Telecommunication Union)’s ICT Development Index (IDI) the results are confirmed partly correct. The article tries to find economical explanations for the re-grouping of the countries ranking. Finally the author examines the reliability of Hungary’s ranking results and the factors which may affect this divergence from the real picture.
Resumo:
This study is an exploratory analysis of an operational measure for resource development strategies, and an exploratory analysis of internal organizational contingencies influencing choices of these strategies in charitable nonprofit organizations. The study provides conceptual guidance for advancing understanding about resource development in the nonprofit sector. The statistical findings are, however, inconclusive without further rigorous examination. A three category typology based on organization technology is initially presented to define the strategies. Three dimensions of internal organizational contingencies explored represent organization identity, professional staff, and boards of directors. Based on relevant literature and key informant interviews, an original survey was administered by mail to a national sample of nonprofit organizations. The survey collected data on indicators of the proposed strategy types and selected contingencies. Factor analysis extracted two of the initial categories in the typology. The Building Resource Development Infrastructure Strategy encompasses information technology, personnel, legal structures, and policies facilitating fund development. The Building Resource Development Infrastructure Strategy encompasses the mission, service niche, and type of service delivery forming the basis for seeking financial support. Linear regressions with each strategy type as the dependent variable identified distinct and common contingencies which may partly explain choices of strategies. Discriminant analysis suggests the potential predictive accuracy of the contingencies. Follow-up case studies with survey respondents provide additional criteria for operationalizing future measures of resource development strategies, and support and expand the analysis on contingencies. The typology offers a beginning framework for defining alternative approaches to resource development, and for exploring organization capacity specific to each approach. Contingencies that may be integral components of organization capacity are funding, leadership frame, background and experience, staff and volunteer effort, board member support, and relationships in the external environment. Based on these findings, management questions are offered for nonprofit organization stakeholders to consider in planning for resource development. Lessons learned in designing and conducting this study are also provided to enhance future related research. ^
Resumo:
The elemental analysis of soil is useful in forensic and environmental sciences. Methods were developed and optimized for two laser-based multi-element analysis techniques: laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) and laser-induced breakdown spectroscopy (LIBS). This work represents the first use of a 266 nm laser for forensic soil analysis by LIBS. Sample preparation methods were developed and optimized for a variety of sample types, including pellets for large bulk soil specimens (470 mg) and sediment-laden filters (47 mg), and tape-mounting for small transfer evidence specimens (10 mg). Analytical performance for sediment filter pellets and tape-mounted soils was similar to that achieved with bulk pellets. An inter-laboratory comparison exercise was designed to evaluate the performance of the LA-ICP-MS and LIBS methods, as well as for micro X-ray fluorescence (μXRF), across multiple laboratories. Limits of detection (LODs) were 0.01-23 ppm for LA-ICP-MS, 0.25-574 ppm for LIBS, 16-4400 ppm for μXRF, and well below the levels normally seen in soils. Good intra-laboratory precision (≤ 6 % relative standard deviation (RSD) for LA-ICP-MS; ≤ 8 % for μXRF; ≤ 17 % for LIBS) and inter-laboratory precision (≤ 19 % for LA-ICP-MS; ≤ 25 % for μXRF) were achieved for most elements, which is encouraging for a first inter-laboratory exercise. While LIBS generally has higher LODs and RSDs than LA-ICP-MS, both were capable of generating good quality multi-element data sufficient for discrimination purposes. Multivariate methods using principal components analysis (PCA) and linear discriminant analysis (LDA) were developed for discriminations of soils from different sources. Specimens from different sites that were indistinguishable by color alone were discriminated by elemental analysis. Correct classification rates of 94.5 % or better were achieved in a simulated forensic discrimination of three similar sites for both LIBS and LA-ICP-MS. Results for tape-mounted specimens were nearly identical to those achieved with pellets. Methods were tested on soils from USA, Canada and Tanzania. Within-site heterogeneity was site-specific. Elemental differences were greatest for specimens separated by large distances, even within the same lithology. Elemental profiles can be used to discriminate soils from different locations and narrow down locations even when mineralogy is similar.
Resumo:
Classification procedures, including atmospheric correction satellite images as well as classification performance utilizing calibration and validation at different levels, have been investigated in the context of a coarse land-cover classification scheme for the Pachitea Basin. Two different correction methods were tested against no correction in terms of reflectance correction towards a common response for pseudo-invariant features (PIF). The accuracy of classifications derived from each of the three methods was then assessed in a discriminant analysis using crossvalidation at pixel, polygon, region, and image levels. Results indicate that only regression adjusted images using PIFs show no significant difference between images in any of the bands. A comparison of classifications at different levels suggests though that at pixel, polygon, and region levels the accuracy of the classifications do not significantly differ between corrected and uncorrected images. Spatial patterns of land-cover were analyzed in terms of colonization history, infrastructure, suitability of the land, and landownership. The actual use of the land is driven mainly by the ability to access the land and markets as is obvious in the distribution of land cover as a function of distance to rivers and roads. When considering all rivers and roads a threshold distance at which disproportional agro-pastoral land cover switches from over represented to under represented is at about 1km. Best land use suggestions seem not to affect the choice of land use. Differences in abundance of land cover between watersheds are more prevailing than differences between colonist and indigenous groups.
Resumo:
Bird vocalisations are often essential for sex recognition, especially in species that show little morphological sex dimorphism. Brown skuas (Catharacta antarctica lonnbergi), which exhibit uniform plumage across both sexes, emit three main calls: the long call, the alarm call and the contact call. We tested the potential for sex recognition in brown skua calls of 42 genetically sexed individuals by analysing 8-12 acoustic parameters in the temporal and frequency domains of each call type. For every call type, we failed to find sex differences in any of the acoustic parameters measured. Stepwise discriminant function analysis (DFA) revealed that sexes cannot be unambiguously classified, with increasing uncertainty of correct classification from contact calls to long calls to alarm calls. Consequently, acoustic signalling is probably not the key mechanism for sex recognition in brown skuas.
Resumo:
This work outlines the theoretical advantages of multivariate methods in biomechanical data, validates the proposed methods and outlines new clinical findings relating to knee osteoarthritis that were made possible by this approach. New techniques were based on existing multivariate approaches, Partial Least Squares (PLS) and Non-negative Matrix Factorization (NMF) and validated using existing data sets. The new techniques developed, PCA-PLS-LDA (Principal Component Analysis – Partial Least Squares – Linear Discriminant Analysis), PCA-PLS-MLR (Principal Component Analysis – Partial Least Squares –Multiple Linear Regression) and Waveform Similarity (based on NMF) were developed to address the challenging characteristics of biomechanical data, variability and correlation. As a result, these new structure-seeking technique revealed new clinical findings. The first new clinical finding relates to the relationship between pain, radiographic severity and mechanics. Simultaneous analysis of pain and radiographic severity outcomes, a first in biomechanics, revealed that the knee adduction moment’s relationship to radiographic features is mediated by pain in subjects with moderate osteoarthritis. The second clinical finding was quantifying the importance of neuromuscular patterns in brace effectiveness for patients with knee osteoarthritis. I found that brace effectiveness was more related to the patient’s unbraced neuromuscular patterns than it was to mechanics, and that these neuromuscular patterns were more complicated than simply increased overall muscle activity, as previously thought.
Resumo:
A compositional multivariate approach is used to analyse regional scale soil geochemical data obtained as part of the Tellus Project generated by the Geological Survey Northern Ireland (GSNI). The multi-element total concentration data presented comprise XRF analyses of 6862 rural soil samples collected at 20cm depths on a non-aligned grid at one site per 2 km2. Censored data were imputed using published detection limits. Using these imputed values for 46 elements (including LOI), each soil sample site was assigned to the regional geology map provided by GSNI initially using the dominant lithology for the map polygon. Northern Ireland includes a diversity of geology representing a stratigraphic record from the Mesoproterozoic, up to and including the Palaeogene. However, the advance of ice sheets and their meltwaters over the last 100,000 years has left at least 80% of the bedrock covered by superficial deposits, including glacial till and post-glacial alluvium and peat. The question is to what extent the soil geochemistry reflects the underlying geology or superficial deposits. To address this, the geochemical data were transformed using centered log ratios (clr) to observe the requirements of compositional data analysis and avoid closure issues. Following this, compositional multivariate techniques including compositional Principal Component Analysis (PCA) and minimum/maximum autocorrelation factor (MAF) analysis method were used to determine the influence of underlying geology on the soil geochemistry signature. PCA showed that 72% of the variation was determined by the first four principal components (PC’s) implying “significant” structure in the data. Analysis of variance showed that only 10 PC’s were necessary to classify the soil geochemical data. To consider an improvement over PCA that uses the spatial relationships of the data, a classification based on MAF analysis was undertaken using the first 6 dominant factors. Understanding the relationship between soil geochemistry and superficial deposits is important for environmental monitoring of fragile ecosystems such as peat. To explore whether peat cover could be predicted from the classification, the lithology designation was adapted to include the presence of peat, based on GSNI superficial deposit polygons and linear discriminant analysis (LDA) undertaken. Prediction accuracy for LDA classification improved from 60.98% based on PCA using 10 principal components to 64.73% using MAF based on the 6 most dominant factors. The misclassification of peat may reflect degradation of peat covered areas since the creation of superficial deposit classification. Further work will examine the influence of underlying lithologies on elemental concentrations in peat composition and the effect of this in classification analysis.
Resumo:
Carbon and nitrogen stable isotope values were determined in Pacific white shrimp (Litopenaeus vannamei) with the objective of discriminating animals produced through aquaculture practices from those extracted from the wild. Farmed animals were collected at semi-intensive shrimp farms in Mexico and Ecuador. Fisheries-derived shrimps were caught in different fishing areas representing two estuarine systems and four open sea locations in Mexico and Ecuador. Carbon and nitrogen stable isotope values (13CVPDB and 15NAIR) allowed clear differentiation of wild from farmed animals. 13CVPDB and 15NAIR values in shrimps collected in the open sea were isotopically enriched (−16.99‰ and 11.57‰), indicating that these organisms belong to higher trophic levels than farmed animals. 13CVPDB and 15NAIR values of farmed animals (−19.72‰ and 7.85‰, respectively) partially overlapped with values measured in animals collected in estuaries (−18.46‰ and 5.38‰, respectively). Canonical discriminant analysis showed that when used separately and in conjunction, 13CVPDB and I5NAIR values were powerful discriminatory variables and demonstrate the viability of isotopic evaluations to distinguish wild-caught shrimps from aquaculture shrimps. Methodological improvements will define a verification tool to support shrimp traceability protocols.
Resumo:
The elemental analysis of soil is useful in forensic and environmental sciences. Methods were developed and optimized for two laser-based multi-element analysis techniques: laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) and laser-induced breakdown spectroscopy (LIBS). This work represents the first use of a 266 nm laser for forensic soil analysis by LIBS. Sample preparation methods were developed and optimized for a variety of sample types, including pellets for large bulk soil specimens (470 mg) and sediment-laden filters (47 mg), and tape-mounting for small transfer evidence specimens (10 mg). Analytical performance for sediment filter pellets and tape-mounted soils was similar to that achieved with bulk pellets. An inter-laboratory comparison exercise was designed to evaluate the performance of the LA-ICP-MS and LIBS methods, as well as for micro X-ray fluorescence (μXRF), across multiple laboratories. Limits of detection (LODs) were 0.01-23 ppm for LA-ICP-MS, 0.25-574 ppm for LIBS, 16-4400 ppm for µXRF, and well below the levels normally seen in soils. Good intra-laboratory precision (≤ 6 % relative standard deviation (RSD) for LA-ICP-MS; ≤ 8 % for µXRF; ≤ 17 % for LIBS) and inter-laboratory precision (≤ 19 % for LA-ICP-MS; ≤ 25 % for µXRF) were achieved for most elements, which is encouraging for a first inter-laboratory exercise. While LIBS generally has higher LODs and RSDs than LA-ICP-MS, both were capable of generating good quality multi-element data sufficient for discrimination purposes. Multivariate methods using principal components analysis (PCA) and linear discriminant analysis (LDA) were developed for discriminations of soils from different sources. Specimens from different sites that were indistinguishable by color alone were discriminated by elemental analysis. Correct classification rates of 94.5 % or better were achieved in a simulated forensic discrimination of three similar sites for both LIBS and LA-ICP-MS. Results for tape-mounted specimens were nearly identical to those achieved with pellets. Methods were tested on soils from USA, Canada and Tanzania. Within-site heterogeneity was site-specific. Elemental differences were greatest for specimens separated by large distances, even within the same lithology. Elemental profiles can be used to discriminate soils from different locations and narrow down locations even when mineralogy is similar.
Resumo:
Blast is a major disease of rice in Brazil, the largest rice-producing country outside Asia. This study aimed to assess the genetic structure and mating-type frequency in a contemporary Pyricularia oryzae population, which caused widespread epidemics during the 2012/13 season in the Brazilian lowland subtropical region. Symptomatic leaves and panicles were sampled at flooded rice fields in the states of Rio Grande do Sul (RS, 34 fields) and Santa Catarina (SC, 21 fields). The polymorphism at ten simple sequence repeats (SSR or microsatellite) loci and the presence of MAT1-1 or MAT1-2 idiomorphs were assessed in a population comprised of 187 isolates. Only the MAT1-2 idiomorph was found and 162 genotypes were identified by the SSR analysis. A discriminant analysis of principal components (DAPC) of SSR data resolved four genetic groups, which were strongly associated with the cultivar of origin of the isolates. There was high level of genotypic diversity and moderate level of gene diversity regardless whether isolates were grouped in subpopulations based on geographic region, cultivar host or cultivar within region. While regional subpopulations were weakly differentiated, high genetic differentiation was found among subpopulations comprised of isolates from different cultivars. The data suggest that the rice blast pathogen population in southern Brazil is comprised of clonal lineages that are adapting to specific cultivar hosts. Farmers should avoid the use of susceptible cultivars over large areas and breeders should focus at enlarging the genetic basis of new cultivars.
Resumo:
Frankfurters are widely consumed all over the world, and the production requires a wide range of meat and non-meat ingredients. Due to these characteristics, frankfurters are products that can be easily adulterated with lower value meats, and the presence of undeclared species. Adulterations are often still difficult to detect, due the fact that the adulterant components are usually very similar to the authentic product. In this work, FT-Raman spectroscopy was employed as a rapid technique for assessing the quality of frankfurters. Based on information provided by the Raman spectra, a multivariate classification model was developed to identify the frankfurter type. The aim was to study three types of frankfurters (chicken, turkey and mixed meat) according to their Raman spectra, based on the fatty vibrational bands. Classification model was built using partial least square discriminant analysis (PLS-DA) and the performance model was evaluated in terms of sensitivity, specificity, accuracy, efficiency and Matthews's correlation coefficient. The PLS-DA models give sensitivity and specificity values on the test set in the ranges of 88%-100%, showing good performance of the classification models. The work shows the Raman spectroscopy with chemometric tools can be used as an analytical tool in quality control of frankfurters.
Resumo:
The role of key cell cycle regulation genes such as, CDKN1B, CDKN2A, CDKN2B, and CDKN2C in sporadic medullary thyroid carcinoma (s-MTC) is still largely unknown. In order to evaluate the influence of inherited polymorphisms of these genes on the pathogenesis of s-MTC, we used TaqMan SNP genotyping to examine 45 s-MTC patients carefully matched with 98 controls. A multivariate logistic regression analysis demonstrated that CDKN1B and CDKN2A genes were related to s-MTC susceptibility. The rs2066827*GT+GG CDKN1B genotype was more frequent in s-MTC patients (62.22%) than in controls (40.21%), increasing the susceptibility to s-MTC (OR=2.47; 95% CI=1.048-5.833; P=0.038). By contrast, the rs11515*CG+GG of CDKN2A gene was more frequent in the controls (32.65%) than in patients (15.56%), reducing the risk for s-MTC (OR=0.174; 95% CI=0.048-0.627; P=0.0075). A stepwise regression analysis indicated that two genotypes together could explain 11% of the total s-MTC risk. In addition, a relationship was found between disease progression and the presence of alterations in the CDKN1A (rs1801270), CDKN2C (rs12885), and CDKN2B (rs1063192) genes. WT rs1801270 CDKN1A patients presented extrathyroidal tumor extension more frequently (92%) than polymorphic CDKN1A rs1801270 patients (50%; P=0.0376). Patients with the WT CDKN2C gene (rs12885) presented larger tumors (2.9±1.8 cm) than polymorphic patients (1.5±0.7 cm; P=0.0324). On the other hand, patients with the polymorphic CDKN2B gene (rs1063192) presented distant metastases (36.3%; P=0.0261). In summary, we demonstrated that CDKN1B and CDKN2A genes are associated with susceptibility, whereas the inherited genetic profile of CDKN1A, CDKN2B, and CDKN2C is associated with aggressive features of tumors. This study suggests that profiling cell cycle genes may help define the risk and characterize s-MTC aggressiveness.
Resumo:
A method using the ring-oven technique for pre-concentration in filter paper discs and near infrared hyperspectral imaging is proposed to identify four detergent and dispersant additives, and to determine their concentration in gasoline. Different approaches were used to select the best image data processing in order to gather the relevant spectral information. This was attained by selecting the pixels of the region of interest (ROI), using a pre-calculated threshold value of the PCA scores arranged as histograms, to select the spectra set; summing up the selected spectra to achieve representativeness; and compensating for the superimposed filter paper spectral information, also supported by scores histograms for each individual sample. The best classification model was achieved using linear discriminant analysis and genetic algorithm (LDA/GA), whose correct classification rate in the external validation set was 92%. Previous classification of the type of additive present in the gasoline is necessary to define the PLS model required for its quantitative determination. Considering that two of the additives studied present high spectral similarity, a PLS regression model was constructed to predict their content in gasoline, while two additional models were used for the remaining additives. The results for the external validation of these regression models showed a mean percentage error of prediction varying from 5 to 15%.