196 resultados para Chemometrics
Resumo:
Current methods for quality control of sugar cane are performed in extracted juice using several methodologies, often requiring appreciable time and chemicals (eventually toxic), making the methods not green and expensive. The present study proposes the use of X-ray spectrometry together with chemometric methods as an innovative and alternative technique for determining sugar cane quality parameters, specifically sucrose concentration, POL, and fiber content. Measurements in stem, leaf, and juice were performed, and those applied directly in stem provided the best results. Prediction models for sugar cane stem determinations with a single 60 s irradiation using portable X-ray fluorescence equipment allows estimating the % sucrose, % fiber, and POL simultaneously. Average relative deviations in the prediction step of around 8% are acceptable if considering that field measurements were done. These results may indicate the best period to cut a particular crop as well as for evaluating the quality of sugar cane for the sugar and alcohol industries.
Resumo:
Gunshot residues (GSR) can be used in forensic evaluations to obtain information about the type of gun and ammunition used in a crime. In this work, we present our efforts to develop a promising new method to discriminate the type of gun [four different guns were used: two handguns (0.38 revolver and 0.380 pistol) and two long-barrelled guns (12-calibre pump-action shotgun and 0.38 repeating rifle)] and ammunition (five different types: normal, semi-jacketed, full-jacketed, green, and 3T) used by a suspect. The proposed approach is based on information obtained from cyclic voltammograms recorded in solutions containing GSR collected from the hands of the shooters, using a gold microelectrode; the information was further analysed by non-supervised pattern-recognition methods [(Principal Component Analysis (PCA) and Hierarchical Cluster Analysis (HCA)]. In all cases (gun and ammunition discrimination), good separation among different samples in the score plots and dendrograms was achieved. (C) 2012 Elsevier B.V. All rights reserved.
Resumo:
In this thesis some multivariate spectroscopic methods for the analysis of solutions are proposed. Spectroscopy and multivariate data analysis form a powerful combination for obtaining both quantitative and qualitative information and it is shown how spectroscopic techniques in combination with chemometric data evaluation can be used to obtain rapid, simple and efficient analytical methods. These spectroscopic methods consisting of spectroscopic analysis, a high level of automation and chemometric data evaluation can lead to analytical methods with a high analytical capacity, and for these methods, the term high-capacity analysis (HCA) is suggested. It is further shown how chemometric evaluation of the multivariate data in chromatographic analyses decreases the need for baseline separation. The thesis is based on six papers and the chemometric tools used are experimental design, principal component analysis (PCA), soft independent modelling of class analogy (SIMCA), partial least squares regression (PLS) and parallel factor analysis (PARAFAC). The analytical techniques utilised are scanning ultraviolet-visible (UV-Vis) spectroscopy, diode array detection (DAD) used in non-column chromatographic diode array UV spectroscopy, high-performance liquid chromatography with diode array detection (HPLC-DAD) and fluorescence spectroscopy. The methods proposed are exemplified in the analysis of pharmaceutical solutions and serum proteins. In Paper I a method is proposed for the determination of the content and identity of the active compound in pharmaceutical solutions by means of UV-Vis spectroscopy, orthogonal signal correction and multivariate calibration with PLS and SIMCA classification. Paper II proposes a new method for the rapid determination of pharmaceutical solutions by the use of non-column chromatographic diode array UV spectroscopy, i.e. a conventional HPLC-DAD system without any chromatographic column connected. In Paper III an investigation is made of the ability of a control sample, of known content and identity to diagnose and correct errors in multivariate predictions something that together with use of multivariate residuals can make it possible to use the same calibration model over time. In Paper IV a method is proposed for simultaneous determination of serum proteins with fluorescence spectroscopy and multivariate calibration. Paper V proposes a method for the determination of chromatographic peak purity by means of PCA of HPLC-DAD data. In Paper VI PARAFAC is applied for the decomposition of DAD data of some partially separated peaks into the pure chromatographic, spectral and concentration profiles.
Resumo:
The present PhD thesis was focused on the development and application of chemical methodology (Py-GC-MS) and data-processing method by multivariate data analysis (chemometrics). The chromatographic and mass spectrometric data obtained with this technique are particularly suitable to be interpreted by chemometric methods such as PCA (Principal Component Analysis) as regards data exploration and SIMCA (Soft Independent Models of Class Analogy) for the classification. As a first approach, some issues related to the field of cultural heritage were discussed with a particular attention to the differentiation of binders used in pictorial field. A marker of egg tempera the phosphoric acid esterified, a pyrolysis product of lecithin, was determined using HMDS (hexamethyldisilazane) rather than the TMAH (tetramethylammonium hydroxide) as a derivatizing reagent. The validity of analytical pyrolysis as tool to characterize and classify different types of bacteria was verified. The FAMEs chromatographic profiles represent an important tool for the bacterial identification. Because of the complexity of the chromatograms, it was possible to characterize the bacteria only according to their genus, while the differentiation at the species level has been achieved by means of chemometric analysis. To perform this study, normalized areas peaks relevant to fatty acids were taken into account. Chemometric methods were applied to experimental datasets. The obtained results demonstrate the effectiveness of analytical pyrolysis and chemometric analysis for the rapid characterization of bacterial species. Application to a samples of bacterial (Pseudomonas Mendocina), fungal (Pleorotus ostreatus) and mixed- biofilms was also performed. A comparison with the chromatographic profiles established the possibility to: • Differentiate the bacterial and fungal biofilms according to the (FAMEs) profile. • Characterize the fungal biofilm by means the typical pattern of pyrolytic fragments derived from saccharides present in the cell wall. • Individuate the markers of bacterial and fungal biofilm in the same mixed-biofilm sample.
Resumo:
The combination of advanced ultraperformance liquid chromatography coupled with mass spectrometry, chemometrics, and genetically modified mice provide an attractive raft of technologies with which to examine the metabolism of xenobiotics. Here, a reexamination of the metabolism of the food mutagen PhIP (2-amino-1-methyl-6-phenylimidazo[4,5-b]pyridine), the suspect carcinogen areca alkaloids (arecoline, arecaidine, and arecoline 1-oxide), the hormone supplement melatonin, and the metabolism of the experimental cancer therapeutic agent aminoflavone is presented. In all cases, the metabolic maps of the xenobiotics were considerably enlarged, providing new insights into their toxicology. The inclusion of transgenic mice permitted unequivocal attribution of individual and often novel metabolic pathways to particular enzymes. Last, a future perspective for xenobiotic metabolomics is discussed and its impact on the metabolome is described. The studies reviewed here are not specific to the mouse and can be adapted to study xenobiotic metabolism in any animal species, including humans. The view through the metabolometer is unique and visualizes a metabolic space that contains both established and unknown metabolites of a xenobiotic, thereby enhancing knowledge of their modes of toxic action.
Resumo:
The advances in computational biology have made simultaneous monitoring of thousands of features possible. The high throughput technologies not only bring about a much richer information context in which to study various aspects of gene functions but they also present challenge of analyzing data with large number of covariates and few samples. As an integral part of machine learning, classification of samples into two or more categories is almost always of interest to scientists. In this paper, we address the question of classification in this setting by extending partial least squares (PLS), a popular dimension reduction tool in chemometrics, in the context of generalized linear regression based on a previous approach, Iteratively ReWeighted Partial Least Squares, i.e. IRWPLS (Marx, 1996). We compare our results with two-stage PLS (Nguyen and Rocke, 2002A; Nguyen and Rocke, 2002B) and other classifiers. We show that by phrasing the problem in a generalized linear model setting and by applying bias correction to the likelihood to avoid (quasi)separation, we often get lower classification error rates.
Resumo:
Independent Components Analysis is a Blind Source Separation method that aims to find the pure source signals mixed together in unknown proportions in the observed signals under study. It does this by searching for factors which are mutually statistically independent. It can thus be classified among the latent-variable based methods. Like other methods based on latent variables, a careful investigation has to be carried out to find out which factors are significant and which are not. Therefore, it is important to dispose of a validation procedure to decide on the optimal number of independent components to include in the final model. This can be made complicated by the fact that two consecutive models may differ in the order and signs of similarly-indexed ICs. As well, the structure of the extracted sources can change as a function of the number of factors calculated. Two methods for determining the optimal number of ICs are proposed in this article and applied to simulated and real datasets to demonstrate their performance.
Resumo:
A combined chemometrics-metabolomics approach [excitation–emission matrix (EEM) fluorescence spectroscopy, nuclear magnetic resonance (NMR) and high performance liquid chromatography–mass spectrometry (HPLC–MS)] was used to analyse the rhizodeposition of the tritrophic system: tomato, the plant-parasitic nematode Meloidogyne javanica and the nematode-egg parasitic fungus Pochonia chlamydosporia. Exudates from M. javanica roots were sampled at root penetration (early) and gall development (late). EMM indicated that late root exudates from M. javanica treatments contained more aromatic amino acid compounds than the rest (control, P. chlamydosporia or P. chlamydosporia and M. javanica). 1H NMR showed that organic acids (acetate, lactate, malate, succinate and formic acid) and one unassigned aromatic compound (peak no. 22) were the most relevant metabolites in root exudates. Robust principal component analysis (PCA) grouped early exudates for nematode (PC1) or fungus presence (PC3). PCA found (PC1, 73.31 %) increased acetate and reduced lactate and an unassigned peak no. 22 characteristic of M. javanica root exudates resulting from nematode invasion and feeding. An increase of peak no. 22 (PC3, 4.82 %) characteristic of P. chlamydosporia exudates could be a plant “primer” defence. In late ones in PC3 (8.73 %) the presence of the nematode grouped the samples. HPLC–MS determined rhizosphere fingerprints of 16 (early) and 25 (late exudates) m/z signals, respectively. Late signals were exclusive from M. javanica exudates confirming EEM and 1H NMR results. A 235 m/z signal reduced in M. javanica root exudates (early and late) could be a repressed plant defense. This metabolomic approach and other rhizosphere -omics studies could help to improve plant growth and reduce nematode damage sustainably.
Resumo:
Consumers expect organic, free-range and corn-fed chicken to be nutritionally wholesome and have premium flavour characters. Interrelationships between flavour, fatty acids and antioxidants of retailed breasts were explored using simple correlations and chemometrics. Saturated fatty acid C16:0, and n-6 polyunsaturated C20:4 and C22:4 contents were correlated with lipid oxidation products (thiobarbituric acid reactive substances) and in partial least-squares regression (PLS1) with 32 high-resonance gas chromatography (flame ionization) flavour components (r2>0.90), and also linked (r2>0.80) to antioxidants (-tocopherol, glutathione and catalase). A further 10 high-resonance gas chromatography nitrogen phosphorus detector flavour components were correlated (r 2>0.85) with C18:3(n-3) content. Chicken character was correlated with C18:3(n-3), and C18:3(n-6) inversely with oily, off-flavour and lipid oxidation. Sweet, fruity and oily aromas were linked in PLS1 with 13 specific fatty acids (r2>0.6), and bland taste with total summed (six) fatty acid fractions (r2>0.81). Specific antioxidants were correlated with sweet, fruity and chicken aromas, and -tocopherol inversely with lipid oxidation. PLS2 confirmed relationships between fatty acid composition, antioxidants and the subsets of 32 and 10 flavour components. Clear relationships were thus observed between lipid and antioxidant compositions and flavour in chicken breast meat.
Resumo:
Objective In this study, we have used a chemometrics-based method to correlate key liposomal adjuvant attributes with in-vivo immune responses based on multivariate analysis. Methods The liposomal adjuvant composed of the cationic lipid dimethyldioctadecylammonium bromide (DDA) and trehalose 6,6-dibehenate (TDB) was modified with 1,2-distearoyl-sn-glycero-3-phosphocholine at a range of mol% ratios, and the main liposomal characteristics (liposome size and zeta potential) was measured along with their immunological performance as an adjuvant for the novel, postexposure fusion tuberculosis vaccine, Ag85B-ESAT-6-Rv2660c (H56 vaccine). Partial least square regression analysis was applied to correlate and cluster liposomal adjuvants particle characteristics with in-vivo derived immunological performances (IgG, IgG1, IgG2b, spleen proliferation, IL-2, IL-5, IL-6, IL-10, IFN-γ). Key findings While a range of factors varied in the formulations, decreasing the 1,2-distearoyl-sn-glycero-3-phosphocholine content (and subsequent zeta potential) together built the strongest variables in the model. Enhanced DDA and TDB content (and subsequent zeta potential) stimulated a response skewed towards a cell mediated immunity, with the model identifying correlations with IFN-γ, IL-2 and IL-6. Conclusion This study demonstrates the application of chemometrics-based correlations and clustering, which can inform liposomal adjuvant design.
Resumo:
In this work, it was developed and validated methodologies that were based on the use of Infrared Spectroscopy Mid (MIR) combined with multivariate calibration Square Partial Least (PLS) to quantify adulterants such as soybean oil and residual soybean oil in methyl and ethyl palm biodiesels in the concentration range from 0.25 to 30.00 (%), as well as to determine methyl and ethyl palm biodiesel content in their binary mixtures with diesel in the concentration range from 0.25 to 30.00 (%). The prediction results showed that PLS models constructed are satisfactory. Errors Mean Square Forecast (RMSEP) of adulteration and content determination showed values of 0.2260 (%), with mean error (EM) with values below 1.93 (%). The models also showed a strong correlation between actual and predicted values, staying above 0.99974. No systematic errors were observed, in accordance to ASTM E1655- 05. Thus the built PLS models, may be a promising alternative in the quality control of this fuel for possible adulterations or to content determination.
Resumo:
Biodiesel is a renewable fuel derived from vegetable oils or animal fats, which can be a total or partial substitute for diesel. Since 2005, this fuel was introduced in the Brazilian energy matrix through Law 11.097 that determines the percentage of biodiesel added to diesel oil as well as monitoring the insertion of this fuel in market. The National Agency of Petroleum, Natural Gas and Biofuels (ANP) establish the obligation of adding 7% (v/v) of biodiesel to diesel commercialized in the country, making crucial the analytical control of this content. Therefore, in this study were developed and validated methodologies based on the use of Mid Infrared Spectroscopy (MIR) and Multivariate Calibration by Partial Least Squares (PLS) to quantify the methyl and ethyl biodiesels content of cotton and jatropha in binary blends with diesel at concentration range from 1.00 to 30.00% (v/v), since this is the range specified in standard ABNT NBR 15568. The biodiesels were produced from two routes, using ethanol or methanol, and evaluated according to the parameters: oxidative stability, water content, kinematic viscosity and density, presenting results according to ANP Resolution No. 45/2014. The built PLS models were validated on the basis of ASTM E1655-05 for Infrared Spectroscopy and Multivariate Calibration and ABNT NBR 15568, with satisfactory results due to RMSEP (Root Mean Square Error of Prediction) values below 0.08% (<0.1%), correlation coefficients (R) above 0.9997 and the absence of systematic error (bias). Therefore, the methodologies developed can be a promising alternative in the quality control of this fuel.
Resumo:
Abstract Honey is a high value food commodity with recognized nutraceutical properties. A primary driver of the value of honey is its floral origin. The feasibility of applying multivariate data analysis to various chemical parameters for the discrimination of honeys was explored. This approach was applied to four authentic honeys with different floral origins (rata, kamahi, clover and manuka) obtained from producers in New Zealand. Results from elemental profiling, stable isotope analysis, metabolomics (UPLC-QToF MS), and NIR, FT-IR, and Raman spectroscopic fingerprinting were analyzed. Orthogonal partial least square discriminant analysis (OPLS-DA) was used to determine which technique or combination of techniques provided the best classification and prediction abilities. Good prediction values were achieved using metabolite data (for all four honeys, Q2 = 0.52; for manuka and clover, Q2 = 0.76) and the trace element/isotopic data (for manuka and clover, Q2 = 0.65), while the other chemical parameters showed promise when combined (for manuka and clover, Q2 = 0.43).
Resumo:
Motivated by environmental protection concerns, monitoring the flue gas of thermal power plant is now often mandatory due to the need to ensure that emission levels stay within safe limits. Optical based gas sensing systems are increasingly employed for this purpose, with regression techniques used to relate gas optical absorption spectra to the concentrations of specific gas components of interest (NOx, SO2 etc.). Accurately predicting gas concentrations from absorption spectra remains a challenging problem due to the presence of nonlinearities in the relationships and the high-dimensional and correlated nature of the spectral data. This article proposes a generalized fuzzy linguistic model (GFLM) to address this challenge. The GFLM is made up of a series of “If-Then” fuzzy rules. The absorption spectra are input variables in the rule antecedent. The rule consequent is a general nonlinear polynomial function of the absorption spectra. Model parameters are estimated using least squares and gradient descent optimization algorithms. The performance of GFLM is compared with other traditional prediction models, such as partial least squares, support vector machines, multilayer perceptron neural networks and radial basis function networks, for two real flue gas spectral datasets: one from a coal-fired power plant and one from a gas-fired power plant. The experimental results show that the generalized fuzzy linguistic model has good predictive ability, and is competitive with alternative approaches, while having the added advantage of providing an interpretable model.
Resumo:
Motivated by environmental protection concerns, monitoring the flue gas of thermal power plant is now often mandatory due to the need to ensure that emission levels stay within safe limits. Optical based gas sensing systems are increasingly employed for this purpose, with regression techniques used to relate gas optical absorption spectra to the concentrations of specific gas components of interest (NOx, SO2 etc.). Accurately predicting gas concentrations from absorption spectra remains a challenging problem due to the presence of nonlinearities in the relationships and the high-dimensional and correlated nature of the spectral data. This article proposes a generalized fuzzy linguistic model (GFLM) to address this challenge. The GFLM is made up of a series of “If-Then” fuzzy rules. The absorption spectra are input variables in the rule antecedent. The rule consequent is a general nonlinear polynomial function of the absorption spectra. Model parameters are estimated using least squares and gradient descent optimization algorithms. The performance of GFLM is compared with other traditional prediction models, such as partial least squares, support vector machines, multilayer perceptron neural networks and radial basis function networks, for two real flue gas spectral datasets: one from a coal-fired power plant and one from a gas-fired power plant. The experimental results show that the generalized fuzzy linguistic model has good predictive ability, and is competitive with alternative approaches, while having the added advantage of providing an interpretable model.