49 resultados para PLS-DA
Resumo:
Visible and near infrared (vis-NIR) spectroscopy is widely used to detect soil properties. The objective of this study is to evaluate the combined effect of moisture content (MC) and the modeling algorithm on prediction of soil organic carbon (SOC) and pH. Partial least squares (PLS) and the Artificial neural network (ANN) for modeling of SOC and pH at different MC levels were compared in terms of efficiency in prediction of regression. A total of 270 soil samples were used. Before spectral measurement, dry soil samples were weighed to determine the amount of water to be added by weight to achieve the specified gravimetric MC levels of 5, 10, 15, 20, and 25 %. A fiber-optic vis-NIR spectrophotometer (350-2500 nm) was used to measure spectra of soil samples in the diffuse reflectance mode. Spectra preprocessing and PLS regression were carried using Unscrambler® software. Statistica® software was used for ANN modeling. The best prediction result for SOC was obtained using the ANN (RMSEP = 0.82 % and RPD = 4.23) for soil samples with 25 % MC. The best prediction results for pH were obtained with PLS for dry soil samples (RMSEP = 0.65 % and RPD = 1.68) and soil samples with 10 % MC (RMSEP = 0.61 % and RPD = 1.71). Whereas the ANN showed better performance for SOC prediction at all MC levels, PLS showed better predictive accuracy of pH at all MC levels except for 25 % MC. Therefore, based on the data set used in the current study, the ANN is recommended for the analyses of SOC at all MC levels, whereas PLS is recommended for the analysis of pH at MC levels below 20 %.
Resumo:
O objetivo deste trabalho foi avaliar a eficiência do método de regressão via componentes independentes (ICR) na estimação de valores genéticos genômicos e dos efeitos de marcadores SNP para características de carcaça de uma população F2 de suínos (Piau x linhagem comercial). Os métodos foram avaliados por meio da concordância entre os valores genéticos preditos e os fenótipos corrigidos, observados por validação cruzada, e também foram comparados com outros métodos geralmente utilizados para os mesmos propósitos, tais como RR-BLUP, PCR e PLS. Os métodos ICR e PCR apresentam resultados similares, mas o método ICR apresenta maiores valores de acurácia.
Resumo:
The determination of zirconium-hafnium mixtures is one of the most critical problem of the analytical chemistry, on account of the close similarity of their chemical properties. The spectrophotometric determination proposed by Yagodin et al. show not many practical applications due to the significant spectral interference on the 200-220 nm region. In this work we propound the use of a multivariate calibration method called partial least squares ( PLS ) for colorimetric determination of these mixtures. By using PLS and 16 calibration mixtures we obtained a model which permits determination of zirconium and hafnium with accuracy of about 1-2% and 10-20%, respectively. Using conventional univariate calibration the inaccuracy of the determination is about 10-25% for zirconium and above 57% for hafnium.
Resumo:
The aim of this work is to present a tutorial on Multivariate Calibration, a tool which is nowadays necessary in basically most laboratories but very often misused. The basic concepts of preprocessing, principal component analysis (PCA), principal component regression (PCR) and partial least squares (PLS) are given. The two basic steps on any calibration procedure: model building and validation are fully discussed. The concepts of cross validation (to determine the number of factors to be used in the model), leverage and studentized residuals (to detect outliers) for the validation step are given. The whole calibration procedure is illustrated using spectra recorded for ternary mixtures of 2,4,6 trinitrophenolate, 2,4 dinitrophenolate and 2,5 dinitrophenolate followed by the concentration prediction of these three chemical species during a diffusion experiment through a hydrophobic liquid membrane. MATLAB software is used for numerical calculations. Most of the commands for the analysis are provided in order to allow a non-specialist to follow step by step the analysis.
Resumo:
One of the major interests in soil analysis is the evaluation of its chemical, physical and biological parameters, which are indicators of soil quality (the most important is the organic matter). Besides there is a great interest in the study of humic substances and on the assessment of pollutants, such as pesticides and heavy metals, in soils. Chemometrics is a powerful tool to deal with these problems and can help soil researchers to extract much more information from their data. In spite of this, the presence of these kinds of strategies in the literature has obtained projection only recently. The utilization of chemometric methods in soil analysis is evaluated in this article. The applications will be divided in four parts (with emphasis in the first two): (i) descriptive and exploratory methods based on Principal Component Analysis (PCA); (ii) multivariate calibration methods (MLR, PCR and PLS); (iii) methods such as Evolving Factor Analysis and SIMPLISMA; and (iv) artificial intelligence methods, such as Artificial Neural Networks.
Resumo:
Genetic algorithm was used for variable selection in simultaneous determination of mixtures of glucose, maltose and fructose by mid infrared spectroscopy. Different models, using partial least squares (PLS) and multiple linear regression (MLR) with and without data pre-processing, were used. Based on the results obtained, it was verified that a simpler model (multiple linear regression with variable selection by genetic algorithm) produces results comparable to more complex methods (partial least squares). The relative errors obtained for the best model was around 3% for the sugar determination, which is acceptable for this kind of determination.
Resumo:
A model based on chemical structure was developed for the accurate prediction of octanol/water partition coefficient (K OW) of polychlorinated biphenyls (PCBs), which are molecules of environmental interest. Partial least squares (PLS) was used to build the regression model. Topological indices were used as molecular descriptors. Variable selection was performed by Hierarchical Cluster Analysis (HCA). In the modeling process, the experimental K OW measured for 30 PCBs by thin-layer chromatography - retention time (TLC-RT) has been used. The developed model (Q² = 0,990 and r² = 0,994) was used to estimate the log K OW values for the 179 PCB congeners whose K OW data have not yet been measured by TLC-RT method. The results showed that topological indices can be very useful to predict the K OW.
Resumo:
Dilutions of methylmetacrylate ranging between 1 and 50 ppm were obtained from a stock solution of 1 ml of monomer in 100 ml of deionised water, and were analyzed by an absorption spectrophotometer in the UV-visible. Absorbance values were used to develop a calibration model based on the PLS, with the aim to determine new sample concentrations. The number of latent variables used was 6, with the standard errors of calibration and prediction found to be 0,048 ml/100 ml and 0,058 ml/100 ml. The calibration model was successfully used to calculate the concentration of monomer released in water, where complete dentures were kept for one hour after polymerization.
Resumo:
A method is presented for the choice of spectral regions when absorption measurements are coupled to chemometric tools to perform quantitative analyses. The method is based on the spectral distribution of the relative standard deviation of concentration (s c/c). It has been applied to the development of PLS-FTNIR calibration models for the determination of density and MON of gasoline, and ethanol content and density of ethanol fuel. The new method was also compared with the correlation (R²) method and has proved to generate PLS calibration models that present better accuracy and precision than those based on R².
Resumo:
The application of analytical procedures based on multivariate calibration models has been limited in several areas due to requirements of validation and certification of the model. Procedures for validation are presented based on the determination of figures of merit, such as precision (mean, repeatability, intermediate), accuracy, sensitivity, analytical sensitivity, selectivity, signal-to-noise ratio and confidence intervals for PLS models. An example is discussed of a model for polymorphic purity control of carbamazepine by NIR diffuse reflectance spectroscopy. The results show that multivariate calibration models can be validated to fulfill the requirements imposed by industry and standardization agencies.
Resumo:
A simple method was proposed for determination of paracetamol and ibuprofen in tablets, based on UV measurements and partial least squares. The procedure was performed at pH 10.5, in the concentration ranges 3.00-15.00 µg ml-1 (paracetamol) and 2.40-12.00 µg ml-1 (ibuprofen). The model was able to predict paracetamol and ibuprofen in synthetic mixtures with root mean squares errors of prediction of 0.12 and 0.17 µg ml-1, respectively. Figures of merit (sensitivity, limit of detection and precision) were also estimated. The results achieved for the determination of these drugs in pharmaceutical formulations were in agreement with label claims and verified by HPLC.
Resumo:
In this work, the artificial neural networks (ANN) and partial least squares (PLS) regression were applied to UV spectral data for quantitative determination of thiamin hydrochloride (VB1), riboflavin phosphate (VB2), pyridoxine hydrochloride (VB6) and nicotinamide (VPP) in pharmaceutical samples. For calibration purposes, commercial samples in 0.2 mol L-1 acetate buffer (pH 4.0) were employed as standards. The concentration ranges used in the calibration step were: 0.1 - 7.5 mg L-1 for VB1, 0.1 - 3.0 mg L-1 for VB2, 0.1 - 3.0 mg L-1 for VB6 and 0.4 - 30.0 mg L-1 for VPP. From the results it is possible to verify that both methods can be successfully applied for these determinations. The similar error values were obtained by using neural network or PLS methods. The proposed methodology is simple, rapid and can be easily used in quality control laboratories.
Resumo:
Diffuse reflectance near-infrared (DR-NIR) spectroscopy associated with partial least squares (PLS) multivariate calibration is proposed for a direct, non-destructive, determination of total nitrogen in wheat leaves. The procedure was developed for an Analytical Instrumental Analysis course, carried out at the Institute of Chemistry of the State University of Campinas. The DR-NIR results are in good agreement with those obtained by the Kjeldhal standard procedure, with a relative error of less than ± 3% and the method may be used for teaching purposes as well as for routine analysis.
Resumo:
Two spectrophotometric methods are described for the simultaneous determination of ezetimibe (EZE) and simvastatin (SIM) in pharmaceutical preparations. The obtained data was evaluated by using two different chemometric techniques, Principal Component Regression (PCR) and Partial Least-Squares (PLS-1). In these techniques, the concentration data matrix was prepared by using the mixtures containing these drugs in methanol. The absorbance data matrix corresponding to the concentration data matrix was obtained by the measurements of absorbances in the range of 240 - 300 nm in the intervals with Δλ = 1 nm at 61 wavelengths in their zero order spectra, then, calibration or regression was obtained by using the absorbance data matrix and concentration data matrix for the prediction of the unknown concentrations of EZE and SIM in their mixture. The procedure did not require any separation step. The linear range was found to be 5 - 20 µg mL-1 for EZE and SIM in both methods. The accuracy and precision of the methods were assessed. These methods were successfully applied to a pharmaceutical preparation, tablet; and the results were compared with each other.
Resumo:
In this work an analytical methodology for the determination of relevant physicochemical parameters of prato cheese is reported, using infrared spectroscopy (DRIFT) and partial least squares regression (PLS). Several multivariate models were developed, using different spectral regions and preprocessing routines. In general, good precision and accuracy was observed for all studied parameters (fat, protein, moisture, total solids, ashes and pH) with standard deviations comparable with those provided by the conventional methodologies. The implantation of this multivariate routine involves significant analytical advantages, including reduction of cost and time of analysis, minimization of human errors, and elimination of chemical residues.