28 results for partial least-squares regression
Abstract:
Near infrared spectroscopy (NIRS) can be used for the on-line, non-invasive assessment of fruit for eating quality attributes such as total soluble solids (TSS). The robustness of multivariate calibration models, based on NIRS in a partial transmittance optical geometry, for the assessment of TSS of intact rockmelons (Cucumis melo) was assessed. The mesocarp TSS was highest around the fruit equator and increased towards the seed cavity. Inner mesocarp TSS levels decreased towards both the proximal and distal ends of the fruit, but more so towards the proximal end. The equatorial region of the fruit was chosen as representative of the fruit for near infrared assessment of TSS. The spectral window for model development was optimised at 695-1045 nm, and the data pre-treatment procedure was optimised to second-derivative absorbance without scatter correction. The 'global' modified partial least squares (MPLS) regression modelling procedure of WINISI (ver. 1.04) was found to be superior with respect to root mean squared error of prediction (RMSEP) and bias for model predictions of TSS across seasons, compared with the 'local' MPLS regression procedure. Updating of the model with samples selected randomly from the independent validation population demonstrated improvement in both RMSEP and bias with addition of approximately 15 samples.
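The abstract above compares models by RMSEP and bias. As a minimal sketch of how these two statistics are usually computed from a validation set (the sign convention, predicted minus reference, and the sample values are assumptions for illustration, not data from the study):

```python
import math

def rmsep_and_bias(reference, predicted):
    """Return (RMSEP, bias) for paired reference and predicted values.

    Assumption: bias is the mean of (predicted - reference); some
    software uses the opposite sign.
    """
    if len(reference) != len(predicted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    residuals = [p - r for r, p in zip(reference, predicted)]
    n = len(residuals)
    bias = sum(residuals) / n
    rmsep = math.sqrt(sum(e * e for e in residuals) / n)
    return rmsep, bias

# Hypothetical TSS values (% soluble solids), for illustration only.
reference = [8.0, 9.5, 10.0, 11.5]
predicted = [8.5, 9.0, 10.5, 12.0]
rmsep, bias = rmsep_and_bias(reference, predicted)
print(f"RMSEP = {rmsep:.3f}, bias = {bias:.3f}")
```

A model can have a low RMSEP on one season and still show a large bias on another, which is why the two are reported separately.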
Abstract:
A commercial non-specific gas sensor array system was evaluated in terms of its capability to monitor the odour abatement performance of a biofiltration system developed for treating emissions from a commercial piggery building. The biofiltration system was a modular system comprising an inlet ducting system, humidifier and closed-bed biofilter. It also included a gravimetric moisture monitoring and water application system for precise control of moisture content of an organic woodchip medium. Principal component analysis (PCA) of the sensor array measurements indicated that the biofilter outlet air was significantly different to both inlet air of the system and post-humidifier air. Data pre-processing techniques including normalising and outlier handling were applied to improve the odour discrimination performance of the non-specific gas sensor array. To develop an odour quantification model from the responses of the non-specific sensor array, PCA regression, artificial neural network (ANN) and partial least squares (PLS) modelling techniques were applied. The correlation coefficient (r²) values of the PCA, ANN, and PLS models were 0.44, 0.62 and 0.79, respectively.
Abstract:
Hydrogen cyanide (HCN) is a toxic chemical that can potentially cause mild to severe reactions in animals when grazing forage sorghum. Developing technologies to monitor the level of HCN in the growing crop would benefit graziers, so that they can move cattle into paddocks with acceptable levels of HCN. In this study, we developed near-infrared spectroscopy (NIRS) calibrations to estimate HCN in forage sorghum and hay. The full spectral NIRS range (400-2498 nm) was used as well as specific spectral ranges within the full spectral range, i.e., visible (400-750 nm), shortwave (800-1100 nm) and near-infrared (NIR) (1100-2498 nm). Using the full spectrum approach and partial least-squares (PLS), the calibration produced a coefficient of determination (R²) = 0.838 and standard error of cross-validation (SECV) = 0.040%, while the validation set had an R² = 0.824 with a low standard error of prediction (SEP = 0.047%). When using a multiple linear regression (MLR) approach, the best model (NIR spectra) produced an R² = 0.847 and standard error of calibration (SEC) = 0.050% and an R² = 0.829 and SEP = 0.057% for the validation set. The MLR models built from these spectral regions all used nine wavelengths. Two specific wavelengths, 2034 and 2458 nm, were of interest, with the former associated with C=O carbonyl stretch and the latter associated with C-N-C stretching. The most accurate PLS and MLR models produced a ratio of standard error of prediction to standard deviation of 3.4 and 3.0, respectively, suggesting that the calibrations could be used for screening breeding material. The results indicated that it should be feasible to develop calibrations using PLS or MLR models for a number of users, including breeding programs to screen for genotypes with low HCN, as well as graziers to monitor crop status to help with grazing efficiency.
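The ratio of 3.4 and 3.0 quoted above is the statistic usually abbreviated RPD. Despite the wording, it is conventionally computed as the standard deviation of the reference values divided by the SEP, so larger values indicate a more useful calibration. A minimal sketch, using hypothetical HCN values because the abstract does not report the standard deviation:

```python
import statistics

def rpd(reference_values, sep):
    """Ratio of the reference-value standard deviation to the SEP.

    Assumption: the sample standard deviation (n - 1 denominator) is used.
    """
    if sep <= 0:
        raise ValueError("sep must be positive")
    return statistics.stdev(reference_values) / sep

# Hypothetical reference HCN values (%), not taken from the study.
hcn_reference = [0.02, 0.06, 0.10, 0.14, 0.18]
example_rpd = rpd(hcn_reference, sep=0.02)
print(f"RPD = {example_rpd:.2f}")
```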
Abstract:
New algorithms for the continuous wavelet transform are developed that are easy to apply, each consisting of a single-pass finite impulse response (FIR) filter, and several times faster than the fastest existing algorithms. The single-pass filter, named WT-FIR-1, is made possible by applying constraint equations to least-squares estimation of filter coefficients, which removes the need for separate low-pass and high-pass filters. Non-dyadic two-scale relations are developed and it is shown that filters based on them can work more efficiently than dyadic ones. Example applications to the Mexican hat wavelet are presented.
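To illustrate what a single-pass FIR evaluation of a wavelet transform looks like, the sketch below samples the Mexican hat wavelet at one scale and applies it as a plain FIR filter. This is ordinary direct convolution, not the WT-FIR-1 algorithm of the abstract: the constrained least-squares design of the coefficients is not reproduced, and the function names, the truncation half-width and the 1/sqrt(scale) normalisation are assumptions made for illustration.

```python
import math

def mexican_hat(t):
    """Unnormalised Mexican hat wavelet, (1 - t^2) * exp(-t^2 / 2)."""
    return (1.0 - t * t) * math.exp(-t * t / 2.0)

def mexican_hat_fir(scale, half_width=5.0):
    """Sample the wavelet at integer lags to obtain FIR taps for one scale.

    half_width (in units of the scale) sets where the wavelet is truncated.
    """
    n = int(math.ceil(half_width * scale))
    return [mexican_hat(k / scale) / math.sqrt(scale) for k in range(-n, n + 1)]

def fir_filter(signal, taps):
    """Same-length convolution of signal with taps, zero-padded at the edges."""
    half = len(taps) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, c in enumerate(taps):
            k = i + half - j  # index of the input sample paired with tap j
            if 0 <= k < len(signal):
                acc += c * signal[k]
        out.append(acc)
    return out

# Wavelet coefficients of a unit impulse at a single scale.
taps = mexican_hat_fir(scale=4.0)
impulse = [0.0] * 101
impulse[50] = 1.0
coeffs = fir_filter(impulse, taps)
```

Each scale costs one pass over the signal with this approach; the speed gains described in the abstract come from the filter design, which this sketch does not attempt.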
Abstract:
Near infrared (NIR) spectroscopy was investigated as a potential rapid method of estimating fish age from whole otoliths of Saddletail snapper (Lutjanus malabaricus). Whole otoliths from 209 Saddletail snapper were extracted and the NIR spectral characteristics were acquired over a spectral range of 800–2780 nm. Partial least-squares models (PLS) were developed from the diffuse reflectance spectra and reference-validated age estimates (based on traditional sectioned otolith increments) to predict age for independent otolith samples. Predictive models developed for a specific season and geographical location performed poorly against a different season and geographical location. However, overall PLS regression statistics for predicting a combined population incorporating both geographic location and season variables were: coefficient of determination (R²) = 0.94, root mean square error of prediction (RMSEP) = 1.54 for age estimation, indicating that Saddletail age could be predicted within 1.5 increment counts. This level of accuracy suggests the method warrants further development for Saddletail snapper and may have potential for other fish species. A rapid method of fish age estimation could have the potential to reduce greatly both costs of time and materials in the assessment and management of commercial fisheries.
Abstract:
Recent decreases in costs, and improvements in performance, of silicon array detectors open a range of potential applications of relevance to plant physiologists, associated with spectral analysis in the visible and short-wave near infra-red (far-red) spectrum. The performance characteristics of three commercially available ‘miniature’ spectrometers based on silicon array detectors operating in the 650–1050-nm spectral region (MMS1 from Zeiss, S2000 from Ocean Optics, and FICS from Oriel, operated with a Larry detector) were compared with respect to the application of non-invasive prediction of sugar content of fruit using near infra-red spectroscopy (NIRS). The FICS–Larry gave the best wavelength resolution; however, the narrow slit and small pixel size of the charge-coupled device detector resulted in a very low sensitivity, and this instrumentation was not considered further. Wavelength resolution was poor with the MMS1 relative to the S2000 (e.g. full width at half maximum of the 912 nm Hg peak, 13 and 2 nm for the MMS1 and S2000, respectively), but the large pixel height of the array used in the MMS1 gave it sensitivity comparable to the S2000. The signal-to-signal standard error ratio of spectra was greater by an order of magnitude with the MMS1, relative to the S2000, at both near saturation and low light levels. Calibrations were developed using reflectance spectra of filter paper soaked in a range of concentrations (0–20% w/v) of sucrose, using a modified partial least squares procedure. Calibrations developed with the MMS1 were superior to those developed using the S2000 (e.g. coefficient of correlation of 0.90 and 0.62, and standard error of cross-validation of 1.9 and 5.4%, respectively), indicating the importance of high signal to noise ratio over wavelength resolution to calibration accuracy. The design of a bench top assembly using the MMS1 for the non-invasive assessment of mesocarp sugar content of (intact) melon fruit is reported in terms of light source and angle between detector and light source, and optimisation of math treatment (derivative condition and smoothing function).
Abstract:
The fatty acid composition of groundnuts (Arachis hypogaea L.), commonly known as peanuts, is an important consideration when a new variety is being released. The composition impacts on nutrition and, importantly, shelf-life of peanut products. To select for suitable breeding material, it was necessary to develop a rapid, non-destructive and cost-efficient method. Near infrared spectroscopy was chosen as that methodology. Calibrations were developed for two major fatty-acid components, oleic and linoleic acids, and two minor components, palmitic and stearic acids, as well as total oil content. Partial least squares models indicated a high level of precision with a squared multiple correlation coefficient of greater than 0.90 for each constituent. Standard errors of prediction for oleic, linoleic, palmitic, stearic acids and total oil content were 6.4%, 4.5%, 0.8%, 0.9% and 1.3%, respectively. The results demonstrated that reasonable calibrations could be developed to predict oil composition and content of peanuts for a breeding programme.
Abstract:
Identification of major contributors to odour annoyance in areas with multiple emission sources is necessary to address and resolve odour disputes. In an effort to develop an appropriate tool for this task, odour samples were collected on-site at a piggery and an abattoir (the major odour sources in the area) and at surrounding off-site areas, then analysed using a commercial non-specific chemical sensor array to develop an odour fingerprint database. The odour fingerprint database was analysed using two pattern recognition algorithms: partial least squares-discriminant analysis (PLS-DA) and a Kohonen self-organising map (KSOM). The KSOM model could identify odour samples sourced from the piggery shed 15, piggery pond 8, piggery pond 9, abattoir, motel and others with mean percentage values of 77.5, 65.0, 90.2, 75.7, 44.8 and 64.6%, respectively.
Abstract:
The use of near infrared (NIR) hyperspectral imaging and hyperspectral image analysis for distinguishing between hard, intermediate and soft maize kernels from inbred lines was evaluated. NIR hyperspectral images of two sets (12 and 24 kernels) of whole maize kernels were acquired using a Spectral Dimensions MatrixNIR camera with a spectral range of 960-1662 nm and a sisuChema SWIR (short wave infrared) hyperspectral pushbroom imaging system with a spectral range of 1000-2498 nm. Exploratory principal component analysis (PCA) was used on absorbance images to remove background, bad pixels and shading. On the cleaned images, PCA could be used effectively to find histological classes including glassy (hard) and floury (soft) endosperm. PCA illustrated a distinct difference between glassy and floury endosperm along principal component (PC) three on the MatrixNIR and PC two on the sisuChema with two distinguishable clusters. Subsequently, partial least squares discriminant analysis (PLS-DA) was applied to build a classification model. The PLS-DA model from the MatrixNIR image (12 kernels) resulted in a root mean square error of prediction (RMSEP) value of 0.18. This was repeated on the MatrixNIR image of the 24 kernels, which resulted in an RMSEP of 0.18. The sisuChema image yielded an RMSEP value of 0.29. The reproducible results obtained with the different data sets indicate that the method proposed in this paper has a real potential for future classification uses.
Abstract:
Fourier Transform (FT)-near infra-red spectroscopy (NIRS) was investigated as a non-invasive technique for estimating percentage (%) dry matter of whole intact 'Hass' avocado fruit. Partial least squares (PLS) calibration models were developed from the diffuse reflectance spectra to predict % dry matter, taking into account effects of seasonal variation. It was found that seasonal variability has a significant effect on model predictive performance for dry matter in avocados. The robustness of the calibration model, which in general limits the application of the technique, was found to increase across years (seasons) when more seasonal variability was included in the calibration set. The validation R² (Rv²) and RMSEP for the single season prediction models predicting on an independent season ranged from 0.09 to 0.61 and 2.63 to 5.00, respectively, while for the two season models predicting on the third independent season, they ranged from 0.34 to 0.79 and 2.18 to 2.50, respectively. The bias for single season models predicting an independent season was as high as 4.429 but <= 1.417 for the two season combined models. The calibration model encompassing fruit from three consecutive years yielded predictive statistics of Rv² = 0.89, RMSEP = 1.43% dry matter with a bias of -0.021 in the range 16.1-39.7% dry matter for the validation population encompassing independent fruit from the three consecutive years. Relevant spectral information for all calibration models was obtained primarily from oil, carbohydrate and water absorbance bands clustered in the 890-980, 1005-1050, 1330-1380 and 1700-1790 nm regions. These results indicate the potential of FT-NIRS, in diffuse reflectance mode, to non-invasively predict the % dry matter of whole 'Hass' avocado fruit and the importance of the development of a calibration model that incorporates seasonal variation. Crown Copyright (c) 2012 Published by Elsevier B.V. All rights reserved.
Abstract:
The study examined the potential of Near Infrared Reflectance (NIR) spectroscopy for field diagnosis of hybrids between Corymbia (formerly Eucalyptus) species. NIR profiles were generated by scanning foliage from a total of 383 hybrid and 533 parental seedlings grown in a common garden, and partial least squares discriminant analysis was used to test three-way model power to assign individuals to their appropriate taxon: either a parental or F1 hybrid class. Using the optimised conditions, fresh foliage from eight-month-old seedlings and a handheld NIR instrument (950–1800 nm), the mean assignment rates for the three hybrid groups ranged from 76% to 90%. Hybrid–parent contrasts of NIR spectra deviated more than parent–parent contrasts. The F1 taxon assignment rates were usually higher than those for parents, at 100% and 72%, respectively. Hybrid resolution was even greater for 2nd generation backcross hybrids. Similar to studies of morphology, taxon assignments tended to be more accurate for hybrid groups in which the parental taxa were more divergent. The practical application of this technique for hybrid diagnosis of seedlings in the nursery will require careful attention to control environmental factors because seedling age and storage effects influenced the ability of NIR to identify hybrids. The technique may also necessitate the generation of comparable reference populations, although exclusion approaches to analysis may circumvent the need for reference populations. The application of NIR in field diagnosis will be further complicated by the need to generate global models across environments, but such models have been obtained for reliable prediction of chemistries in other situations.
Abstract:
BACKGROUND: In order to rapidly and efficiently screen potential biofuel feedstock candidates for quintessential traits, robust high-throughput analytical techniques must be developed and honed. The traditional methods of measuring lignin syringyl/guaiacyl (S/G) ratio can be laborious, involve hazardous reagents, and/or be destructive. Vibrational spectroscopy can furnish high-throughput instrumentation without the limitations of the traditional techniques. Spectral data from mid-infrared, near-infrared, and Raman spectroscopies were combined with S/G ratios, obtained using pyrolysis molecular beam mass spectrometry, from 245 different eucalypt and Acacia trees across 17 species. Iterations of spectral processing allowed the assembly of robust predictive models using partial least squares (PLS). RESULTS: The PLS models were rigorously evaluated using three different randomly generated calibration and validation sets for each spectral processing approach. Root mean standard errors of prediction for validation sets were lowest for models comprised of Raman (0.13 to 0.16) and mid-infrared (0.13 to 0.15) spectral data, while near-infrared spectroscopy led to more erroneous predictions (0.18 to 0.21). Correlation coefficients (r) for the validation sets followed a similar pattern: Raman (0.89 to 0.91), mid-infrared (0.87 to 0.91), and near-infrared (0.79 to 0.82). These statistics signify that Raman and mid-infrared spectroscopy led to the most accurate predictions of S/G ratio in a diverse consortium of feedstocks. CONCLUSION: Eucalypts present an attractive option for biofuel and biochemical production. Given the assortment of over 900 different species of Eucalyptus and Corymbia, in addition to various species of Acacia, it is necessary to isolate those possessing ideal biofuel traits. This research has demonstrated the validity of vibrational spectroscopy to efficiently partition different potential biofuel feedstocks according to lignin S/G ratio, significantly reducing experiment and analysis time and expense while providing non-destructive, accurate, global, predictive models encompassing a diverse array of feedstocks.
Abstract:
Spot measurements of methane emission rate (n = 18 700) by 24 Angus steers fed mixed rations from GrowSafe feeders were made over 3- to 6-min periods by a GreenFeed emission monitoring (GEM) unit. The data were analysed to estimate daily methane production (DMP; g/day) and derived methane yield (MY; g/kg dry matter intake (DMI)). A one-compartment dose model of spot emission rate v. time since the preceding meal was compared with the models of Wood (1967) and Dijkstra et al. (1997) and the average of spot measures. Fitted values for DMP were calculated from the area under the curves. Two methods of relating methane and feed intakes were then studied: the classical calculation of MY as DMP/DMI (kg/day); and a novel method of estimating DMP from time and size of preceding meals using either the data for only the two meals preceding a spot measurement, or all meals for 3 days prior. Two approaches were also used to estimate DMP from spot measurements: fitting of splines on a 'per-animal per-day' basis and an alternate approach of modelling DMP after each feed event by least squares (using Solver), summing (for each animal) the contributions from each feed event by best-fitting a one-compartment model. Time since the preceding meal was of limited value in estimating DMP. Even when the meal sizes and time intervals between a spot measurement and all feeding events in the previous 72 h were assessed, only 16.9% of the variance in spot emission rate measured by GEM was explained by this feeding information. While using the preceding meal alone gave a biased estimate (an underestimate) of DMP, allowing for a longer feed history removed this bias. A power analysis taking into account the sources of variation in DMP indicated that to obtain an estimate of DMP with a 95% confidence interval within 5% of the observed 64-day mean of spot measures would require 40 animals measured over 45 days (two spot measurements per day) or 30 animals measured over 55 days. These numbers suggest that spot measurements could be made in association with feed efficiency tests made over 70 days. Spot measurements of enteric emissions can be used to define DMP, but the numbers of animals and samples required are larger than those needed when day-long measures are made.