23 resultados para Least-squares technique
Resumo:
The pattern classification is one of the machine learning subareas that has the most outstanding. Among the various approaches to solve pattern classification problems, the Support Vector Machines (SVM) receive great emphasis, due to its ease of use and good generalization performance. The Least Squares formulation of SVM (LS-SVM) finds the solution by solving a set of linear equations instead of quadratic programming implemented in SVM. The LS-SVMs provide some free parameters that have to be correctly chosen to achieve satisfactory results in a given task. Despite the LS-SVMs having high performance, lots of tools have been developed to improve them, mainly the development of new classifying methods and the employment of ensembles, in other words, a combination of several classifiers. In this work, our proposal is to use an ensemble and a Genetic Algorithm (GA), search algorithm based on the evolution of species, to enhance the LSSVM classification. In the construction of this ensemble, we use a random selection of attributes of the original problem, which it splits the original problem into smaller ones where each classifier will act. So, we apply a genetic algorithm to find effective values of the LS-SVM parameters and also to find a weight vector, measuring the importance of each machine in the final classification. Finally, the final classification is obtained by a linear combination of the decision values of the LS-SVMs with the weight vector. We used several classification problems, taken as benchmarks to evaluate the performance of the algorithm and compared the results with other classifiers
Resumo:
Natural gas, although basically composed by light hydrocarbons, also presents contaminant gases in its composition, such as CO2 (carbon dioxide) and H2S (hydrogen sulfide). The H2S, which commonly occurs in oil and gas exploration and production activities, causes damages in oil and natural gas pipelines. Consequently, the removal of hydrogen sulfide gas will result in an important reduction in operating costs. Also, it is essential to consider the better quality of the oil to be processed in the refinery, thus resulting in benefits in economic, environmental and social areas. All this facts demonstrate the need for the development and improvement in hydrogen sulfide scavengers. Currently, the oil industry uses several processes for hydrogen sulfide removal from natural gas. However, these processes produce amine derivatives which can cause damage in distillation towers, can cause clogging of pipelines by formation of insoluble precipitates, and also produce residues with great environmental impact. Therefore, it is of great importance the obtaining of a stable system, in inorganic or organic reaction media, able to remove hydrogen sulfide without formation of by-products that can affect the quality and cost of natural gas processing, transport, and distribution steps. Seeking the study, evaluation and modeling of mass transfer and kinetics of hydrogen removal, in this study it was used an absorption column packed with Raschig rings, where the natural gas, with H2S as contaminant, passed through an aqueous solution of inorganic compounds as stagnant liquid, being this contaminant gas absorbed by the liquid phase. This absorption column was coupled with a H2S detection system, with interface with a computer. The data and the model equations were solved by the least squares method, modified by Levemberg-Marquardt. In this study, in addition to the water, it were used the following solutions: sodium hydroxide, potassium permanganate, ferric chloride, copper sulfate, zinc chloride, potassium chromate, and manganese sulfate, all at low concentrations (»10 ppm). These solutions were used looking for the evaluation of the interference between absorption physical and chemical parameters, or even to get a better mass transfer coefficient, as in mixing reactors and absorption columns operating in counterflow. In this context, the evaluation of H2S removal arises as a valuable procedure for the treatment of natural gas and destination of process by-products. The study of the obtained absorption curves makes possible to determine the mass transfer predominant stage in the involved processes, the mass transfer volumetric coefficients, and the equilibrium concentrations. It was also performed a kinetic study. The obtained results showed that the H2S removal kinetics is greater for NaOH. Considering that the study was performed at low concentrations of chemical reagents, it was possible to check the effect of secondary reactions in the other chemicals, especially in the case of KMnO4, which shows that your by-product, MnO2, acts in H2S absorption process. In addition, CuSO4 and FeCl3 also demonstrated to have good efficiency in H2S removal
Resumo:
Waste stabilization ponds (WSP) have been widely used for sewage treatment in hot climate regions because they are economic and environmentally sustainable. In the present study a WSP complex comprising a primary facultative pond (PFP) followed by two maturation ponds (MP-1 and MP-2) was studied, in the city of Natal-RN. The main objective was to study the bio-degradability of organic matter through the determination of the kinetic constant k throughout the system. The work was carried out in two phases. In the first, the variability in BOD, COD and TOC concentrations and an analysis of the relations between these parameters, in the influent raw sewage, pond effluents and in specific areas inside the ponds was studied. In the second stage, the decay rate for organic matter (k) was determined throughout the system based on BOD tests on the influent sewage, pond effluents and water column samples taken from fixed locations within the ponds, using the mathematical methods of Least Squares and the Thomas equation. Subsequently k was estimated as a function of a hydrodynamic model determined from the dispersion number (d), using empirical methods and a Partial Hydrodynamic Evaluation (PHE), obtained from tracer studies in a section of the primary facultative pond corresponding to 10% of its total length. The concentrations of biodegradable organic matter, measured as BOD and COD, gradually reduced through the series of ponds, giving overall removal efficiencies of 71.95% for BOD and of 52.45% for COD. Determining the values for k, in the influent and effluent samples of the ponds using the mathematical method of Least Squares, gave the following values respectively: primary facultative pond (0,23 day-1 and 0,09 day-1), maturation 1 (0,04 day-1 and 0,03 day-1) and maturation 2 (0,03 day-1 and 0,08 day-1). When using the Thomas method, the values of k in the influents and effluents of the ponds were: primary facultative pond (0,17 day-1 and 0,07 day-1), maturation 1 (0,02 day-1 and 0,01 day-1) and maturation 2 (0,01 day-1 and 0,02 day-1). From the Partial Hydrodynamic Evaluation, in the first section of the facultative pond corresponding to 10% of its total length, it can be concluded from the dispersion number obtained of d = 0.04, that the hydraulic regime is one of dispersed flow with a kinetic constant value of 0.20 day-1
Resumo:
In this work calibration models were constructed to determine the content of total lipids and moisture in powdered milk samples. For this, used the near-infrared spectroscopy by diffuse reflectance, combined with multivariate calibration. Initially, the spectral data were submitted to correction of multiplicative light scattering (MSC) and Savitzsky-Golay smoothing. Then, the samples were divided into subgroups by application of hierarchical clustering analysis of the classes (HCA) and Ward Linkage criterion. Thus, it became possible to build regression models by partial least squares (PLS) that allowed the calibration and prediction of the content total lipid and moisture, based on the values obtained by the reference methods of Soxhlet and 105 ° C, respectively . Therefore, conclude that the NIR had a good performance for the quantification of samples of powdered milk, mainly by minimizing the analysis time, not destruction of the samples and not waste. Prediction models for determination of total lipids correlated (R) of 0.9955, RMSEP of 0.8952, therefore the average error between the Soxhlet and NIR was ± 0.70%, while the model prediction to content moisture correlated (R) of 0.9184, RMSEP, 0.3778 and error of ± 0.76%
Resumo:
In this work, the quantitative analysis of glucose, triglycerides and cholesterol (total and HDL) in both rat and human blood plasma was performed without any kind of pretreatment of samples, by using near infrared spectroscopy (NIR) combined with multivariate methods. For this purpose, different techniques and algorithms used to pre-process data, to select variables and to build multivariate regression models were compared between each other, such as partial least squares regression (PLS), non linear regression by artificial neural networks, interval partial least squares regression (iPLS), genetic algorithm (GA), successive projections algorithm (SPA), amongst others. Related to the determinations of rat blood plasma samples, the variables selection algorithms showed satisfactory results both for the correlation coefficients (R²) and for the values of root mean square error of prediction (RMSEP) for the three analytes, especially for triglycerides and cholesterol-HDL. The RMSEP values for glucose, triglycerides and cholesterol-HDL obtained through the best PLS model were 6.08, 16.07 e 2.03 mg dL-1, respectively. In the other case, for the determinations in human blood plasma, the predictions obtained by the PLS models provided unsatisfactory results with non linear tendency and presence of bias. Then, the ANN regression was applied as an alternative to PLS, considering its ability of modeling data from non linear systems. The root mean square error of monitoring (RMSEM) for glucose, triglycerides and total cholesterol, for the best ANN models, were 13.20, 10.31 e 12.35 mg dL-1, respectively. Statistical tests (F and t) suggest that NIR spectroscopy combined with multivariate regression methods (PLS and ANN) are capable to quantify the analytes (glucose, triglycerides and cholesterol) even when they are present in highly complex biological fluids, such as blood plasma
Resumo:
The aim of this study was to evaluate the potential of near-infrared reflectance spectroscopy (NIRS) as a rapid and non-destructive method to determine the soluble solid content (SSC), pH and titratable acidity of intact plums. Samples of plum with a total solids content ranging from 5.7 to 15%, pH from 2.72 to 3.84 and titratable acidity from 0.88 a 3.6% were collected from supermarkets in Natal-Brazil, and NIR spectra were acquired in the 714 2500 nm range. A comparison of several multivariate calibration techniques with respect to several pre-processing data and variable selection algorithms, such as interval Partial Least Squares (iPLS), genetic algorithm (GA), successive projections algorithm (SPA) and ordered predictors selection (OPS), was performed. Validation models for SSC, pH and titratable acidity had a coefficient of correlation (R) of 0.95 0.90 and 0.80, as well as a root mean square error of prediction (RMSEP) of 0.45ºBrix, 0.07 and 0.40%, respectively. From these results, it can be concluded that NIR spectroscopy can be used as a non-destructive alternative for measuring the SSC, pH and titratable acidity in plums
Resumo:
Aiming to consumer s safety the presence of pathogenic contaminants in foods must be monitored because they are responsible for foodborne outbreaks that depending on the level of contamination can ultimately cause the death of those who consume them. In industry is necessary that this identification be fast and profitable. This study shows the utility and application of near-infrared (NIR) transflectance spectroscopy as an alternative method for the identification and classification of Escherichia coli and Salmonella Enteritidis in commercial fruit pulp (pineapple). Principal Component Analysis (PCA), Independent Modeling of Class Analogy (SIMCA) and Discriminant Analysis Partial Least Squares (PLS-DA) were used in the analysis. It was not possible to obtain total separation between samples using PCA and SIMCA. The PLS-DA showed good performance in prediction capacity reaching 87.5% for E. coli and 88.3% for S. Enteritides, respectively. The best models were obtained for the PLS-DA with second derivative spectra treated with a sensitivity and specificity of 0.87 and 0.83, respectively. These results suggest that the NIR spectroscopy and PLS-DA can be used to discriminate and detect bacteria in the fruit pulp
Resumo:
The Tucunduba Dam, is west of Fortaleza, Ceará State. The seismic monitoring of the area, with an analogical station and seven digital stations, had beginning on June 11, 1997. The digital stations, operated from June to November 1997. The data collected in the period of digital monitoring was analyzed for determination of hypocenters, focal mechanisms, and shear-wave anisotropy analysis. For determination of hypocenters, it was possible to find an active zone of nearly 1 km in length, with depth between 4.5 and 5.2 km. A 60AZ/88SE fault plane was determined using the least-squares method and hypocenters of a selected set of 16 earthquakes recorded. Focal mechanisms were determined, in the composite fault plane solution, a strike-slip fault, trending nearly E-W, was found. Single fault plane solutions were obteined to some earthquakes presented mean values of 65 (azimuth), and 80 (dip). Shear-wave anisotropy was found in the data. Polarization directions and travel time delays, between S spliting waves, were determined. It was not possible to obtain any conclusion on the cause of the observed anisotropy. It is not clear if there is correlation between seismicity and mapped faults in the area, although the directions obtained starting from the hipocentros and focal mechanism are they are consistent with directions, observed in the area, photo, topographic and fractures directions observed in the area