879 resultados para predictive regression


Relevância:

20.00% 20.00%

Publicador:

Resumo:

Extended Hildebrand Solubility Approach (EHSA) developed by Martin et al. was applied to evaluate the solubility of ketoprofen (KTP) in ethanol + water cosolvent mixtures at 298.15 K. Calculated values of molar volume and solubility parameter for KTP were used. A good predictive capacity of EHSA was found by using a regular polynomial model in order five to correlate the W interaction parameter and the solubility parameters of cosolvent mixtures (δmix). Nevertheless, the deviations obtained in the estimated solubilities with respect to the experimental solubilities were on the same order like those obtained directly by means of an empiric regression of the logarithmic experimental solubilities as a function of δmix values.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The goal of this work is the development and validation of an analytical method for fast quantification of sibutramine in pharmaceutical formulations, using diffuse reflectance infrared spectroscopy and partial least square regression. The multivariate model was elaborated from 22 mixtures containing sibutramine and excipients (lactose, microcrystalline cellulose, colloidal silicon dioxide and magnesium stearate) and using fragmented (750-1150/ 1350-1500/ 1850-1950/ 2600-2900 cm-1) and smoothing spectral data. Using 10 latent variables, excellent predictive capacity were observed in the calibration (n=20, RMSEC=0.004, R= 0.999) and external validation (n=5, RMSEC= 9.36, R=0.999) phases. In the analysis of synthetic mixtures the precision (SD=3,47%) was compatible with the rules of the Agencia Nacional de Vigilância Sanitária (ANVISA-Brazil). In the analysis of commercial drugs good agreement was observed between spectroscopic and chromatographic methods.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Acetylation was performed to reduce the polarity of wood and increase its compatibility with polymer matrices for the production of composites. These reactions were performed first as a function of acetic acid and anhydride concentration in a mixture catalyzed by sulfuric acid. A concentration of 50%/50% (v/v) of acetic acid and anhydride was found to produced the highest conversion rate between the functional groups. After these reactions, the kinetics were investigated by varying times and temperatures using a 3² factorial design, and showed time was the most relevant parameter in determining the conversion of hydroxyl into carbonyl groups.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Analytical curves are normally obtained from discrete data by least squares regression. The least squares regression of data involving significant error in both x and y values should not be implemented by ordinary least squares (OLS). In this work, the use of orthogonal distance regression (ODR) is discussed as an alternative approach in order to take into account the error in the x variable. Four examples are presented to illustrate deviation between the results from both regression methods. The examples studied show that, in some situations, ODR coefficients must substitute for those of OLS, and, in other situations, the difference is not significant.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

We propose an analytical method based on fourier transform infrared-attenuated total reflectance (FTIR-ATR) spectroscopy to detect the adulteration of petrodiesel and petrodiesel/palm biodiesel blends with African crude palm oil. The infrared spectral fingerprints from the sample analysis were used to perform principal components analysis (PCA) and to construct a prediction model using partial least squares (PLS) regression. The PCA results separated the samples into three groups, allowing identification of those subjected to adulteration with palm oil. The obtained model shows a good predictive capacity for determining the concentration of palm oil in petrodiesel/biodiesel blends. Advantages of the proposed method include cost-effectiveness and speed; it is also environmentally friendly.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Perinteisesti ajoneuvojen markkinointikampanjoissa kohderyhmät muodostetaan yksinkertaisella kriteeristöllä koskien henkilön tai hänen ajoneuvonsa ominaisuuksia. Ennustavan analytiikan avulla voidaan tuottaa kohderyhmänmuodostukseen teknisesti kompleksisia mutta kuitenkin helppokäyttöisiä menetelmiä. Tässä työssä on sovellettu luokittelu- ja regressiomenetelmiä uuden auton ostajien joukkoon. Tämän työn menetelmiksi on rajattu tukivektorikone sekä Coxin regressiomalli. Coxin regression avulla on tutkittu elinaika-analyysien soveltuvuutta ostotapahtuman tapahtumahetken mallintamiseen. Luokittelu tukivektorikonetta käyttäen onnistuu tehtävässään noin 72% tapauksissa. Tukivektoriregressiolla mallinnetun hankintahetken virheen keskiarvo on noin neljä kuukautta. Työn tulosten perusteella myös elinaika-analyysin käyttö ostotapahtuman tapahtumahetken mallintamiseen on menetelmänä käyttökelpoinen.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The focus of this dissertation is the motivational influences on transfer in higher education and professional training contexts. To estimate these motivational influences, the dissertation includes seven individual studies that are structured in two parts. Part I, Dimensions, aims at identifying the dimensionality of motivation to transfer and its structural relations with training-related antecedents and outcomes. Part II, Boundary Conditions, aims at testing the predictive validity of motivation theories used in contemporary training research under different study conditions. Data in this dissertation was gathered from multi-item questionnaires, which were analyzed differently in Part I and Part II. Studies in Part I employed exploratory and confirmatory factor analysis, structural equation modeling, partial least squares (PLS) path modeling, and mediation analysis. Studies in Part II used artifact distribution meta-analysis, (nested) subgroup analysis, and weighted least squares (WLS) multiple regression. Results demonstrate that motivation to transfer can be conceptualized as a three-dimensional construct, including autonomous motivation to transfer, controlled motivation to transfer, and intention to transfer, given a theoretical framework informed by expectancy theory, self-determination theory, and the theory of planned behavior. Results also demonstrate that a range of boundary conditions moderates motivational influences on transfer. To test the predictive validity of expectancy theory, social cognitive theory, and the theory of goal orientations under different study settings, a total of 17 boundary conditions were meta-analyzed, including age; assessment criterion; assessment source; attendance policy; collaboration among trainees; computer support; instruction; instrument used to measure motivation; level of education; publication type; social training context; SS/SMC bias; study setting; survey modality; type of knowledge being trained; use of a control group; and work context. Together, the findings cumulated in this thesis support the basic premise that motivation is centrally important for transfer, but that motivational influences need to be understood from a more differentiated perspective than commonly found in the literature, in order to account for several dimensions and boundary conditions. The results of this dissertation across the seven individual studies are reflected in terms of their implications for theory development and their significance for training evaluation and the design of training environments. Limitations and directions to take in future research are discussed.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Machine learning provides tools for automated construction of predictive models in data intensive areas of engineering and science. The family of regularized kernel methods have in the recent years become one of the mainstream approaches to machine learning, due to a number of advantages the methods share. The approach provides theoretically well-founded solutions to the problems of under- and overfitting, allows learning from structured data, and has been empirically demonstrated to yield high predictive performance on a wide range of application domains. Historically, the problems of classification and regression have gained the majority of attention in the field. In this thesis we focus on another type of learning problem, that of learning to rank. In learning to rank, the aim is from a set of past observations to learn a ranking function that can order new objects according to how well they match some underlying criterion of goodness. As an important special case of the setting, we can recover the bipartite ranking problem, corresponding to maximizing the area under the ROC curve (AUC) in binary classification. Ranking applications appear in a large variety of settings, examples encountered in this thesis include document retrieval in web search, recommender systems, information extraction and automated parsing of natural language. We consider the pairwise approach to learning to rank, where ranking models are learned by minimizing the expected probability of ranking any two randomly drawn test examples incorrectly. The development of computationally efficient kernel methods, based on this approach, has in the past proven to be challenging. Moreover, it is not clear what techniques for estimating the predictive performance of learned models are the most reliable in the ranking setting, and how the techniques can be implemented efficiently. The contributions of this thesis are as follows. First, we develop RankRLS, a computationally efficient kernel method for learning to rank, that is based on minimizing a regularized pairwise least-squares loss. In addition to training methods, we introduce a variety of algorithms for tasks such as model selection, multi-output learning, and cross-validation, based on computational shortcuts from matrix algebra. Second, we improve the fastest known training method for the linear version of the RankSVM algorithm, which is one of the most well established methods for learning to rank. Third, we study the combination of the empirical kernel map and reduced set approximation, which allows the large-scale training of kernel machines using linear solvers, and propose computationally efficient solutions to cross-validation when using the approach. Next, we explore the problem of reliable cross-validation when using AUC as a performance criterion, through an extensive simulation study. We demonstrate that the proposed leave-pair-out cross-validation approach leads to more reliable performance estimation than commonly used alternative approaches. Finally, we present a case study on applying machine learning to information extraction from biomedical literature, which combines several of the approaches considered in the thesis. The thesis is divided into two parts. Part I provides the background for the research work and summarizes the most central results, Part II consists of the five original research articles that are the main contribution of this thesis.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The increasing demand of consumer markets for the welfare of birds in poultry house has motivated many scientific researches to monitor and classify the welfare according to the production environment. Given the complexity between the birds and the environment of the aviary, the correct interpretation of the conduct becomes an important way to estimate the welfare of these birds. This study obtained multiple logistic regression models with capacity of estimating the welfare of broiler breeders in relation to the environment of the aviaries and behaviors expressed by the birds. In the experiment, were observed several behaviors expressed by breeders housed in a climatic chamber under controlled temperatures and three different ammonia concentrations from the air monitored daily. From the analysis of the data it was obtained two logistic regression models, of which the first model uses a value of ammonia concentration measured by unit and the second model uses a binary value to classify the ammonia concentration that is assigned by a person through his olfactory perception. The analysis showed that both models classified the broiler breeder's welfare successfully.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The broiler rectal temperature (t rectal) is one of the most important physiological responses to classify the animal thermal comfort. Therefore, the aim of this study was to adjust regression models in order to predict the rectal temperature (t rectal) of broiler chickens under different thermal conditions based on age (A) and a meteorological variable (air temperature - t air) or a thermal comfort index (temperature and humidity index -THI or black globe humidity index - BGHI) or a physical quantity enthalpy (H). In addition, through the inversion of these models and the expected t rectal intervals for each age, the comfort limits of t air, THI, BGHI and H for the chicks in the heating phase were determined, aiding in the validation of the equations and the preliminary limits for H. The experimental data used to adjust the mathematical models were collected in two commercial poultry farms, with Cobb chicks, from 1 to 14 days of age. It was possible to predict the t rectal of conditions from the expected t rectal and determine the lower and superior comfort thresholds of broilers satisfactorily by applying the four models adjusted; as well as to invert the models for prediction of the environmental H for the chicks first 14 days of life.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Singular Value Decomposition (SVD), Principal Component Analysis (PCA) and Multiple Linear Regression (MLR) are some of the mathematical pre- liminaries that are discussed prior to explaining PLS and PCR models. Both PLS and PCR are applied to real spectral data and their di erences and similarities are discussed in this thesis. The challenge lies in establishing the optimum number of components to be included in either of the models but this has been overcome by using various diagnostic tools suggested in this thesis. Correspondence analysis (CA) and PLS were applied to ecological data. The idea of CA was to correlate the macrophytes species and lakes. The di erences between PLS model for ecological data and PLS for spectral data are noted and explained in this thesis. i

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The aim of this thesis is to examine stock returns as predictive indicators to macroeconomic variables in BRIC-countries, Japan, USA and euro area. We picked to represent macroeconomic variables interest rate, inflation, currency, gross domestic product and industrial production. For the beginning we examined previous studies and theory about the subject. Hypothesis of this thesis were derived from the previous studies. To conduct the results we used tests such augmented Dickey-Fuller, Engle-Granger co-integration, Granger causality and lagged distribution model. According to results stock returns do predictive macroeconomic variables and specifically changes of GDP and industrial production. There were few evidences of stock returns predictive power of inflation.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Purpose To compare the predictive capability of HPV and Pap smear tests for screening pre-cancerous lesions of the cervix over a three-year follow-up, in a population of users of the Brazilian National Health System (SUS). Methods This is a retrospective cohort study of 2,032 women with satisfactory results for Pap smear and HPV tests using second-generation hybrid capture,made in a previous study. We followed them for 36 months with data obtained from medical records, the Cervix Cancer Information System (SISCOLO), and the Mortality Information System (SIM). The outcome was a histological diagnosis of cervical intraepithelial neoplasia grade 2 or more advanced lesions (CIN2ş). We constructed progression curves of the baseline test results for the period, using the Kaplan-Meier method, and estimated sensitivity, specificity, positive and negative predictive value, and positive and negative likelihood ratios for each test. Results A total of 1,440 women had at least one test during follow-up. Progression curves of the baseline test results indicated differences in capability to detect CIN2ş (p < 0.001) with significantly greater capability when both tests were abnormal, followed by only a positive HPV test. The HPV test was more sensitive than the Pap smear (88.7% and 73.6%, respectively; p < 0.05) and had a better negative likelihood ratio (0.13 and 0.30, respectively). Specificity and positive likelihood ratio of the tests were similar. Conclusions These findings corroborate the importance of HPV test as a primary cervical cancer screening.