Digital Library

915 results for Logistic regression model

Investigation of the factors influencing the survival of Bifidobacterium longum in model acidic solutions and fruit juices

Abstract:

The survival of Bifidobacterium longum NCIMB 8809 was studied during refrigerated storage for 6 weeks in model solutions, and the results were used to construct a mathematical model describing cell survival as a function of pH, citric acid, protein and dietary fibre. A Central Composite Design (CCD) was developed to study the influence of four factors at three levels, i.e., pH (3.2–4), citric acid (2–15 g/l), protein (0–10 g/l), and dietary fibre (0–8 g/l). In total, 31 experimental runs were carried out. Analysis of variance (ANOVA) of the regression model demonstrated that the model fitted the data well. From the regression coefficients it was deduced that all four factors had a statistically significant (P < 0.05) negative effect on the log decrease [log10 N(week 0) − log10 N(week 6)], with pH and citric acid being the most influential. Cell survival during storage was also investigated in various types of juices, including orange, grapefruit, blackcurrant, pineapple, pomegranate and strawberry. The highest cell survival (less than 0.4 log decrease) after 6 weeks of storage was observed in orange and pineapple, both of which had a pH of about 3.8. Although the pH of grapefruit and blackcurrant was similar (pH ∼3.2), the log decrease of the former was ∼0.5 log, whereas that of the latter was ∼0.7 log. One reason for this could be the fact that grapefruit contained a high amount of citric acid (15.3 g/l). The log decrease in pomegranate and strawberry juices was extremely high (∼8 logs). The mathematical model adequately predicted cell survival in orange, grapefruit, blackcurrant, and pineapple juices. However, it failed to predict cell survival in pomegranate and strawberry, most likely because of the very high levels of phenolic compounds in these two juices.

What determines the out-of-home placement of children in the USA?

Abstract:

Using NCANDS data of US child maltreatment reports for 2009, logistic regression, probit analysis, discriminant analysis and an artificial neural network are used to determine the factors which explain the decision to place a child in out-of-home care. As well as developing a new model for 2009, a previous study using 2005 data is replicated. While there are many small differences, the four estimation techniques give broadly the same results, demonstrating the robustness of the results. Similarly, apart from age and sexual abuse, the 2005 and 2009 results are roughly similar. For 2009, child characteristics (particularly child emotional problems) are more important than the nature of the abuse and the situation of the household; while caregiver characteristics are the least important. All these models have low explanatory power.

Ensemble projections for wine production in the Douro Valley of Portugal

Abstract:

Wine production is largely governed by atmospheric conditions, such as air temperature and precipitation, together with soil management and viticultural/enological practices. Therefore, anthropogenic climate change is likely to have important impacts on the winemaking sector worldwide. An important winemaking region is the Portuguese Douro Valley, which is known for its world-famous Port Wine. The identification of robust relationships between atmospheric factors and wine parameters is of great relevance for the region. A multivariate linear regression analysis of a long wine production series (1932–2010) reveals that high rainfall and cool temperatures during budburst, shoot and inflorescence development (February-March) and warm temperatures during flowering and berry development (May) are generally favourable to high production. The probabilities of occurrence of three production categories (low, normal and high) are also modelled using multinomial logistic regression. Results show that both statistical models are valuable tools for predicting the production in a given year with a lead time of 3–4 months prior to harvest. These statistical models are applied to an ensemble of 16 regional climate model experiments following the SRES A1B scenario to estimate possible future changes. Wine production is projected to increase by about 10% by the end of the 21st century, while the occurrence of high production years is expected to increase from 25% to over 60%. Nevertheless, further model development will be needed to include other aspects that may shape production in the future. In particular, rising heat stress and/or changes in ripening conditions could limit the projected production increase in future decades.
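To illustrate how a multinomial logistic regression turns predictors into probabilities for the three production categories, here is a minimal sketch. The linear scores used below are hypothetical and are not taken from the study; the function only shows the general softmax mapping, with the "low" category treated as the reference (score fixed at 0).

```python
import math

def category_probabilities(scores):
    """Map one linear score per category to probabilities (softmax).

    Subtracting the maximum score first avoids overflow in exp().
    """
    shift = max(scores)
    weights = [math.exp(s - shift) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical scores for (low, normal, high) in one year; "low" is the
# reference category, so its score is fixed at 0.
scores = [0.0, 0.4, -0.3]
probabilities = category_probabilities(scores)
for label, p in zip(("low", "normal", "high"), probabilities):
    print(f"{label}: {p:.3f}")
```

In a fitted model each score would be a linear combination of the climate predictors for that year; the category with the largest score receives the largest probability.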

Association of low adiponectin levels with the metabolic syndrome--the Chennai Urban Rural Epidemiology Study (CURES-4)

Abstract:

The aim of the study was to assess the relation of adiponectin levels with the metabolic syndrome in Asian Indians, a high-risk group for diabetes and premature coronary artery disease. The study was conducted on 100 (50 men and 50 women) type 2 diabetic subjects and 100 age and sex matched subjects with normal glucose tolerance selected from the Chennai Urban Rural Epidemiology Study, an ongoing population study in Chennai in southern India. Metabolic syndrome was defined using modified Adult Treatment Panel III (ATPIII) guidelines. Adiponectin values were significantly lower in diabetic subjects (men: 5.2 vs 8.3 µg/mL, P=.001; women: 7.6 vs 11.1 µg/mL, P<.001) and those with the metabolic syndrome (men: 5.0 vs 6.8 µg/mL, P=.01; women: 6.5 vs 9.9 µg/mL, P=.001) compared with those without. Linear regression analysis revealed adiponectin to be associated with body mass index (P<.05), waist circumference (P<.01), fasting plasma glucose (P=.001), glycated hemoglobin (P<.001), triglycerides (P<.001), high-density lipoprotein (HDL) cholesterol (P<.001), cholesterol/HDL ratio (P<.001), and insulin resistance measured by homeostasis assessment model (P<.001). Factor analysis identified 2 factors: factor 1, negatively loaded with adiponectin and HDL cholesterol and positively loaded with triglycerides, waist circumference, and insulin resistance measured by homeostasis assessment model; and factor 2, with a positive loading of waist circumference and systolic and diastolic blood pressure. Logistic regression analysis revealed adiponectin to be negatively associated with metabolic syndrome (odds ratio [OR], 0.365; P<.001) even after adjusting for age (OR, 0.344; P<.001), sex (OR, 0.293; P<.001), and body mass index (OR, 0.292; P<.001). Lower adiponectin levels are associated with the metabolic syndrome per se and several of its components, particularly diabetes, insulin resistance, and dyslipidemia in this urban south Indian population.
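As a reading aid, the reported odds ratio can be related to a logistic regression coefficient as follows. This is a generic sketch: the symbols $x$ (adiponectin, in whatever unit or scaling the authors used, which the abstract does not state), $\beta_0$ and $\beta_1$ are notation assumed here, not the paper's.

```latex
\[
\log\frac{P(\text{MS}=1 \mid x)}{1 - P(\text{MS}=1 \mid x)} = \beta_0 + \beta_1 x,
\qquad
\mathrm{OR} = e^{\beta_1}.
\]
\[
\mathrm{OR} = 0.365
\;\Longrightarrow\;
\beta_1 = \ln 0.365 \approx -1.008 .
\]
```

An odds ratio below 1 therefore corresponds to a negative coefficient: under this model, each one-unit increase in $x$ multiplies the odds of metabolic syndrome by about 0.365.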

Tensor regression based on linked multiway parameter analysis

Abstract:

Classical regression methods take vectors as covariates and estimate the corresponding vectors of regression parameters. When addressing regression problems on covariates of more complex form such as multi-dimensional arrays (i.e. tensors), traditional computational models can be severely compromised by ultrahigh dimensionality as well as complex structure. By exploiting the special structure of tensor covariates, the tensor regression model provides a promising solution to reduce the model’s dimensionality to a manageable level, thus leading to efficient estimation. Most of the existing tensor-based methods independently estimate each individual regression problem based on tensor decomposition which allows the simultaneous projections of an input tensor to more than one direction along each mode. As a matter of fact, multi-dimensional data are collected under the same or very similar conditions, so that data share some common latent components but can also have their own independent parameters for each regression task. Therefore, it is beneficial to analyse regression parameters among all the regressions in a linked way. In this paper, we propose a tensor regression model based on Tucker Decomposition, which identifies not only the common components of parameters across all the regression tasks, but also independent factors contributing to each particular regression task simultaneously. Under this paradigm, the number of independent parameters along each mode is constrained by a sparsity-preserving regulariser. Linked multiway parameter analysis and sparsity modeling further reduce the total number of parameters, with lower memory cost than their tensor-based counterparts. The effectiveness of the new method is demonstrated on real data sets.

Do personality traits predict ‘complaining’ consumers?

Abstract:

Although the effects of personality traits on complaining behaviour emerged in the early 1980s, there is limited research in the service industry. The purpose of this study is to examine whether consumer personality traits influence intentions to complain and whether product price and product types moderate the relationship between personality traits and intentions to complain in the retail industry. The research model is tested by logistic regression analysis on two groups of consumers who report passive and active complaining intentions. The study reveals that conscientious consumers who are open to new experiences tend to have higher intentions to complain. Being extroverted does not have any influence on complaining behaviour. Whilst price levels (low/high) and product types (grocery, clothing and electronics) improve the predictive ability of the complaining behaviour, the interaction effects relating to the three personality traits are statistically insignificant. Theoretical and managerial implications of the study findings are discussed.

Inequalities in dental services utilization among Brazilian low-income children: the role of individual determinants

Abstract:

Objectives: To assess the role of individual determinants in the inequalities of dental services utilization among low-income children living in the working area of Brazil's federal Primary Health Care program, which is called the Family Health Program (FHP), in a big city in Southern Brazil. Methods: A cross-sectional population-based study was performed. The sample included 350 children, ages 0 to 14 years, whose parents answered a questionnaire about their socioeconomic conditions, perceived needs, oral hygiene habits, and access to dental services. The data analysis was performed according to a conceptual framework based on Andersen's behavioral model of health services use. Multivariate logistic regression models were used to test hypotheses about covariates for never having had a dental visit. Results: Thirty-one percent of the surveyed children had never had a dental visit. In the bivariate analysis, a higher proportion of children who had never had a dental visit was found among the very young, those with inadequate oral hygiene habits, those without perceived need of dental care, and those whose family homes were under absent ownership. The mechanisms of social support proved to be important enabling factors: children attending schools/kindergartens and being regularly monitored by the FHP teams had higher odds of having gone to the dentist, even after adjusting for socioeconomic, demographic, and need variables. Conclusions: The conceptual framework has confirmed the presence of social and psychosocial inequalities in the utilization pattern of dental services for low-income children. The individual determinants seem to be important predictors of access.

Methods of Estimation in Multiple Linear Regression: Application to Clinical Data

Abstract:

In this article, we evaluate different parameter estimation strategies for a multiple linear regression model. The model parameters were estimated using data from a clinical trial whose aim was to verify whether the mechanical test of the maximum force property (EM-FM) is associated with femoral mass, femoral diameter, and the experimental group of ovariectomized female rats of the Rattus norvegicus albinus strain, Wistar variety. Three methodologies for estimating the model parameters are compared: the classical methodology, based on the method of least squares; the Bayesian methodology, based on Bayes' theorem; and the bootstrap method, based on resampling procedures.
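The contrast between least squares and bootstrap estimation can be sketched in a few lines. This is an illustrative sketch only: it uses a single predictor rather than the multiple regression of the article, the data points are invented, and the function names are hypothetical.

```python
import random

def ols_slope(pairs):
    """Least-squares slope of y on x for a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    return sxy / sxx

def bootstrap_slopes(pairs, n_boot=500, seed=0):
    """Re-estimate the slope on resamples drawn with replacement."""
    rng = random.Random(seed)
    slopes = []
    while len(slopes) < n_boot:
        sample = [rng.choice(pairs) for _ in pairs]
        if len({x for x, _ in sample}) < 2:
            continue  # slope is undefined when all x values coincide
        slopes.append(ols_slope(sample))
    return sorted(slopes)

# Invented data, roughly following y = 2x + 1 with small deviations.
pairs = [(1.0, 3.1), (2.0, 4.9), (3.0, 7.2), (4.0, 8.8), (5.0, 11.1), (6.0, 12.9)]
slopes = bootstrap_slopes(pairs)
lower = slopes[int(0.025 * len(slopes))]
upper = slopes[int(0.975 * len(slopes)) - 1]
print("least-squares slope:", ols_slope(pairs))
print("95% percentile bootstrap interval:", (lower, upper))
```

The least-squares estimate is a single number, whereas the bootstrap yields an empirical distribution of slopes from which a percentile interval is read off.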

Poly-bagging predictors for classification modelling for credit scoring

Abstract:

Credit scoring modelling is one of the leading formal tools for supporting the granting of credit. Its core objective is to generate a score by which potential clients can be ranked in order of their probability of default. A critical factor is whether a credit scoring model is accurate enough to classify a client correctly as a good or bad payer. In this context the concept of bootstrap aggregating (bagging) arises. The basic idea is to generate multiple classifiers by obtaining the predicted values from models fitted to several replicated datasets and then combining them into a single predictive classification in order to improve the classification accuracy. In this paper we propose a new bagging-type variant procedure, which we call poly-bagging, consisting of combining predictors over a succession of resamplings. The study is motivated by credit scoring modelling. The proposed poly-bagging procedure was applied to several artificial datasets and to a real credit-granting dataset, with up to three successions of resamplings. We observed better classification accuracy for the two-bagged and the three-bagged models for all considered setups. These results strongly indicate that the poly-bagging approach may improve modelling performance measures, while keeping a flexible and straightforward bagging-type structure that is easy to implement. (C) 2011 Elsevier Ltd. All rights reserved.
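The basic bagging idea described above (fit a classifier to each bootstrap replicate, then combine the predictions) can be sketched as follows. This shows ordinary bagging with a majority vote, not the authors' poly-bagging procedure; the one-feature threshold classifier, the toy data, and all function names are assumptions made for illustration.

```python
import random
from collections import Counter

def fit_stump(sample):
    """Fit a one-feature threshold rule: predict 1 when x > threshold."""
    best_threshold, best_errors = None, None
    for threshold, _ in sample:
        errors = sum((x > threshold) != bool(y) for x, y in sample)
        if best_errors is None or errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return best_threshold

def bagging_fit(data, n_models=25, seed=0):
    """Fit one stump per bootstrap replicate of the data."""
    rng = random.Random(seed)
    stumps = []
    for _ in range(n_models):
        sample = [rng.choice(data) for _ in data]  # resample with replacement
        stumps.append(fit_stump(sample))
    return stumps

def bagging_predict(stumps, x):
    """Combine the individual predictions by majority vote."""
    votes = Counter(int(x > threshold) for threshold in stumps)
    return votes.most_common(1)[0][0]

# Toy data: (score-like feature, label), where label 1 marks one payer class.
data = [(x, 0) for x in (1.0, 2.0, 3.0, 4.0)] + [(x, 1) for x in (6.0, 7.0, 8.0, 9.0)]
stumps = bagging_fit(data)
print(bagging_predict(stumps, 0.0), bagging_predict(stumps, 10.0))
```

An odd number of models is used so that the binary vote cannot tie. A real credit scoring application would replace the stump with a proper classifier such as logistic regression.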

Generalized log-gamma regression models with cure fraction

Abstract:

In this paper, the generalized log-gamma regression model is modified to allow the possibility that long-term survivors may be present in the data. This modification leads to a generalized log-gamma regression model with a cure rate, encompassing, as special cases, the log-exponential, log-Weibull and log-normal regression models with a cure rate typically used to model such data. The models attempt to simultaneously estimate the effects of explanatory variables on the timing acceleration/deceleration of a given event and the surviving fraction, that is, the proportion of the population for which the event never occurs. The normal curvatures of local influence are derived under some usual perturbation schemes and two martingale-type residuals are proposed to assess departures from the generalized log-gamma error assumption as well as to detect outlying observations. Finally, a data set from the medical area is analyzed.
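For orientation, one common way to write a model with a surviving fraction is the mixture form below. The notation ($\pi$ for the cured proportion, $S_0$ for the survival function of susceptible individuals) is assumed here, and the paper's exact parameterization may differ.

```latex
\[
S_{\text{pop}}(t) = \pi + (1 - \pi)\, S_0(t),
\qquad 0 \le \pi \le 1,
\qquad
\lim_{t \to \infty} S_{\text{pop}}(t) = \pi .
\]
```

Here $S_0(t)$ would be the survival function implied by the generalized log-gamma regression model, so the population survival curve levels off at $\pi$ instead of falling to zero.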

Hypotheses Testing on a Multivariate Null Intercept Errors-in-Variables Model

Abstract:

Considering the Wald, score, and likelihood ratio asymptotic test statistics, we analyze a multivariate null intercept errors-in-variables regression model, where the explanatory and the response variables are subject to measurement errors, and a possible structure of dependency between the measurements taken within the same individual is incorporated, representing a longitudinal structure. This model was proposed by Aoki et al. (2003b) and analyzed under the Bayesian approach. In this article, considering the classical approach, we analyze asymptotic test statistics and present a simulation study to compare the behavior of the three test statistics for different sample sizes, parameter values and nominal levels of the test. Also, closed form expressions for the score function and the Fisher information matrix are presented. We consider two real numerical illustrations, the odontological data set from Hadgu and Koch (1999), and a quality control data set.

Bayesian analysis for a skew extension of the multivariate null intercept measurement error model

Abstract:

The skew-normal distribution is a class of distributions that includes the normal distributions as a special case. In this paper, we explore the use of Markov Chain Monte Carlo (MCMC) methods to develop a Bayesian analysis in a multivariate, null intercept, measurement error model [R. Aoki, H. Bolfarine, J.A. Achcar, and D. Leao Pinto Jr, Bayesian analysis of a multivariate null intercept error-in-variables regression model, J. Biopharm. Stat. 13(4) (2003b), pp. 763-771] where the unobserved value of the covariate (latent variable) follows a skew-normal distribution. The results and methods are applied to a real dental clinical trial presented in [A. Hadgu and G. Koch, Application of generalized estimating equations to a dental randomized clinical trial, J. Biopharm. Stat. 9 (1999), pp. 161-178].

Evolutionary model trees for handling continuous classes in machine learning

Abstract:

Model trees are a particular case of decision trees employed to solve regression problems. They have the advantage of presenting an interpretable output, helping the end-user to get more confidence in the prediction and providing the basis for the end-user to have new insight about the data, confirming or rejecting hypotheses previously formed. Moreover, model trees present an acceptable level of predictive performance in comparison to most techniques used for solving regression problems. Since generating the optimal model tree is an NP-Complete problem, traditional model tree induction algorithms make use of a greedy top-down divide-and-conquer strategy, which may not converge to the global optimal solution. In this paper, we propose a novel algorithm based on the use of the evolutionary algorithms paradigm as an alternate heuristic to generate model trees in order to improve the convergence to globally near-optimal solutions. We call our new approach evolutionary model tree induction (E-Motion). We test its predictive performance using public UCI data sets, and we compare the results to traditional greedy regression/model trees induction algorithms, as well as to other evolutionary approaches. Results show that our method presents a good trade-off between predictive performance and model comprehensibility, which may be crucial in many machine learning applications. (C) 2010 Elsevier Inc. All rights reserved.

Improved likelihood inference for the shape parameter in Weibull regression

Abstract:

We obtain adjustments to the profile likelihood function in Weibull regression models with and without censoring. Specifically, we consider two different modified profile likelihoods: (i) the one proposed by Cox and Reid [Cox, D.R. and Reid, N., 1987, Parameter orthogonality and approximate conditional inference. Journal of the Royal Statistical Society B, 49, 1-39.], and (ii) an approximation to the one proposed by Barndorff-Nielsen [Barndorff-Nielsen, O.E., 1983, On a formula for the distribution of the maximum likelihood estimator. Biometrika, 70, 343-365.], the approximation having been obtained using the results by Fraser and Reid [Fraser, D.A.S. and Reid, N., 1995, Ancillaries and third-order significance. Utilitas Mathematica, 47, 33-53.] and by Fraser et al. [Fraser, D.A.S., Reid, N. and Wu, J., 1999, A simple formula for tail probabilities for frequentist and Bayesian inference. Biometrika, 86, 655-661.]. We focus on point estimation and likelihood ratio tests on the shape parameter in the class of Weibull regression models. We derive some distributional properties of the different maximum likelihood estimators and likelihood ratio tests. The numerical evidence presented in the paper favors the approximation to Barndorff-Nielsen's adjustment.

Improved score tests in symmetric linear regression models

Abstract:

The class of symmetric linear regression models has the normal linear regression model as a special case and includes several models that assume that the errors follow a symmetric distribution with longer-than-normal tails. An important member of this class is the t linear regression model, which is commonly used as an alternative to the usual normal regression model when the data contain extreme or outlying observations. In this article, we develop second-order asymptotic theory for score tests in this class of models. We obtain Bartlett-corrected score statistics for testing hypotheses on the regression and the dispersion parameters. The corrected statistics have chi-squared distributions with errors of order O(n^{-3/2}), n being the sample size. The corrections represent an improvement over the corresponding original Rao's score statistics, which are chi-squared distributed up to errors of order O(n^{-1}). Simulation results show that the corrected score tests perform much better than their uncorrected counterparts in samples of small or moderate size.