938 resultados para regression ana-lysis
Resumo:
Forecasting wind power is an important part of a successful integration of wind power into the power grid. Forecasts with lead times longer than 6 h are generally made by using statistical methods to post-process forecasts from numerical weather prediction systems. Two major problems that complicate this approach are the non-linear relationship between wind speed and power production and the limited range of power production between zero and nominal power of the turbine. In practice, these problems are often tackled by using non-linear non-parametric regression models. However, such an approach ignores valuable and readily available information: the power curve of the turbine's manufacturer. Much of the non-linearity can be directly accounted for by transforming the observed power production into wind speed via the inverse power curve so that simpler linear regression models can be used. Furthermore, the fact that the transformed power production has a limited range can be taken care of by employing censored regression models. In this study, we evaluate quantile forecasts from a range of methods: (i) using parametric and non-parametric models, (ii) with and without the proposed inverse power curve transformation and (iii) with and without censoring. The results show that with our inverse (power-to-wind) transformation, simpler linear regression models with censoring perform equally or better than non-linear models with or without the frequently used wind-to-power transformation.
Resumo:
We use sunspot group observations from the Royal Greenwich Observatory (RGO) to investigate the effects of intercalibrating data from observers with different visual acuities. The tests are made by counting the number of groups RB above a variable cut-off threshold of observed total whole-spot area (uncorrected for foreshortening) to simulate what a lower acuity observer would have seen. The synthesised annual means of RB are then re-scaled to the full observed RGO group number RA using a variety of regression techniques. It is found that a very high correlation between RA and RB (rAB > 0.98) does not prevent large errors in the intercalibration (for example sunspot maximum values can be over 30 % too large even for such levels of rAB). In generating the backbone sunspot number (RBB), Svalgaard and Schatten (2015, this issue) force regression fits to pass through the scatter plot origin which generates unreliable fits (the residuals do not form a normal distribution) and causes sunspot cycle amplitudes to be exaggerated in the intercalibrated data. It is demonstrated that the use of Quantile-Quantile (“Q Q”) plots to test for a normal distribution is a useful indicator of erroneous and misleading regression fits. Ordinary least squares linear fits, not forced to pass through the origin, are sometimes reliable (although the optimum method used is shown to be different when matching peak and average sunspot group numbers). However, other fits are only reliable if non-linear regression is used. From these results it is entirely possible that the inflation of solar cycle amplitudes in the backbone group sunspot number as one goes back in time, relative to related solar-terrestrial parameters, is entirely caused by the use of inappropriate and non-robust regression techniques to calibrate the sunspot data.
Resumo:
Objectives: To assess the role of the individual determinants on the inequalities of dental services utilization among low-income children living in the working area of Brazilian`s federal Primary Health Care program, which is called Family Health Program (FHP), in a big city in Southern Brazil. Methods: A cross-sectional population-based study was performed. The sample included 350 children, ages 0 to 14 years, whose parents answered a questionnaire about their socioeconomic conditions, perceived needs, oral hygiene habits, and access to dental services. The data analysis was performed according to a conceptual framework based on Andersen`s behavioral model of health services use. Multivariate models of logistic regression analysis instructed the hypothesis on covariates for never having had a dental visit. Results: Thirty one percent of the surveyed children had never had a dental visit. In the bivariate analysis, higher proportion of children who had never had a dental visit was found among the very young, those with inadequate oral hygiene habits, those without perceived need of dental care, and those whose family homes were under absent ownership. The mechanisms of social support showed to be important enabling factors: children attending schools/kindergartens and being regularly monitored by the FHP teams had higher odds of having gone to the dentist, even after adjusting for socioeconomic, demographic, and need variables. Conclusions: The conceptual framework has confirmed the presence of social and psychosocial inequalities on the utilization pattern of dental services for low-income children. The individual determinants seem to be important predictors of access.
Resumo:
Some vocal disorders in teachers are associated with occupational factors, but there are few studies that analyze the influence of vocal habits, fluid intake, mastication, and sleep on these disorders. The objective was to analyze the Occurrence of vocal fatigue, hoarseness, and dry throat in elementary and high school teachers and their association with vocal habits, fluid intake, mastication, and sleep. A sample of 422 elementary and secondary school teachers was Studied using a specific questionnaire. The multiple regression analysis showed that hoarseness was associated with absence of water intake (odds ratio (OR) = 1.7; P = 0.047), yelling/speaking loudly (OR = 1.6; P = 0.058), jaw-opening limitations (OR = 3.8; P = 0.003). average of: 6 hours of sleep/light (OR = 1.7; P = 0.039), and waking-up feeling replenished (OR = 2.0; P = 0.020). The presence of vocal fatigue was significantly associated with yelling/speaking loudly (OR = 2.2: P = 0.013), speaking excessively (OR = 2.4; P = 0.023), difficulty to open the mouth to masticate (OR = 6.6; P = 0.003), less than 6 hours of sleep (OR = 4.0; P = 0.008), and waking-up feeling replenished (sometimes OR = 2.8: P = 0.003; or never OR = 3.3 P = 0.002). The presence of dry throat was associated with being a former smoker (OR = 3.3; P = 0.011) and having jaw-opening limitations (OR = 3.9; P = 0.021). In recent years, speech and hearing interventions with teachers have focused on health-care promotion actions and prevention of vocal disorders, prioritizing issues related with hydration and healthy vocal use habits. However, the findings in the present study show the need to further focus on lifestyle habits related to sleep and eating habits.
Resumo:
Lonomia obliqua caterpillar bristle extract induces hemolysis in vitro on washed human and rat erythrocytes, in either the absence or presence of exogenous lecithin. In the former condition, phospholipases A(2) are key enzymes involved in hemolysis. However, the mechanism whereby this extract causes direct hemolysis is not known. Thus, the aim of this study was to investigate the hemolytic mechanism of the crude extract of the caterpillar L obliqua on human erythrocytes in the absence of lecithin. The extract significantly increased the erythrocyte osmotic fragility and promoted the removal of glycophorins A and C, and band 3 from the erythrocyte membrane. The use of Ca(2+) and Mg(2+) ions significantly potentiated glycoprotein removal, remarkably of erythrocyte band 3. The composition of fatty acids was analyzed by HPLC in both L obliqua caterpillar bristle extract and human erythrocyte membranes incubated with the extract. The levels of unsaturated fatty acids were remarkably augmented in erythrocytes incubated with the extract than in control erythrocytes, modifying thereby the saturated/unsaturated fatty acid ratio. Altogether, evidence is provided here that the interplay of at least three mechanisms of action accounts for the direct activity of the bristle extract on erythrocyte membrane, leading to hemolysis: the removal of glycoproteins and band 3; the insertion of fatty acids; and the action of phospholipases. Such mechanisms might affect erythrocyte flexibility and deformability, which may induce hemolysis by increasing erythrocyte fragility. However, whether the direct hemolytic activity of L obliqua caterpillar is the major cause of intravascular hemolysis during envenomation still needs further investigation. (C) 2010 Elsevier Ltd. All rights reserved.
Resumo:
In this article, we present a generalization of the Bayesian methodology introduced by Cepeda and Gamerman (2001) for modeling variance heterogeneity in normal regression models where we have orthogonality between mean and variance parameters to the general case considering both linear and highly nonlinear regression models. Under the Bayesian paradigm, we use MCMC methods to simulate samples for the joint posterior distribution. We illustrate this algorithm considering a simulated data set and also considering a real data set related to school attendance rate for children in Colombia. Finally, we present some extensions of the proposed MCMC algorithm.
Resumo:
In this paper, we compare the performance of two statistical approaches for the analysis of data obtained from the social research area. In the first approach, we use normal models with joint regression modelling for the mean and for the variance heterogeneity. In the second approach, we use hierarchical models. In the first case, individual and social variables are included in the regression modelling for the mean and for the variance, as explanatory variables, while in the second case, the variance at level 1 of the hierarchical model depends on the individuals (age of the individuals), and in the level 2 of the hierarchical model, the variance is assumed to change according to socioeconomic stratum. Applying these methodologies, we analyze a Colombian tallness data set to find differences that can be explained by socioeconomic conditions. We also present some theoretical and empirical results concerning the two models. From this comparative study, we conclude that it is better to jointly modelling the mean and variance heterogeneity in all cases. We also observe that the convergence of the Gibbs sampling chain used in the Markov Chain Monte Carlo method for the jointly modeling the mean and variance heterogeneity is quickly achieved.
Resumo:
Nesse artigo, tem-se o interesse em avaliar diferentes estratégias de estimação de parâmetros para um modelo de regressão linear múltipla. Para a estimação dos parâmetros do modelo foram utilizados dados de um ensaio clínico em que o interesse foi verificar se o ensaio mecânico da propriedade de força máxima (EM-FM) está associada com a massa femoral, com o diâmetro femoral e com o grupo experimental de ratas ovariectomizadas da raça Rattus norvegicus albinus, variedade Wistar. Para a estimação dos parâmetros do modelo serão comparadas três metodologias: a metodologia clássica, baseada no método dos mínimos quadrados; a metodologia Bayesiana, baseada no teorema de Bayes; e o método Bootstrap, baseado em processos de reamostragem.
Resumo:
The purpose of this paper is to develop a Bayesian analysis for nonlinear regression models under scale mixtures of skew-normal distributions. This novel class of models provides a useful generalization of the symmetrical nonlinear regression models since the error distributions cover both skewness and heavy-tailed distributions such as the skew-t, skew-slash and the skew-contaminated normal distributions. The main advantage of these class of distributions is that they have a nice hierarchical representation that allows the implementation of Markov chain Monte Carlo (MCMC) methods to simulate samples from the joint posterior distribution. In order to examine the robust aspects of this flexible class, against outlying and influential observations, we present a Bayesian case deletion influence diagnostics based on the Kullback-Leibler divergence. Further, some discussions on the model selection criteria are given. The newly developed procedures are illustrated considering two simulations study, and a real data previously analyzed under normal and skew-normal nonlinear regression models. (C) 2010 Elsevier B.V. All rights reserved.
A bivariate regression model for matched paired survival data: local influence and residual analysis
Resumo:
The use of bivariate distributions plays a fundamental role in survival and reliability studies. In this paper, we consider a location scale model for bivariate survival times based on the proposal of a copula to model the dependence of bivariate survival data. For the proposed model, we consider inferential procedures based on maximum likelihood. Gains in efficiency from bivariate models are also examined in the censored data setting. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and compared to the performance of the bivariate regression model for matched paired survival data. Sensitivity analysis methods such as local and total influence are presented and derived under three perturbation schemes. The martingale marginal and the deviance marginal residual measures are used to check the adequacy of the model. Furthermore, we propose a new measure which we call modified deviance component residual. The methodology in the paper is illustrated on a lifetime data set for kidney patients.
Resumo:
In this paper we have discussed inference aspects of the skew-normal nonlinear regression models following both, a classical and Bayesian approach, extending the usual normal nonlinear regression models. The univariate skew-normal distribution that will be used in this work was introduced by Sahu et al. (Can J Stat 29:129-150, 2003), which is attractive because estimation of the skewness parameter does not present the same degree of difficulty as in the case with Azzalini (Scand J Stat 12:171-178, 1985) one and, moreover, it allows easy implementation of the EM-algorithm. As illustration of the proposed methodology, we consider a data set previously analyzed in the literature under normality.
Resumo:
The purpose of this paper is to develop a Bayesian approach for log-Birnbaum-Saunders Student-t regression models under right-censored survival data. Markov chain Monte Carlo (MCMC) methods are used to develop a Bayesian procedure for the considered model. In order to attenuate the influence of the outlying observations on the parameter estimates, we present in this paper Birnbaum-Saunders models in which a Student-t distribution is assumed to explain the cumulative damage. Also, some discussions on the model selection to compare the fitted models are given and case deletion influence diagnostics are developed for the joint posterior distribution based on the Kullback-Leibler divergence. The developed procedures are illustrated with a real data set. (C) 2010 Elsevier B.V. All rights reserved.
Resumo:
In interval-censored survival data, the event of interest is not observed exactly but is only known to occur within some time interval. Such data appear very frequently. In this paper, we are concerned only with parametric forms, and so a location-scale regression model based on the exponentiated Weibull distribution is proposed for modeling interval-censored data. We show that the proposed log-exponentiated Weibull regression model for interval-censored data represents a parametric family of models that include other regression models that are broadly used in lifetime data analysis. Assuming the use of interval-censored data, we employ a frequentist analysis, a jackknife estimator, a parametric bootstrap and a Bayesian analysis for the parameters of the proposed model. We derive the appropriate matrices for assessing local influences on the parameter estimates under different perturbation schemes and present some ways to assess global influences. Furthermore, for different parameter settings, sample sizes and censoring percentages, various simulations are performed; in addition, the empirical distribution of some modified residuals are displayed and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be straightforwardly extended to a modified deviance residual in log-exponentiated Weibull regression models for interval-censored data. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
In this paper, the generalized log-gamma regression model is modified to allow the possibility that long-term survivors may be present in the data. This modification leads to a generalized log-gamma regression model with a cure rate, encompassing, as special cases, the log-exponential, log-Weibull and log-normal regression models with a cure rate typically used to model such data. The models attempt to simultaneously estimate the effects of explanatory variables on the timing acceleration/deceleration of a given event and the surviving fraction, that is, the proportion of the population for which the event never occurs. The normal curvatures of local influence are derived under some usual perturbation schemes and two martingale-type residuals are proposed to assess departures from the generalized log-gamma error assumption as well as to detect outlying observations. Finally, a data set from the medical area is analyzed.
Resumo:
In survival analysis applications, the failure rate function may frequently present a unimodal shape. In such case, the log-normal or log-logistic distributions are used. In this paper, we shall be concerned only with parametric forms, so a location-scale regression model based on the Burr XII distribution is proposed for modeling data with a unimodal failure rate function as an alternative to the log-logistic regression model. Assuming censored data, we consider a classic analysis, a Bayesian analysis and a jackknife estimator for the parameters of the proposed model. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and compared to the performance of the log-logistic and log-Burr XII regression models. Besides, we use sensitivity analysis to detect influential or outlying observations, and residual analysis is used to check the assumptions in the model. Finally, we analyze a real data set under log-Buff XII regression models. (C) 2008 Published by Elsevier B.V.