80 resultados para Generalised Linear Models
Resumo:
The estimation of data transformation is very useful to yield response variables satisfying closely a normal linear model, Generalized linear models enable the fitting of models to a wide range of data types. These models are based on exponential dispersion models. We propose a new class of transformed generalized linear models to extend the Box and Cox models and the generalized linear models. We use the generalized linear model framework to fit these models and discuss maximum likelihood estimation and inference. We give a simple formula to estimate the parameter that index the transformation of the response variable for a subclass of models. We also give a simple formula to estimate the rth moment of the original dependent variable. We explore the possibility of using these models to time series data to extend the generalized autoregressive moving average models discussed by Benjamin er al. [Generalized autoregressive moving average models. J. Amer. Statist. Assoc. 98, 214-223]. The usefulness of these models is illustrated in a Simulation study and in applications to three real data sets. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
In this paper we extend partial linear models with normal errors to Student-t errors Penalized likelihood equations are applied to derive the maximum likelihood estimates which appear to be robust against outlying observations in the sense of the Mahalanobis distance In order to study the sensitivity of the penalized estimates under some usual perturbation schemes in the model or data the local influence curvatures are derived and some diagnostic graphics are proposed A motivating example preliminary analyzed under normal errors is reanalyzed under Student-t errors The local influence approach is used to compare the sensitivity of the model estimates (C) 2010 Elsevier B V All rights reserved
Resumo:
Mixed linear models are commonly used in repeated measures studies. They account for the dependence amongst observations obtained from the same experimental unit. Often, the number of observations is small, and it is thus important to use inference strategies that incorporate small sample corrections. In this paper, we develop modified versions of the likelihood ratio test for fixed effects inference in mixed linear models. In particular, we derive a Bartlett correction to such a test, and also to a test obtained from a modified profile likelihood function. Our results generalize those in [Zucker, D.M., Lieberman, O., Manor, O., 2000. Improved small sample inference in the mixed linear model: Bartlett correction and adjusted likelihood. Journal of the Royal Statistical Society B, 62,827-838] by allowing the parameter of interest to be vector-valued. Additionally, our Bartlett corrections allow for random effects nonlinear covariance matrix structure. We report simulation results which show that the proposed tests display superior finite sample behavior relative to the standard likelihood ratio test. An application is also presented and discussed. (C) 2008 Elsevier B.V. All rights reserved.
Resumo:
We introduce in this paper the class of linear models with first-order autoregressive elliptical errors. The score functions and the Fisher information matrices are derived for the parameters of interest and an iterative process is proposed for the parameter estimation. Some robustness aspects of the maximum likelihood estimates are discussed. The normal curvatures of local influence are also derived for some usual perturbation schemes whereas diagnostic graphics to assess the sensitivity of the maximum likelihood estimates are proposed. The methodology is applied to analyse the daily log excess return on the Microsoft whose empirical distributions appear to have AR(1) and heavy-tailed errors. (C) 2008 Elsevier B.V. All rights reserved.
Resumo:
Background: Patients with chronic obstructive pulmonary disease (COPD) can have recurrent disease exacerbations triggered by several factors, including air pollution. Visits to the emergency respiratory department can be a direct result of short-term exposure to air pollution. The aim of this study was to investigate the relationship between the daily number of COPD emergency department visits and the daily environmental air concentrations of PM(10), SO(2), NO(2), CO and O(3) in the City of Sao Paulo, Brazil. Methods: The sample data were collected between 2001 and 2003 and are categorised by gender and age. Generalised linear Poisson regression models were adopted to control for both short-and long-term seasonal changes as well as for temperature and relative humidity. The non-linear dependencies were controlled using a natural cubic spline function. Third-degree polynomial distributed lag models were adopted to estimate both lag structures and the cumulative effects of air pollutants. Results: PM(10) and SO(2) readings showed both acute and lagged effects on COPD emergency department visits. Interquartile range increases in their concentration (28.3 mg/m(3) and 7.8 mg/m(3), respectively) were associated with a cumulative 6-day increase of 19% and 16% in COPD admissions, respectively. An effect on women was observed at lag 0, and among the elderly the lag period was noted to be longer. Increases in CO concentration showed impacts in the female and elderly groups. NO(2) and O(3) presented mild effects on the elderly and in women, respectively. Conclusion: These results indicate that air pollution affects health in a gender-and age-specific manner and should be considered a relevant risk factor that exacerbates COPD in urban environments.
Resumo:
Background This study aimed to evaluate the association between the total suspended particles (TSP) generated from burning sugar cane plantations and the incidence of hospital admissions from hypertension in the city of Araraquara. Methods The study was an ecological time-series study. Total daily records of hypertension (ICD 10th I10-15) were obtained from admitted patients of all ages in a hospital in Araraquara, Sao Paulo State, Brazil, from 23 March 2003 to 27 July 2004. The daily concentration of TSP (mu g/m(3)) was obtained using a Handi-Vol sampler placed in downtown Araraquara. The local airport provided daily measures of temperature and humidity. In generalised linear Poisson regression models, the daily number of hospital admissions for hypertension was considered to be the dependent variable and the daily TSP concentration the independent variable. Results TSP presented a lagged effect on hypertension admissions, which was first observed 1 day after a TSP increase and remained almost unchanged for the following 2 days. A 10 mu g/m(3) increase in the TSP 3 day moving average lagged in 1 day led to an increase in hypertension-related hospital admissions during the harvest period (12.5%, 95% CI 5.6% to 19.9%) that was almost 30% higher than during non-harvest periods (9.0%, 95% CI 4.0% to 14.3%). Conclusions Increases in TSP concentrations were associated with hypertension-related hospital admissions. Despite the benefits of reduced air pollution in urban cities achieved by using ethanol produced from sugar cane to power automobiles, areas where the sugar cane is produced and harvested were found to have increased public health risk.
Resumo:
Objectives: Air-pollution exposure has been associated with increased cardiovascular hospital admissions and mortality in time-series studies. We evaluated the relation between air pollutants and emergency room (ER) visits because of cardiac arrhythmia in a cardiology hospital. Methods: In a time-series study, we evaluated the association between the emergency room visits as a result of cardiac arrhythmia and daily variations in SO2, CO, NO2, O-3 and PM10, from January 1998 to August 1999. The cases of arrhythmia were modelled using generalised linear Poisson regression models, controlling for seasonality (short-term and long-term trend), and weather. Results: Interquartile range increases in CO (1.5 ppm), NO2 (49,5 mu g/m(3)) and PM10 (22.2 mu g/m(3)) on the concurrent day were associated with increases of 12.3% (95% CI: 7.6% to 17.2%), 10.4% (95% CI: 5.2% to 15.9%) and 6.7% (95% CI: 1.2% to 12.4%) in arrhythmia ER visits, respectively. PM10, CO and NO2 effects were dose-dependent and gaseous pollutants had thresholds. Only CO effect resisted estimates in models with more than one pollutant. Conclusions: Our results showed that air pollutant effects on arrhythmia are predominantly acute starting at concentrations below air quality standards, and the association with CO and NO2 suggests a relevant role for pollution caused by cars.
Resumo:
Joint generalized linear models and double generalized linear models (DGLMs) were designed to model outcomes for which the variability can be explained using factors and/or covariates. When such factors operate, the usual normal regression models, which inherently exhibit constant variance, will under-represent variation in the data and hence may lead to erroneous inferences. For count and proportion data, such noise factors can generate a so-called overdispersion effect, and the use of binomial and Poisson models underestimates the variability and, consequently, incorrectly indicate significant effects. In this manuscript, we propose a DGLM from a Bayesian perspective, focusing on the case of proportion data, where the overdispersion can be modeled using a random effect that depends on some noise factors. The posterior joint density function was sampled using Monte Carlo Markov Chain algorithms, allowing inferences over the model parameters. An application to a data set on apple tissue culture is presented, for which it is shown that the Bayesian approach is quite feasible, even when limited prior information is available, thereby generating valuable insight for the researcher about its experimental results.
Resumo:
We present simple matrix formulae for corrected score statistics in symmetric nonlinear regression models. The corrected score statistics follow more closely a chi (2) distribution than the classical score statistic. Our simulation results indicate that the corrected score tests display smaller size distortions than the original score test. We also compare the sizes and the powers of the corrected score tests with bootstrap-based score tests.
Resumo:
In this article, we give an asymptotic formula of order n(-1/2), where n is the sample size, for the skewness of the distributions of the maximum likelihood estimates of the parameters in exponencial family nonlinear models. We generalize the result by Cordeiro and Cordeiro ( 2001). The formula is given in matrix notation and is very suitable for computer implementation and to obtain closed form expressions for a great variety of models. Some special cases and two applications are discussed.
Resumo:
The aim of this article is to discuss the estimation of the systematic risk in capital asset pricing models with heavy-tailed error distributions to explain the asset returns. Diagnostic methods for assessing departures from the model assumptions as well as the influence of observations on the parameter estimates are also presented. It may be shown that outlying observations are down weighted in the maximum likelihood equations of linear models with heavy-tailed error distributions, such as Student-t, power exponential, logistic II, so on. This robustness aspect may also be extended to influential observations. An application in which the systematic risk estimate of Microsoft is compared under normal and heavy-tailed errors is presented for illustration.
Resumo:
Likelihood ratio tests can be substantially size distorted in small- and moderate-sized samples. In this paper, we apply Skovgaard`s [Skovgaard, I.M., 2001. Likelihood asymptotics. Scandinavian journal of Statistics 28, 3-321] adjusted likelihood ratio statistic to exponential family nonlinear models. We show that the adjustment term has a simple compact form that can be easily implemented from standard statistical software. The adjusted statistic is approximately distributed as X(2) with high degree of accuracy. It is applicable in wide generality since it allows both the parameter of interest and the nuisance parameter to be vector-valued. Unlike the modified profile likelihood ratio statistic obtained from Cox and Reid [Cox, D.R., Reid, N., 1987. Parameter orthogonality and approximate conditional inference. journal of the Royal Statistical Society B49, 1-39], the adjusted statistic proposed here does not require an orthogonal parameterization. Numerical comparison of likelihood-based tests of varying dispersion favors the test we propose and a Bartlett-corrected version of the modified profile likelihood ratio test recently obtained by Cysneiros and Ferrari [Cysneiros, A.H.M.A., Ferrari, S.L.P., 2006. An improved likelihood ratio test for varying dispersion in exponential family nonlinear models. Statistics and Probability Letters 76 (3), 255-265]. (C) 2008 Elsevier B.V. All rights reserved.
Resumo:
Birnbaum-Saunders models have largely been applied in material fatigue studies and reliability analyses to relate the total time until failure with some type of cumulative damage. In many problems related to the medical field, such as chronic cardiac diseases and different types of cancer, a cumulative damage caused by several risk factors might cause some degradation that leads to a fatigue process. In these cases, BS models can be suitable for describing the propagation lifetime. However, since the cumulative damage is assumed to be normally distributed in the BS distribution, the parameter estimates from this model can be sensitive to outlying observations. In order to attenuate this influence, we present in this paper BS models, in which a Student-t distribution is assumed to explain the cumulative damage. In particular, we show that the maximum likelihood estimates of the Student-t log-BS models attribute smaller weights to outlying observations, which produce robust parameter estimates. Also, some inferential results are presented. In addition, based on local influence and deviance component and martingale-type residuals, a diagnostics analysis is derived. Finally, a motivating example from the medical field is analyzed using log-BS regression models. Since the parameter estimates appear to be very sensitive to outlying and influential observations, the Student-t log-BS regression model should attenuate such influences. The model checking methodologies developed in this paper are used to compare the fitted models.
Resumo:
The Birnbaum-Saunders regression model is commonly used in reliability studies. We derive a simple matrix formula for second-order covariances of maximum-likelihood estimators in this class of models. The formula is quite suitable for computer implementation, since it involves only simple operations on matrices and vectors. Some simulation results show that the second-order covariances can be quite pronounced in small to moderate sample sizes. We also present empirical applications.
Resumo:
We consider the issue of assessing influence of observations in the class of Birnbaum-Saunders nonlinear regression models, which is useful in lifetime data analysis. Our results generalize those in Galea et al. [8] which are confined to Birnbaum-Saunders linear regression models. Some influence methods, such as the local influence, total local influence of an individual and generalized leverage are discussed. Additionally, the normal curvatures for studying local influence are derived under some perturbation schemes. We also give an application to a real fatigue data set.