948 resultados para generalized additive model
Resumo:
Aim To assess the geographical transferability of niche-based species distribution models fitted with two modelling techniques. Location Two distinct geographical study areas in Switzerland and Austria, in the subalpine and alpine belts. Methods Generalized linear and generalized additive models (GLM and GAM) with a binomial probability distribution and a logit link were fitted for 54 plant species, based on topoclimatic predictor variables. These models were then evaluated quantitatively and used for spatially explicit predictions within (internal evaluation and prediction) and between (external evaluation and prediction) the two regions. Comparisons of evaluations and spatial predictions between regions and models were conducted in order to test if species and methods meet the criteria of full transferability. By full transferability, we mean that: (1) the internal evaluation of models fitted in region A and B must be similar; (2) a model fitted in region A must at least retain a comparable external evaluation when projected into region B, and vice-versa; and (3) internal and external spatial predictions have to match within both regions. Results The measures of model fit are, on average, 24% higher for GAMs than for GLMs in both regions. However, the differences between internal and external evaluations (AUC coefficient) are also higher for GAMs than for GLMs (a difference of 30% for models fitted in Switzerland and 54% for models fitted in Austria). Transferability, as measured with the AUC evaluation, fails for 68% of the species in Switzerland and 55% in Austria for GLMs (respectively for 67% and 53% of the species for GAMs). For both GAMs and GLMs, the agreement between internal and external predictions is rather weak on average (Kulczynski's coefficient in the range 0.3-0.4), but varies widely among individual species. The dominant pattern is an asymmetrical transferability between the two study regions (a mean decrease of 20% for the AUC coefficient when the models are transferred from Switzerland and 13% when they are transferred from Austria). Main conclusions The large inter-specific variability observed among the 54 study species underlines the need to consider more than a few species to test properly the transferability of species distribution models. The pronounced asymmetry in transferability between the two study regions may be due to peculiarities of these regions, such as differences in the ranges of environmental predictors or the varied impact of land-use history, or to species-specific reasons like differential phenotypic plasticity, existence of ecotypes or varied dependence on biotic interactions that are not properly incorporated into niche-based models. The lower variation between internal and external evaluation of GLMs compared to GAMs further suggests that overfitting may reduce transferability. Overall, a limited geographical transferability calls for caution when projecting niche-based models for assessing the fate of species in future environments.
Resumo:
Air pollution has been associated with health effects on different age groups. The present study was designed to assess the impact of daily changes in air pollutants (NO2, SO2, CO, O3, and particle matter (PM10)) on total number of daily neonatal deaths (those that occur between the first and the 28th days of life) in São Paulo, from January 1998 to December 2000, since adverse outcomes such as neonatal deaths associated with air pollution in Brazil have not been evaluated before. Generalized additive Poisson regression models were used and nonparametric smooth functions (loess) were adopted to control long-term trend, temperature, humidity, and short-term trends. A linear term was used for holidays. The association between air pollutants and neonatal deaths showed a short time lag. Interquartile range increases in PM10 (23.3 µg/m³) and SO2 (9.2 µg/m³) were associated with increases of 4% (95% CI, 2-6) and 6% (95% CI, 4-8), respectively. Instead of adopting a two-pollutant model we created an index to represent PM10 and SO2 effects. For an interquartile range increase in the index an increase of 6.3% (95% CI, 6.1-6.5) in neonatal deaths was observed. These results agree with previous studies performed by our group showing the deleterious effects of air pollutants during the perinatal period. The method reported here represents an alternative approach to analyze the relationship between highly correlated pollutants and public health problems, reinforcing the idea of the synergic effects of air pollutants in public health.
Resumo:
Exposure to nitrogen oxides (NOx) emitted by burning fossil fuels has been associated with respiratory diseases. We aimed to estimate the effects of NOx exposure on mortality owing to respiratory diseases in residents of Taubaté, São Paulo, Brazil, of all ages and both sexes. This time-series ecological study from August 1, 2011 to July 31, 2012 used information on deaths caused by respiratory diseases obtained from the Health Department of Taubaté. Estimated daily levels of pollutants (NOx, particulate matter, ozone, carbon monoxide) were obtained from the Centro de Previsão de Tempo e Estudos Climáticos Coupled Aerosol and Tracer Transport model to the Brazilian developments on the Regional Atmospheric Modeling System. These environmental variables were used to adjust the multipollutant model for apparent temperature. To estimate association between hospitalizations owing to asthma and air pollutants, generalized additive Poisson regression models were developed, with lags as much as 5 days. There were 385 deaths with a daily mean (±SD) of 1.05±1.03 (range: 0-5). Exposure to NOx was significantly associated with mortality owing to respiratory diseases: relative risk (RR)=1.035 (95% confidence interval [CI]: 1.008-1.063) for lag 2, RR=1.064 (95%CI: 1.017-1.112) lag 3, RR=1.055 (95%CI: 1.025-1.085) lag 4, and RR=1.042 (95%CI: 1.010-1.076) lag 5. A 3 µg/m3 reduction in NOx concentration resulted in a decrease of 10-18 percentage points in risk of death caused by respiratory diseases. Even at NOx concentrations below the acceptable standard, there is association with deaths caused by respiratory diseases.
Resumo:
Thesis written in co-mentorship with Robert Michaud.
Resumo:
Nonlinear system identification is considered using a generalized kernel regression model. Unlike the standard kernel model, which employs a fixed common variance for all the kernel regressors, each kernel regressor in the generalized kernel model has an individually tuned diagonal covariance matrix that is determined by maximizing the correlation between the training data and the regressor using a repeated guided random search based on boosting optimization. An efficient construction algorithm based on orthogonal forward regression with leave-one-out (LOO) test statistic and local regularization (LR) is then used to select a parsimonious generalized kernel regression model from the resulting full regression matrix. The proposed modeling algorithm is fully automatic and the user is not required to specify any criterion to terminate the construction procedure. Experimental results involving two real data sets demonstrate the effectiveness of the proposed nonlinear system identification approach.
Resumo:
In many data sets from clinical studies there are patients insusceptible to the occurrence of the event of interest. Survival models which ignore this fact are generally inadequate. The main goal of this paper is to describe an application of the generalized additive models for location, scale, and shape (GAMLSS) framework to the fitting of long-term survival models. in this work the number of competing causes of the event of interest follows the negative binomial distribution. In this way, some well known models found in the literature are characterized as particular cases of our proposal. The model is conveniently parameterized in terms of the cured fraction, which is then linked to covariates. We explore the use of the gamlss package in R as a powerful tool for inference in long-term survival models. The procedure is illustrated with a numerical example. (C) 2009 Elsevier Ireland Ltd. All rights reserved.
Resumo:
The estimation of data transformation is very useful to yield response variables satisfying closely a normal linear model, Generalized linear models enable the fitting of models to a wide range of data types. These models are based on exponential dispersion models. We propose a new class of transformed generalized linear models to extend the Box and Cox models and the generalized linear models. We use the generalized linear model framework to fit these models and discuss maximum likelihood estimation and inference. We give a simple formula to estimate the parameter that index the transformation of the response variable for a subclass of models. We also give a simple formula to estimate the rth moment of the original dependent variable. We explore the possibility of using these models to time series data to extend the generalized autoregressive moving average models discussed by Benjamin er al. [Generalized autoregressive moving average models. J. Amer. Statist. Assoc. 98, 214-223]. The usefulness of these models is illustrated in a Simulation study and in applications to three real data sets. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
Objective. To investigate the short-term effects of exposure to particulate matter from biomass burning in the Amazon on the daily demand for outpatient care due to respiratory diseases in children and the elderly. Methods. Epidemiologic study with ecologic time series design. Daily consultation records were obtained from the 14 primary health care clinics in the municipality of Alta Floresta, state of Mato Grosso, in the southern region of the Brazilian Amazon, between January 2004 and December 2005. Information on the daily levels of fine particulate matter was made available by the Brazilian National Institute for Spatial Research. To control for confounding factors ( situations in which a non-causal association between exposure and disease is observed due to a third variable), variables related to time trends, seasonality, temperature, relative humidity, rainfall, and calendar effects ( such as occurrence of holidays and weekends) were included in the model. Poisson regression with generalized additive models was used. Results. A 10 mu g/m(3) increase in the level of exposure to particulate matter was associated with increases of 2.9% and 2.6% in outpatient consultations due to respiratory diseases in children on the 6th and 7th days following exposure. Significant associations were not observed for elderly individuals. Conclusions. The results suggest that the levels of particulate matter from biomass burning in the Amazon are associated with adverse effects on the respiratory health of children.
Resumo:
We consider consider the problem of dichotomizing a continuous covariate when performing a regression analysis based on a generalized estimation approach. The problem involves estimation of the cutpoint for the covariate and testing the hypothesis that the binary covariate constructed from the continuous covariate has a significant impact on the outcome. Due to the multiple testing used to find the optimal cutpoint, we need to make an adjustment to the usual significance test to preserve the type-I error rates. We illustrate the techniques on one data set of patients given unrelated hematopoietic stem cell transplantation. Here the question is whether the CD34 cell dose given to patient affects the outcome of the transplant and what is the smallest cell dose which is needed for good outcomes. (C) 2010 Elsevier BM. All rights reserved.
Resumo:
Detecting both the majors genes that control the phenotypic mean and those controlling phenotypic variance has been raised in quantitative trait loci analysis. In order to mapping both kinds of genes, we applied the idea of the classic Haley-Knott regression to double generalized linear models. We performed both kinds of quantitative trait loci detection for a Red Jungle Fowl x White Leghorn F2 intercross using double generalized linear models. It is shown that double generalized linear model is a proper and efficient approach for localizing variance-controlling genes. We compared two models with or without fixed sex effect and prefer including the sex effect in order to reduce the residual variances. We found that different genes might take effect on the body weight at different time as the chicken grows.
Resumo:
Background: Genetic variation for environmental sensitivity indicates that animals are genetically different in their response to environmental factors. Environmental factors are either identifiable (e.g. temperature) and called macro-environmental or unknown and called micro-environmental. The objectives of this study were to develop a statistical method to estimate genetic parameters for macro- and micro-environmental sensitivities simultaneously, to investigate bias and precision of resulting estimates of genetic parameters and to develop and evaluate use of Akaike’s information criterion using h-likelihood to select the best fitting model. Methods: We assumed that genetic variation in macro- and micro-environmental sensitivities is expressed as genetic variance in the slope of a linear reaction norm and environmental variance, respectively. A reaction norm model to estimate genetic variance for macro-environmental sensitivity was combined with a structural model for residual variance to estimate genetic variance for micro-environmental sensitivity using a double hierarchical generalized linear model in ASReml. Akaike’s information criterion was constructed as model selection criterion using approximated h-likelihood. Populations of sires with large half-sib offspring groups were simulated to investigate bias and precision of estimated genetic parameters. Results: Designs with 100 sires, each with at least 100 offspring, are required to have standard deviations of estimated variances lower than 50% of the true value. When the number of offspring increased, standard deviations of estimates across replicates decreased substantially, especially for genetic variances of macro- and micro-environmental sensitivities. Standard deviations of estimated genetic correlations across replicates were quite large (between 0.1 and 0.4), especially when sires had few offspring. Practically, no bias was observed for estimates of any of the parameters. Using Akaike’s information criterion the true genetic model was selected as the best statistical model in at least 90% of 100 replicates when the number of offspring per sire was 100. Application of the model to lactation milk yield in dairy cattle showed that genetic variance for micro- and macro-environmental sensitivities existed. Conclusion: The algorithm and model selection criterion presented here can contribute to better understand genetic control of macro- and micro-environmental sensitivities. Designs or datasets should have at least 100 sires each with 100 offspring.
Resumo:
Background: The sensitivity to microenvironmental changes varies among animals and may be under genetic control. It is essential to take this element into account when aiming at breeding robust farm animals. Here, linear mixed models with genetic effects in the residual variance part of the model can be used. Such models have previously been fitted using EM and MCMC algorithms. Results: We propose the use of double hierarchical generalized linear models (DHGLM), where the squared residuals are assumed to be gamma distributed and the residual variance is fitted using a generalized linear model. The algorithm iterates between two sets of mixed model equations, one on the level of observations and one on the level of variances. The method was validated using simulations and also by re-analyzing a data set on pig litter size that was previously analyzed using a Bayesian approach. The pig litter size data contained 10,060 records from 4,149 sows. The DHGLM was implemented using the ASReml software and the algorithm converged within three minutes on a Linux server. The estimates were similar to those previously obtained using Bayesian methodology, especially the variance components in the residual variance part of the model. Conclusions: We have shown that variance components in the residual variance part of a linear mixed model can be estimated using a DHGLM approach. The method enables analyses of animal models with large numbers of observations. An important future development of the DHGLM methodology is to include the genetic correlation between the random effects in the mean and residual variance parts of the model as a parameter of the DHGLM.
Resumo:
This paper presents the techniques of likelihood prediction for the generalized linear mixed models. Methods of likelihood prediction is explained through a series of examples; from a classical one to more complicated ones. The examples show, in simple cases, that the likelihood prediction (LP) coincides with already known best frequentist practice such as the best linear unbiased predictor. The paper outlines a way to deal with the covariate uncertainty while producing predictive inference. Using a Poisson error-in-variable generalized linear model, it has been shown that in complicated cases LP produces better results than already know methods.
Resumo:
Using the operator formalism, we obtain the bosonic representation for the free fermion field satisfying an equation of motion with higher-order derivatives. Then, we consider the operator solution of a generalized Schwinger model with higher-derivative coupling. Since the increasing of the derivative order implies the introduction of an equivalent number of extra fermionic degrees of freedom, the mass acquired by the gauge field is bigger than the one for the standard two-dimensional QED. An analysis of the problem from the functional integration point of view corroborates the findings of canonical quantization, and corrects certain results previously announced in the literature on the basis of Fujikawa's technique.
Resumo:
We calculate the Green functions of the two versions of the generalized Schwinger model, the anomalous and the nonanomalous one, in their higher order Lagrangian density form. Furthermore, it is shown through a sequence of transformations that the bosonized Lagrangian density is equivalent to the former, at least for the bosonic correlation functions. The introduction of the sources from the beginning, leading to a gauge-invariant source term, is also considered. It is verified that the two models have the same correlation functions only if the gauge-invariant sector is taken into account. Finally, there is presented a generalization of the Wess-Zumino term, and its physical consequences are studied, in particular the appearance of gauge-dependent massive excitations.