Biblioteca Digital

963 results for STATISTICAL MODELS

The generalized log-gamma mixture model with covariates: local influence and residual analysis

Relevance:

20.00%

Publisher:

Abstract:

In a sample of censored survival times, the presence of an immune proportion of individuals who are not subject to death, failure or relapse may be indicated by a relatively high number of individuals with large censored survival times. In this paper the generalized log-gamma model is modified for the possibility that long-term survivors may be present in the data. The model attempts to separately estimate the effects of covariates on the surviving fraction, that is, the proportion of the population for which the event never occurs. The logistic function is used for the regression model of the surviving fraction. Inference for the model parameters is considered via maximum likelihood. Some influence methods, such as the local influence and total local influence of an individual, are derived, analyzed and discussed. Finally, a data set from the medical area is analyzed under the generalized log-gamma mixture model. A residual analysis is performed in order to select an appropriate model.
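
To make the mixture structure in this abstract concrete, the sketch below computes the population survival function pi(x) + (1 - pi(x)) * S0(t), where pi(x) is the surviving fraction modelled with the logistic function. The function names, covariate values and coefficients are illustrative assumptions, and the exponential baseline is only a stand-in for the generalized log-gamma model that the paper uses for susceptible individuals.

```python
import math

def cured_fraction(x, beta):
    """Logistic model for the surviving (immune) fraction pi(x)."""
    eta = sum(b * xi for b, xi in zip(beta, x))
    return 1.0 / (1.0 + math.exp(-eta))

def population_survival(t, x, beta, baseline_survival):
    """Mixture survival: pi(x) + (1 - pi(x)) * S0(t)."""
    pi = cured_fraction(x, beta)
    return pi + (1.0 - pi) * baseline_survival(t)

def exp_survival(t, rate=0.5):
    """Hypothetical stand-in baseline (exponential), NOT the paper's
    generalized log-gamma model for the susceptible individuals."""
    return math.exp(-rate * t)

if __name__ == "__main__":
    x = [1.0, 0.3]       # intercept plus one covariate (made-up values)
    beta = [-1.0, 2.0]   # made-up regression coefficients
    for t in (0.0, 1.0, 5.0, 50.0):
        print(t, round(population_survival(t, x, beta, exp_survival), 4))
```

As t grows, the population survival curve levels off at pi(x) rather than at zero, which is the plateau that a high number of large censored times suggests.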

Log-modified Weibull regression models with censored data: Sensitivity and residual analysis

Relevance:

20.00%

Publisher:

Abstract:

This paper proposes a regression model considering the modified Weibull distribution. This distribution can be used to model bathtub-shaped failure rate functions. Assuming censored data, we consider maximum likelihood and jackknife estimators for the parameters of the model. We derive the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes and we also present some ways to assess global influence. In addition, for different parameter settings, sample sizes and censoring percentages, various simulations are performed and the empirical distribution of the modified deviance residual is displayed and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be straightforwardly extended to a martingale-type residual in log-modified Weibull regression models with censored data. Finally, we analyze a real data set under log-modified Weibull regression models. A diagnostic analysis and a model checking based on the modified deviance residual are performed to select appropriate models. (c) 2008 Elsevier B.V. All rights reserved.
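
The bathtub shape mentioned in this abstract can be illustrated with a short sketch. It assumes the modified Weibull distribution is the three-parameter form with hazard h(t) = a (b + lam t) t^(b-1) exp(lam t); this parametrization and the names `a`, `b`, `lam` are assumptions, since the abstract does not state the formula. Under that form, when 0 < b < 1 and lam > 0 the hazard first decreases and then increases, with its minimum at t* = (sqrt(b) - b) / lam.

```python
import math

def modified_weibull_hazard(t, a, b, lam):
    """Assumed hazard h(t) = a * (b + lam*t) * t**(b-1) * exp(lam*t), t > 0."""
    return a * (b + lam * t) * t ** (b - 1.0) * math.exp(lam * t)

def hazard_turning_point(b, lam):
    """Time of minimum hazard when 0 < b < 1 and lam > 0.

    Obtained by setting the derivative of log h(t) to zero, which gives
    (lam*t + b)**2 = b.
    """
    return (math.sqrt(b) - b) / lam

if __name__ == "__main__":
    a, b, lam = 1.0, 0.5, 0.1   # made-up parameter values
    t_star = hazard_turning_point(b, lam)
    for t in (0.5 * t_star, t_star, 2.0 * t_star):
        print(round(t, 3), round(modified_weibull_hazard(t, a, b, lam), 4))
```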

On estimation and influence diagnostics for zero-inflated negative binomial regression models

Relevance:

20.00%

Publisher:

Abstract:

The zero-inflated negative binomial model is used to account for overdispersion detected in data that are initially analyzed under the zero-inflated Poisson model. A frequentist analysis, a jackknife estimator and a non-parametric bootstrap for parameter estimation of zero-inflated negative binomial regression models are considered. In addition, an EM-type algorithm is developed for performing maximum likelihood estimation. Then, the appropriate matrices for assessing local influence on the parameter estimates under different perturbation schemes and some ways to perform global influence analysis are derived. In order to study departures from the error assumption as well as the presence of outliers, residual analysis based on the standardized Pearson residuals is discussed. The relevance of the approach is illustrated with a real data set, where it is shown that zero-inflated negative binomial regression models seem to fit the data better than the Poisson counterpart. (C) 2010 Elsevier B.V. All rights reserved.
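
As a rough illustration of the model named in this abstract, the sketch below evaluates a zero-inflated negative binomial probability mass function. The mean/dispersion parametrization (`mu`, `phi`) and the name `p_zero` for the extra probability of a zero are assumptions chosen for illustration; the paper may use a different parametrization and links these quantities to covariates.

```python
import math

def nb_pmf(y, mu, phi):
    """Negative binomial pmf with mean mu and variance mu + mu**2 / phi."""
    log_p = (math.lgamma(y + phi) - math.lgamma(phi) - math.lgamma(y + 1)
             + phi * math.log(phi / (phi + mu))
             + y * math.log(mu / (phi + mu)))
    return math.exp(log_p)

def zinb_pmf(y, mu, phi, p_zero):
    """Zero-inflated NB: point mass p_zero at zero mixed with an NB count."""
    base = (1.0 - p_zero) * nb_pmf(y, mu, phi)
    return p_zero + base if y == 0 else base

if __name__ == "__main__":
    mu, phi, p_zero = 2.0, 1.5, 0.3   # made-up values
    print("P(Y=0), plain NB:", round(nb_pmf(0, mu, phi), 4))
    print("P(Y=0), ZINB:    ", round(zinb_pmf(0, mu, phi, p_zero), 4))
```

The zero-inflated version places more mass at zero than the plain negative binomial with the same `mu` and `phi`, while the dispersion parameter handles the extra variability among the counts.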

Regression models for grouped survival data: Estimation and sensitivity analysis

Relevance:

20.00%

Publisher:

Abstract:

In this study, regression models are evaluated for grouped survival data when the effect of censoring time is considered in the model and the regression structure is modeled through four link functions. The methodology for grouped survival data is based on life tables, and the times are grouped in k intervals so that ties are eliminated. Thus, the data modeling is performed by considering the discrete models of lifetime regression. The model parameters are estimated by using the maximum likelihood and jackknife methods. To detect influential observations in the proposed models, two kinds of diagnostic measures are used: measures based on case deletion, termed global influence, and measures based on small perturbations in the data or in the model, termed local influence. In addition to those measures, the total local influence estimate is also employed. Various simulation studies are performed to compare the performance of the four link functions of the regression models for grouped survival data for different parameter settings, sample sizes and numbers of intervals. Finally, a data set is analyzed by using the proposed regression models. (C) 2010 Elsevier B.V. All rights reserved.
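
The jackknife method mentioned in this abstract can be sketched generically: the estimator is recomputed with each observation left out in turn, and the leave-one-out values give a bias-corrected estimate and a standard error. The function below is a minimal sketch; the sample mean is used only as a stand-in estimator, whereas the paper applies the idea to the parameters of its regression models.

```python
import math

def jackknife(data, estimator):
    """Return the jackknife bias-corrected estimate and standard error.

    `data` is a list of observations and `estimator` maps a list to a number.
    """
    n = len(data)
    theta_hat = estimator(data)
    # Re-estimate with the i-th observation deleted, for every i.
    leave_one_out = [estimator(data[:i] + data[i + 1:]) for i in range(n)]
    theta_bar = sum(leave_one_out) / n
    bias_corrected = n * theta_hat - (n - 1) * theta_bar
    variance = (n - 1) / n * sum((t - theta_bar) ** 2 for t in leave_one_out)
    return bias_corrected, math.sqrt(variance)

def sample_mean(values):
    """Stand-in estimator used only for the demonstration."""
    return sum(values) / len(values)

if __name__ == "__main__":
    print(jackknife([1.0, 2.0, 3.0, 4.0, 5.0], sample_mean))
```

For the sample mean, the jackknife standard error coincides with the usual s / sqrt(n), which makes it a convenient sanity check before applying the same routine to a more complex estimator.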

General results for the beta-modified Weibull distribution

Relevance:

20.00%

Publisher:

Abstract:

We study in detail the so-called beta-modified Weibull distribution, motivated by the wide use of the Weibull distribution in practice and by the fact that the generalization provides a continuous crossover towards cases with different shapes. The new distribution is important since it contains as special sub-models some widely known distributions, such as the generalized modified Weibull, beta Weibull, exponentiated Weibull, beta exponential, modified Weibull and Weibull distributions, among several others. It also provides more flexibility to analyse complex real data. Various mathematical properties of this distribution are derived, including its moments and moment generating function. We examine the asymptotic distributions of the extreme values. Explicit expressions are also derived for the chf, mean deviations, Bonferroni and Lorenz curves, reliability and entropies. The estimation of parameters is approached by two methods: moments and maximum likelihood. We compare by simulation the performances of the estimates from these methods. We obtain the expected information matrix. Two applications are presented to illustrate the proposed distribution.

The exponentiated generalized gamma distribution with application to lifetime data

Relevance:

20.00%

Publisher:

Abstract:

A four-parameter extension of the generalized gamma distribution capable of modelling a bathtub-shaped hazard rate function is defined and studied. The beauty and importance of this distribution lies in its ability to model monotone and non-monotone failure rate functions, which are quite common in lifetime data analysis and reliability. The new distribution has a number of well-known lifetime special sub-models, such as the exponentiated Weibull, exponentiated generalized half-normal, exponentiated gamma and generalized Rayleigh, among others. We derive two infinite sum representations for its moments. We calculate the density of the order statistics and two expansions for their moments. The method of maximum likelihood is used for estimating the model parameters and the observed information matrix is obtained. Finally, a real data set from the medical area is analysed.

Spatial portability of numerical models of leaf wetness duration based on empirical approaches

Relevance:

20.00%

Publisher:

Abstract:

Leaf wetness duration (LWD) models based on empirical approaches offer practical advantages over physically based models in agricultural applications, but their spatial portability is questionable because they may be biased to the climatic conditions under which they were developed. In our study, spatial portability of three LWD models with empirical characteristics - a RH threshold model, a decision tree model with wind speed correction, and a fuzzy logic model - was evaluated using weather data collected in Brazil, Canada, Costa Rica, Italy and the USA. The fuzzy logic model was more accurate than the other models in estimating LWD measured by painted leaf wetness sensors. The fraction of correct estimates for the fuzzy logic model was greater (0.87) than for the other models (0.85-0.86) across 28 sites where painted sensors were installed, and the degree of agreement κ statistic between the model and painted sensors was greater for the fuzzy logic model (0.71) than that for the other models (0.64-0.66). Values of the κ statistic for the fuzzy logic model were also less variable across sites than those of the other models. When model estimates were compared with measurements from unpainted leaf wetness sensors, the fuzzy logic model had less mean absolute error (2.5 h day⁻¹) than other models (2.6-2.7 h day⁻¹) after the model was calibrated for the unpainted sensors. The results suggest that the fuzzy logic model has greater spatial portability than the other models evaluated and merits further validation in comparison with physical models under a wider range of climate conditions. (C) 2010 Elsevier B.V. All rights reserved.
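
The two agreement measures reported in this abstract can be sketched for a series of wet/dry classifications. The sketch assumes that the degree-of-agreement statistic is Cohen's kappa, which corrects the fraction of correct estimates for agreement expected by chance; the function name and the example series are made up for illustration.

```python
def fraction_correct_and_kappa(estimated, observed):
    """Fraction of matching wet/dry calls and Cohen's kappa.

    Both inputs are equal-length sequences of truthy (wet) / falsy (dry)
    values. Kappa is undefined when chance agreement equals 1.
    """
    n = len(observed)
    agree = sum(1 for e, o in zip(estimated, observed) if bool(e) == bool(o))
    p_agree = agree / n
    p_est_wet = sum(1 for e in estimated if e) / n
    p_obs_wet = sum(1 for o in observed if o) / n
    # Agreement expected if model and sensor were independent.
    p_chance = p_est_wet * p_obs_wet + (1 - p_est_wet) * (1 - p_obs_wet)
    kappa = (p_agree - p_chance) / (1 - p_chance)
    return p_agree, kappa

if __name__ == "__main__":
    model = [1, 1, 0, 0, 1, 0, 1, 0]    # made-up hourly model output
    sensor = [1, 1, 0, 0, 0, 0, 1, 1]   # made-up hourly sensor readings
    print(fraction_correct_and_kappa(model, sensor))
```

Because kappa discounts chance agreement, it separates models more clearly than the raw fraction of correct estimates, which is consistent with the wider spread of kappa values than of fractions correct quoted in the abstract.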

Alternative Analytical Expressions for the General van Genuchten-Mualem and van Genuchten-Burdine Hydraulic Conductivity Models

Relevance:

20.00%

Publisher:

Abstract:

The van Genuchten expressions for the unsaturated soil hydraulic properties, first published in 1980, are used frequently in various vadose zone flow and transport applications assuming a specific relationship between the m and n soil hydraulic parameters. By comparison, probably because of the complexity of the hydraulic conductivity equations, the more general solutions with independent m and n values are rarely used. We expressed the general van Genuchten-Mualem and van Genuchten-Burdine hydraulic conductivity equations in terms of hypergeometric functions, which can be approximated by infinite series that converge rapidly for relatively large values of the van Genuchten-Mualem parameter n but only very slowly when n is close to one. Alternative equations were derived that provide very close approximations of the analytical results. The newly proposed equations allow the use of independent values of the parameters m and n in the soil water retention model of van Genuchten for subsequent prediction of the van Genuchten-Mualem and van Genuchten-Burdine hydraulic conductivity models, thus providing more flexibility in fitting experimental pressure-head-dependent water content, theta(h), and hydraulic conductivity, K(h), or K(theta) data.
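
For orientation, the sketch below evaluates the van Genuchten soil water retention function with m and n treated as independent parameters, theta(h) = theta_r + (theta_s - theta_r) [1 + (alpha |h|)^n]^(-m). It covers only the retention curve; the hydraulic conductivity expressions and the alternative approximations derived in the paper are not reproduced here. The parameter values are made up, and the names `theta_r`, `theta_s` and `alpha` follow common usage rather than anything stated in the abstract.

```python
def effective_saturation(h, alpha, n, m):
    """Se(h) = [1 + (alpha*|h|)**n]**(-m); saturated (Se = 1) for h >= 0."""
    if h >= 0:
        return 1.0
    return (1.0 + (alpha * abs(h)) ** n) ** (-m)

def water_content(h, theta_r, theta_s, alpha, n, m):
    """Volumetric water content at pressure head h."""
    return theta_r + (theta_s - theta_r) * effective_saturation(h, alpha, n, m)

if __name__ == "__main__":
    # Made-up parameters; m is set independently of n here, whereas the
    # commonly used restriction for the Mualem model is m = 1 - 1/n.
    theta_r, theta_s, alpha, n, m = 0.05, 0.45, 0.02, 1.5, 0.5
    for h in (0.0, -10.0, -50.0, -1000.0):
        print(h, round(water_content(h, theta_r, theta_s, alpha, n, m), 4))
```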

Forecasting fuel ethanol consumption in Brazil by time series models: 2006-2012

Relevance:

20.00%

Publisher:

Abstract:

This article analysed scenarios for Brazilian consumption of ethanol for the period 2006 to 2012. The results show that if the country's GDP sustains growth of 4.6% a year, domestic consumption of fuel ethanol could increase to 25.16 billion liters in this period, a volume relatively close to the forecasted gasoline consumption of 31 billion liters. At a lower GDP growth of 1.22% a year, gasoline consumption would be reduced and domestic ethanol consumption in Brazil would be no higher than 18.32 billion liters. Contrary to the current situation, forecasts indicated that hydrated ethanol consumption could become much higher than anhydrous consumption in Brazil. The former is consumed in cars that run exclusively on ethanol and in flex-fuel cars, successfully introduced in the country in 2003. Flex cars allow Brazilian consumers to choose between gasoline and hydrated ethanol and immediately switch to whichever fuel presents the most favourable relative price.

Spatio-temporal modeling of agricultural yield data with an application to pricing crop insurance contracts

Relevance:

20.00%

Publisher:

Abstract:

This article presents a statistical model of agricultural yield data based on a set of hierarchical Bayesian models that allows joint modeling of temporal and spatial autocorrelation. This method captures a comprehensive range of the various uncertainties involved in predicting crop insurance premium rates as opposed to the more traditional ad hoc, two-stage methods that are typically based on independent estimation and prediction. A panel data set of county-average yield data was analyzed for 290 counties in the State of Paraná (Brazil) for the period of 1990 through 2002. Posterior predictive criteria are used to evaluate different model specifications. This article provides substantial improvements in the statistical and actuarial methods often applied to the calculation of insurance premium rates. These improvements are especially relevant to situations where data are limited.

Parametric and nonparametric statistical modelling of crop yield: implications for pricing crop insurance contracts

Relevance:

20.00%

Publisher:

Abstract:

This article considers alternative methods to calculate the fair premium rate of crop insurance contracts based on county yields. The premium rate was calculated using parametric and nonparametric approaches to estimate the conditional agricultural yield density. These methods were applied to a data set of county yields provided by the Brazilian Institute of Geography and Statistics (IBGE), for the period of 1990 through 2002, for soybean, corn and wheat, in the State of Paraná. In this article, we propose methodological alternatives for pricing crop insurance contracts that result in more accurate premium rates in a situation of limited data.

Does using stepwise variable selection to build sequential path analysis models make sense?

Relevance:

20.00%

Publisher:

Abstract:

Causal inference methods - mainly path analysis and structural equation modeling - offer plant physiologists information about cause-and-effect relationships among plant traits. Recently, an unusual approach to causal inference through stepwise variable selection has been proposed and used in various works on plant physiology. The approach should not be considered correct from a biological point of view. Here, it is explained why stepwise variable selection should not be used for causal inference, and shown what strange conclusions can be drawn from such an analysis when one aims to interpret cause-and-effect relationships among plant traits.

Parametric correlation functions to model the structure of permanent environmental (co)variances in milk yield random regression models

Relevance:

20.00%

Publisher:

Abstract:

The objective of the present study was to estimate milk yield genetic parameters applying random regression models and parametric correlation functions combined with a variance function to model animal permanent environmental effects. A total of 152,145 test-day milk yields from 7,317 first lactations of Holstein cows belonging to herds located in the southeastern region of Brazil were analyzed. Test-day milk yields were divided into 44 weekly classes of days in milk. Contemporary groups were defined by herd-test-day comprising a total of 2,539 classes. The model included direct additive genetic, permanent environmental, and residual random effects. The following fixed effects were considered: contemporary group, age of cow at calving (linear and quadratic regressions), and the population average lactation curve modeled by fourth-order orthogonal Legendre polynomial. Additive genetic effects were modeled by random regression on orthogonal Legendre polynomials of days in milk, whereas permanent environmental effects were estimated using a stationary or nonstationary parametric correlation function combined with a variance function of different orders. The structure of residual variances was modeled using a step function containing 6 variance classes. The genetic parameter estimates obtained with the model using a stationary correlation function associated with a variance function to model permanent environmental effects were similar to those obtained with models employing orthogonal Legendre polynomials for the same effect. A model using a sixth-order polynomial for additive effects and a stationary parametric correlation function associated with a seventh-order variance function to model permanent environmental effects would be sufficient for data fitting.
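
The regression on orthogonal Legendre polynomials of days in milk described in this abstract relies on covariates that can be sketched as follows. Days in milk are rescaled to [-1, 1] and the polynomials are evaluated with the standard three-term recurrence. The days-in-milk range used in the example is a made-up value, the function name is hypothetical, and the polynomials are left unnormalized, whereas applied work often uses a normalized version.

```python
def legendre_covariates(dim, dim_min, dim_max, degree):
    """Legendre polynomials P_0..P_degree at days in milk `dim`.

    `dim` is first mapped linearly from [dim_min, dim_max] to [-1, 1].
    """
    x = 2.0 * (dim - dim_min) / (dim_max - dim_min) - 1.0
    values = [1.0, x]
    for k in range(1, degree):
        # Bonnet recurrence: (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}
        values.append(((2 * k + 1) * x * values[k] - k * values[k - 1]) / (k + 1))
    return values[: degree + 1]

if __name__ == "__main__":
    # Made-up range of 5 to 305 days in milk, fourth-degree polynomial.
    for dim in (5, 155, 305):
        print(dim, [round(v, 4) for v in legendre_covariates(dim, 5, 305, 4)])
```

Each animal's random regression coefficients multiply these covariates, so one row of the design matrix per test-day record is all that is needed to trace an effect along the lactation.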

Random regression models to estimate test-day milk yield genetic parameters Holstein cows in Southeastern Brazil

Relevance:

20.00%

Publisher:

Abstract:

A total of 152,145 weekly test-day milk yield records from 7,317 first lactations of Holstein cows distributed in 93 herds in southeastern Brazil were analyzed. Test-day milk yields were classified into 44 weekly classes of days in milk (DIM). The contemporary groups were defined as herd-year-week of test-day. The model included direct additive genetic, permanent environmental and residual effects as random effects, and the fixed effects of contemporary group and age of cow at calving as a covariable (linear and quadratic effects). Mean trends were modeled by a cubic regression on orthogonal polynomials of DIM. Additive genetic and permanent environmental random effects were estimated by random regression on orthogonal Legendre polynomials. Residual variances were modeled using third- to seventh-order variance functions or a step function with 1, 6, 13, 17 and 44 variance classes. Results from Akaike's and Schwarz's Bayesian information criteria suggested that a model considering a 7th-order Legendre polynomial for the additive effect, a 12th-order polynomial for the permanent environmental effect and a step function with 6 classes for residual variances fitted best. However, a parsimonious model, with a 6th-order Legendre polynomial for additive effects and a 7th-order polynomial for permanent environmental effects, yielded very similar genetic parameter estimates. (C) 2008 Elsevier B.V. All rights reserved.

A split-pot experiment with sorghum to test a root water uptake partitioning model

Relevance:

20.00%

Publisher:

Abstract:

Correct modeling of root water uptake partitioning over depth is an important issue in hydrological and crop growth models. Recently a physically based model to describe root water uptake was developed at single root scale and upscaled to the root system scale considering a homogeneous distribution of roots per soil layer. Root water uptake partitioning is calculated over soil layers or compartments as a function of respective soil hydraulic conditions, specifically the soil matric flux potential, root characteristics and a root system efficiency factor to compensate for within-layer root system heterogeneities. The performance of this model was tested in an experiment performed in two-compartment split-pot lysimeters with sorghum plants. The compartments were submitted to different irrigation cycles resulting in contrasting water contents over time. The root system efficiency factor was determined to be about 0.05. Release of water from roots to soil was predicted and observed on several occasions during the experiment; however, model predictions suggested root water release to occur more often and at a higher rate than observed. This may be due to not considering internal root system resistances, thus overestimating the ease with which roots can act as conductors of water. Excluding these erroneous predictions from the dataset, statistical indices show model performance to be of good quality.
