91 resultados para Modelos log-linear
em Biblioteca Digital da Produção Intelectual da Universidade de São Paulo (BDPI/USP)
Resumo:
We introduce the log-beta Weibull regression model based on the beta Weibull distribution (Famoye et al., 2005; Lee et al., 2007). We derive expansions for the moment generating function which do not depend on complicated functions. The new regression model represents a parametric family of models that includes as sub-models several widely known regression models that can be applied to censored survival data. We employ a frequentist analysis, a jackknife estimator, and a parametric bootstrap for the parameters of the proposed model. We derive the appropriate matrices for assessing local influences on the parameter estimates under different perturbation schemes and present some ways to assess global influences. Further, for different parameter settings, sample sizes, and censoring percentages, several simulations are performed. In addition, the empirical distribution of some modified residuals are displayed and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be extended to a modified deviance residual in the proposed regression model applied to censored data. We define martingale and deviance residuals to evaluate the model assumptions. The extended regression model is very useful for the analysis of real data and could give more realistic fits than other special regression models.
Resumo:
OBJETIVO: Analisar a tendência da mortalidade por diarreia entre menores de 5 anos, no município de Osasco (SP), entre 1980 e 2000. MÉTODOS: Trata-se de estudo observacional com dois delineamentos. Um descritivo, que toma o indivíduo como unidade do estudo, e outro ecológico, analisando agregado populacional que incluiu análise de séries temporais. A fonte de dados foi o sistema de informação de mortalidade do Estado de São Paulo e censos de 1980, 1991 e 2000. Descreveu-se a variação sazonal e para a análise de tendência aplicaram-se modelos log lineares de regressão polinomiais, utilizando-se variáveis sociodemográficas da criança e da mãe. Foram analisadas a evolução de indicadores sociodemográficos do município de 1980 a 2000, as taxas médias de mortalidade por diarreia nos menores de 5 anos e seus diferenciais por distrito nos anos 90. RESULTADOS: Dos 1.360 óbitos, 94,3 e 75,3% atingiram, respectivamente, menores de 1 ano e de 6 meses. O declínio da mortalidade foi de 98,3%, com deslocamento da sazonalidade do verão para o outono. A mediana da idade elevou-se de 2 meses nos primeiros períodos para 3 meses no último. O resíduo de óbitos manteve-se entre filhos de mães de 20 a 29 anos e escolaridade < 8 anos. O risco relativo entre o distrito mais atingido e a taxa média do município diminuiu de 3,4 para 1,3 do primeiro para o segundo quinquênio dos anos 90. CONCLUSÃO: Nossos resultados apontam uma elevação da idade mais vulnerável e a provável mudança do agente mais frequentemente associado ao óbito por diarreia.
Resumo:
Objetivo: Analisar a tendência da mortalidade por diarreia entre menores de 5 anos, no município de Osasco (SP), entre 1980 e 2000. Métodos: Trata-se de estudo observacional com dois delineamentos. Um descritivo, que toma o indivíduo como unidade do estudo, e outro ecológico, analisando agregado populacional que incluiu análise de séries temporais. A fonte de dados foi o sistema de informação de mortalidade do Estado de São Paulo e censos de 1980, 1991 e 2000. Descreveu-se a variação sazonal e para a análise de tendência aplicaram-se modelos log lineares de regressão polinomiais, utilizando-se variáveis sociodemográficas da criança e da mãe. Foram analisadas a evolução de indicadores sociodemográficos do município de 1980 a 2000, as taxas médias de mortalidade por diarreia nos menores de 5 anos e seus diferenciais por distrito nos anos 90. Resultados: Dos 1.360 óbitos, 94,3 e 75,3% atingiram, respectivamente, menores de 1 ano e de 6 meses. O declínio da mortalidade foi de 98,3%, com deslocamento da sazonalidade do verão para o outono. A mediana da idade elevou-se de 2 meses nos primeiros períodos para 3 meses no último. O resíduo de óbitos manteve-se entre filhos de mães de 20 a 29 anos e escolaridade < 8 anos. O risco relativo entre o distrito mais atingido e a taxa média do município diminuiu de 3,4 para 1,3 do primeiro para o segundo quinquênio dos anos 90. Conclusão: Nossos resultados apontam uma elevação da idade mais vulnerável e a provável mudança do agente mais frequentemente associado ao óbito por diarreia
Resumo:
Capybaras were monitored weekly from 1998 to 2006 by counting individuals in three anthropogenic environments (mixed agricultural fields, forest and open areas) of southeastern Brazil in order to examine the possible influence of environmental variables (temperature, humidity, wind speed, precipitation and global radiation) on the detectability of this species. There was consistent seasonality in the number of capybaras in the study area, with a specific seasonal pattern in each area. Log-linear models were fitted to the sample counts of adult capybaras separately for each sampled area, with an allowance for monthly effects, time trends and the effects of environmental variables. Log-linear models containing effects for the months of the year and a quartic time trend were highly significant. The effects of environmental variables on sample counts were different in each type of environment. As environmental variables affect capybara detectability, they should be considered in future species survey/monitoring programs.
Resumo:
SETTING: Chronic obstructive pulmonary disease (COPD) is the third leading cause of death among adults in Brazil. OBJECTIVE: To evaluate the mortality and hospitalisation trends in Brazil caused by COPD during the period 1996-2008. DESIGN: We used the health official statistics system to obtain data about mortality (1996-2008) and morbidity (1998-2008) due to COPD and all respiratory diseases (tuberculosis: codes A15-16; lung cancer: code C34, and all diseases coded from J40 to 47 in the 10th Revision of the International Classification of Diseases) as the underlying cause, in persons aged 45-74 years. We used the Joinpoint Regression Program log-linear model using Poisson regression that creates a Monte Carlo permutation test to identify points where trend lines change significantly in magnitude/direction to verify peaks and trends. RESULTS: The annual per cent change in age-adjusted death rates due to COPD declined by 2.7% in men (95%CI -3.6 to -1.8) and -2.0% (95%CI -2.9 to -1.0) in women; and due to all respiratory causes it declined by -1.7% (95%CI 2.4 to -1.0) in men and -1.1% (95%CI -1.8 to -0.3) in women. Although hospitalisation rates for COPD are declining, the hospital admission fatality rate increased in both sexes. CONCLUSION: COPD is still a leading cause of mortality in Brazil despite the observed decline in the mortality/hospitalisation rates for both sexes.
Resumo:
Aims: The heterogeneity of the Brazilian population renders the extrapolation of pharmacogenomic data derived from well-defined ethnic groups inappropriate. We investigated the influence of self-reported `race/color`, geographical origin and genetic ancestry on the distribution of four VKORC1 SNPs and haplotypes in Brazilians. Comparative data were obtained from two major ancestral roots of Brazilians: Portuguese and Africans from former Portuguese colonies. Materials & methods: A total of 1037 healthy adults Brazilians, recruited at four different geographical regions and self identified as white, brown or black (race/color categories), 89 Portuguese and 216 Africans from Angola and Mozambique were genotyped for the VKORC1 3673G>A (rs9923231), 5808T>G (rs2884737), 6853G>C (rs8050894) and 9041G>A (rs7294) polymorphisms using TaqMan (R) (Applied Biosystems, CA, USA) assays. VKORC1 haplotypes were statistically inferred using the haplo.stats software. We inferred the statistical association between the distribution of the VKORC1 polymorphisms among Brazilians and self-reported color, geographical region and genetic ancestry by fitting multinomial log linear models via neural networks. Individual proportions of European and African ancestry were used to assess the impact of genetic admixture on the frequency distribution of VKORC1 polymorphisms among Brazilians, and for the comparison of Brazilians with Portuguese and Africans. Results: The frequency distribution of the 3673G>A and 5808T>G polymorphisms, and VKORC1 haplotypes among Brazilians varies across geographical regions, within self-reported color categories and according to the individual proportions of European and African genetic ancestry. Notably, the frequency of the warfarin sensitive VKORC1 3673A allele and the distribution of VKORC1 haplotypes varied continuously as the individual proportion of European ancestry increased in the entire cohort, independently of race/color categorization and geographical origin. Brazilians with more than 80% African ancestry differ significantly from Angolans and Mozambicans in frequency of the 3673G>A, 5808T>G and 6853G>C polymorphisms and haplotype distribution, whereas no such differences are observed between Brazilians with more than 90% European ancestry and Portuguese individuals. Conclusion: The diversity of the Brazilian population, evident in the distribution of VKORC1 polymorphisms, must be taken into account in the design of pharmacogenetic clinical trials and dealt with as a continuous variable. Warfarin dosing algorithms that include `race` terms defined for other populations are clearly not applicable to the heterogeneous and extensively admixed Brazilian population.
Resumo:
When missing data occur in studies designed to compare the accuracy of diagnostic tests, a common, though naive, practice is to base the comparison of sensitivity, specificity, as well as of positive and negative predictive values on some subset of the data that fits into methods implemented in standard statistical packages. Such methods are usually valid only under the strong missing completely at random (MCAR) assumption and may generate biased and less precise estimates. We review some models that use the dependence structure of the completely observed cases to incorporate the information of the partially categorized observations into the analysis and show how they may be fitted via a two-stage hybrid process involving maximum likelihood in the first stage and weighted least squares in the second. We indicate how computational subroutines written in R may be used to fit the proposed models and illustrate the different analysis strategies with observational data collected to compare the accuracy of three distinct non-invasive diagnostic methods for endometriosis. The results indicate that even when the MCAR assumption is plausible, the naive partial analyses should be avoided.
Resumo:
We review some issues related to the implications of different missing data mechanisms on statistical inference for contingency tables and consider simulation studies to compare the results obtained under such models to those where the units with missing data are disregarded. We confirm that although, in general, analyses under the correct missing at random and missing completely at random models are more efficient even for small sample sizes, there are exceptions where they may not improve the results obtained by ignoring the partially classified data. We show that under the missing not at random (MNAR) model, estimates on the boundary of the parameter space as well as lack of identifiability of the parameters of saturated models may be associated with undesirable asymptotic properties of maximum likelihood estimators and likelihood ratio tests; even in standard cases the bias of the estimators may be low only for very large samples. We also show that the probability of a boundary solution obtained under the correct MNAR model may be large even for large samples and that, consequently, we may not always conclude that a MNAR model is misspecified because the estimate is on the boundary of the parameter space.
Resumo:
Birnbaum-Saunders models have largely been applied in material fatigue studies and reliability analyses to relate the total time until failure with some type of cumulative damage. In many problems related to the medical field, such as chronic cardiac diseases and different types of cancer, a cumulative damage caused by several risk factors might cause some degradation that leads to a fatigue process. In these cases, BS models can be suitable for describing the propagation lifetime. However, since the cumulative damage is assumed to be normally distributed in the BS distribution, the parameter estimates from this model can be sensitive to outlying observations. In order to attenuate this influence, we present in this paper BS models, in which a Student-t distribution is assumed to explain the cumulative damage. In particular, we show that the maximum likelihood estimates of the Student-t log-BS models attribute smaller weights to outlying observations, which produce robust parameter estimates. Also, some inferential results are presented. In addition, based on local influence and deviance component and martingale-type residuals, a diagnostics analysis is derived. Finally, a motivating example from the medical field is analyzed using log-BS regression models. Since the parameter estimates appear to be very sensitive to outlying and influential observations, the Student-t log-BS regression model should attenuate such influences. The model checking methodologies developed in this paper are used to compare the fitted models.
Resumo:
We introduce, for the first time, a new class of Birnbaum-Saunders nonlinear regression models potentially useful in lifetime data analysis. The class generalizes the regression model described by Rieck and Nedelman [Rieck, J.R., Nedelman, J.R., 1991. A log-linear model for the Birnbaum-Saunders distribution. Technometrics 33, 51-60]. We discuss maximum-likelihood estimation for the parameters of the model, and derive closed-form expressions for the second-order biases of these estimates. Our formulae are easily computed as ordinary linear regressions and are then used to define bias corrected maximum-likelihood estimates. Some simulation results show that the bias correction scheme yields nearly unbiased estimates without increasing the mean squared errors. Two empirical applications are analysed and discussed. Crown Copyright (C) 2009 Published by Elsevier B.V. All rights reserved.
Resumo:
Este estudo teve como objetivo desenvolver modelos preditores de fitomassa epigéa da vegetação arbórea da Floresta Baixa de Restinga. Foram selecionadas 102 árvores de 29 espécies ocorrentes na área de estudo e 102 indivíduos de jerivá (Syagrus romanzoffiana (Cham.) Glassman), distribuídos proporcionalmente entre as classes de diâmetro da vegetação arbórea. As árvores foram cortadas, ao nível do solo e foram medidos a altura total e o diâmetro à altura do peito (DAP) de cada árvore. As folhas foram separadas do lenho e a massa fresca da porção lenhosa e foliar medidas separadamente. Amostras de cada fração foram secas a 70 °C, até peso constante, para determinação da massa seca das árvores. Os modelos foram desenvolvidos através de análise de regressão linear, sendo a variável dependente a massa seca (MS) das árvores e as variáveis independentes a altura (h), o diâmetro a altura do peito (d) e as relações d² h e d² h multiplicada pela densidade da madeira (ρ d² h). Os modelos desenvolvidos indicam que o diâmetro explica grande parte da variabilidade da fitomassa das árvores da restinga e a altura é a variável explanatória da equação específica para o jerivá. Os modelos selecionados foram: ln MS (kg) = -1,352 + 2,009 ln d (R² = 0,96; s yx = 0,34) para a comunidade vegetal sem jerivá, ln MS (kg) = -2,052 + 0,801 ln d² h (R² = 0,94; s yx = 0,38) para a comunidade incluindo o jerivá, e ln MS (kg) = -0,884 + 2,40 ln h (R² = 0,92; s yx = 0,49) para o jerivá.
Resumo:
Com objetivo de estimar parâmetros genéticos e estudar a utilização de diferentes efeitos em avaliações genéticas para idade ao primeiro parto (IPP) por diferentes modelos, foram utilizados registros de IPP de animais da raça Nelore, nascidos entre os anos de 1990 e 2005. Foram considerados os seguintes modelos (M): M1, incluindo o efeito fixo de GC1 (constituído pelos animais nascidos na mesma fazenda e ano), além da covariável, peso aos 365 dias de idade (efeito linear e quadrático), totalizando 24.263 registros de IPP; M2, considerando os efeitos fixos de GC1, ano e estação de parição, totalizando 59.792 registros de IPP e M3, incluindo os efeitos fixos de GC2 (agrupando os animais nascidos na mesma fazenda, ano e que conceberam no mesmo manejo reprodutivo), ano e estação de parição, totalizando 59.792 registros de IPP. As estimativas dos componentes de variância e herdabilidade e os valores genéticos (VG) foram obtidos pelo método da máxima verossimilhança restrita, com a inclusão da matriz de parentesco disponível. As diferenças esperadas na progênie (DEPs) foram obtidas dividindo os VG por dois. Após a obtenção desses resultados, foram realizadas correlações entre os VG e o ranqueamento das DEPs dos reprodutores para IPP, utilizando-se o procedimento PROC CORR (SAS, 2003). Ao se considerar o ano e a estação de parto nos modelos de análise (M2 e M3), esses produziram um maior R², indicando que tais modelos conseguiram explicar, em maior grau, as diferenças existentes entre os animais para IPP. As herdabilidades estimadas foram de baixa magnitude (0,14 e 0,15). As correlações entre os VG obtidas por diferentes modelos foram 0,73 (M1 x M2); 0,91 (M2 x M3) e 0,66 (M1 x M3).
Resumo:
The aim of this study was to establish a digital elevation model and its horizontal resolution to interpolate the annual air temperature for the Alagoas State by means of multiple linear regression models. A multiple linear regression model was adjusted to series (11 to 34 years) of annual air temperatures obtained from 28 weather stations in the states of Alagoas, Bahia, Pernambuco and Sergipe, in the Northeast of Brazil, in function of latitude, longitude and altitude. The elevation models SRTM and GTOPO30 were used in the analysis, with original resolutions of 90 and 900 m, respectively. The SRTM was resampled for horizontal resolutions of 125, 250, 500, 750 and 900 m. For spatializing the annual mean air temperature for the state of Alagoas, a multiple linear regression model was used for each elevation and spatial resolution on a grid of the latitude and longitude. In Alagoas, estimates based on SRTM data resulted in a standard error of estimate (0.57 degrees C) and dispersion (r(2) = 0.62) lower than those obtained from GTOPO30 (0.93 degrees C and 0.20). In terms of SRTM resolutions, no significant differences were observed between the standard error (0.55 degrees C; 750 m - 0.58 degrees C; 250m) and dispersion (0.60; 500 m - 0.65; 750 m) estimates. The spatialization of annual air temperature in Alagoas, via multiple regression models applied to SRTM data showed higher concordance than that obtained with the GTOPO30, independent of the spatial resolution.
Resumo:
In this work, the rheological behavior of block copolymers with different morphologies (lamellar, cylindrical, spherical, and disordered) and their clay-containing nanocomposites was studied using small amplitude oscillatory shear. The copolymers studied were one asymmetric starblock styrene-butadiene-styrene copolymer and four styrene-ethylene/butylenes-styrene copolymers with different molecular architectures, one of them being modified with maleic anhydride. The nanocomposites of those copolymers were prepared by adding organophilic clay using three different preparation techniques: melt mixing, solution casting, and a hybrid melt mixing-solution technique. The nanocomposites were characterized by X-ray diffraction and transmission electron microscopy, and their viscoelastic properties were evaluated and compared to the ones of the pure copolymers. The influence of copolymer morphology and presence of clay on the storage modulus (G`) curves was studied by the evaluation of the changes in the low frequency slope of log G` x log omega (omega: frequency) curves upon variation of temperature and clay addition. This slope may be related to the degree of liquid- or solid-like behavior of a material. It was observed that at temperatures corresponding to the ordered state, the rheological behavior of the nanocomposites depended mainly on the viscoelasticity of each type of ordered phase and the variation of the slope due to the addition of clay was small. For temperatures corresponding to the disordered state, however, the rheological behavior of the copolymer nanocomposites was dictated mostly by the degree of clay dispersion: When the clay was well dispersed, a strong solid-like behavior corresponding to large G` slope variations was observed.
Resumo:
In this paper, we compare three residuals to assess departures from the error assumptions as well as to detect outlying observations in log-Burr XII regression models with censored observations. These residuals can also be used for the log-logistic regression model, which is a special case of the log-Burr XII regression model. For different parameter settings, sample sizes and censoring percentages, various simulation studies are performed and the empirical distribution of each residual is displayed and compared with the standard normal distribution. These studies suggest that the residual analysis usually performed in normal linear regression models can be straightforwardly extended to the modified martingale-type residual in log-Burr XII regression models with censored data.