915 resultados para Logistic regression model
Resumo:
This paper deals with asymptotic results on a multivariate ultrastructural errors-in-variables regression model with equation errors Sufficient conditions for attaining consistent estimators for model parameters are presented Asymptotic distributions for the line regression estimators are derived Applications to the elliptical class of distributions with two error assumptions are presented The model generalizes previous results aimed at univariate scenarios (C) 2010 Elsevier Inc All rights reserved
Resumo:
When missing data occur in studies designed to compare the accuracy of diagnostic tests, a common, though naive, practice is to base the comparison of sensitivity, specificity, as well as of positive and negative predictive values on some subset of the data that fits into methods implemented in standard statistical packages. Such methods are usually valid only under the strong missing completely at random (MCAR) assumption and may generate biased and less precise estimates. We review some models that use the dependence structure of the completely observed cases to incorporate the information of the partially categorized observations into the analysis and show how they may be fitted via a two-stage hybrid process involving maximum likelihood in the first stage and weighted least squares in the second. We indicate how computational subroutines written in R may be used to fit the proposed models and illustrate the different analysis strategies with observational data collected to compare the accuracy of three distinct non-invasive diagnostic methods for endometriosis. The results indicate that even when the MCAR assumption is plausible, the naive partial analyses should be avoided.
Resumo:
The main object of this paper is to discuss the Bayes estimation of the regression coefficients in the elliptically distributed simple regression model with measurement errors. The posterior distribution for the line parameters is obtained in a closed form, considering the following: the ratio of the error variances is known, informative prior distribution for the error variance, and non-informative prior distributions for the regression coefficients and for the incidental parameters. We proved that the posterior distribution of the regression coefficients has at most two real modes. Situations with a single mode are more likely than those with two modes, especially in large samples. The precision of the modal estimators is studied by deriving the Hessian matrix, which although complicated can be computed numerically. The posterior mean is estimated by using the Gibbs sampling algorithm and approximations by normal distributions. The results are applied to a real data set and connections with results in the literature are reported. (C) 2011 Elsevier B.V. All rights reserved.
Resumo:
We review several asymmetrical links for binary regression models and present a unified approach for two skew-probit links proposed in the literature. Moreover, under skew-probit link, conditions for the existence of the ML estimators and the posterior distribution under improper priors are established. The framework proposed here considers two sets of latent variables which are helpful to implement the Bayesian MCMC approach. A simulation study to criteria for models comparison is conducted and two applications are made. Using different Bayesian criteria we show that, for these data sets, the skew-probit links are better than alternative links proposed in the literature.
Resumo:
Regression models for the mean quality-adjusted survival time are specified from hazard functions of transitions between two states and the mean quality-adjusted survival time may be a complex function of covariates. We discuss a regression model for the mean quality-adjusted survival (QAS) time based on pseudo-observations, which has the advantage of directly modeling the effect of covariates in the QAS time. Both Monte Carlo Simulations and a real data set are studied. Copyright (C) 2009 John Wiley & Sons, Ltd.
Resumo:
Birnbaum-Saunders models have largely been applied in material fatigue studies and reliability analyses to relate the total time until failure with some type of cumulative damage. In many problems related to the medical field, such as chronic cardiac diseases and different types of cancer, a cumulative damage caused by several risk factors might cause some degradation that leads to a fatigue process. In these cases, BS models can be suitable for describing the propagation lifetime. However, since the cumulative damage is assumed to be normally distributed in the BS distribution, the parameter estimates from this model can be sensitive to outlying observations. In order to attenuate this influence, we present in this paper BS models, in which a Student-t distribution is assumed to explain the cumulative damage. In particular, we show that the maximum likelihood estimates of the Student-t log-BS models attribute smaller weights to outlying observations, which produce robust parameter estimates. Also, some inferential results are presented. In addition, based on local influence and deviance component and martingale-type residuals, a diagnostics analysis is derived. Finally, a motivating example from the medical field is analyzed using log-BS regression models. Since the parameter estimates appear to be very sensitive to outlying and influential observations, the Student-t log-BS regression model should attenuate such influences. The model checking methodologies developed in this paper are used to compare the fitted models.
Resumo:
The Birnbaum-Saunders regression model is commonly used in reliability studies. We derive a simple matrix formula for second-order covariances of maximum-likelihood estimators in this class of models. The formula is quite suitable for computer implementation, since it involves only simple operations on matrices and vectors. Some simulation results show that the second-order covariances can be quite pronounced in small to moderate sample sizes. We also present empirical applications.
Resumo:
The main purpose of this work is to study the behaviour of Skovgaard`s [Skovgaard, I.M., 2001. Likelihood asymptotics. Scandinavian journal of Statistics 28, 3-32] adjusted likelihood ratio statistic in testing simple hypothesis in a new class of regression models proposed here. The proposed class of regression models considers Dirichlet distributed observations, and the parameters that index the Dirichlet distributions are related to covariates and unknown regression coefficients. This class is useful for modelling data consisting of multivariate positive observations summing to one and generalizes the beta regression model described in Vasconcellos and Cribari-Neto [Vasconcellos, K.L.P., Cribari-Neto, F., 2005. Improved maximum likelihood estimation in a new class of beta regression models. Brazilian journal of Probability and Statistics 19,13-31]. We show that, for our model, Skovgaard`s adjusted likelihood ratio statistics have a simple compact form that can be easily implemented in standard statistical software. The adjusted statistic is approximately chi-squared distributed with a high degree of accuracy. Some numerical simulations show that the modified test is more reliable in finite samples than the usual likelihood ratio procedure. An empirical application is also presented and discussed. (C) 2009 Elsevier B.V. All rights reserved.
Resumo:
We introduce, for the first time, a new class of Birnbaum-Saunders nonlinear regression models potentially useful in lifetime data analysis. The class generalizes the regression model described by Rieck and Nedelman [Rieck, J.R., Nedelman, J.R., 1991. A log-linear model for the Birnbaum-Saunders distribution. Technometrics 33, 51-60]. We discuss maximum-likelihood estimation for the parameters of the model, and derive closed-form expressions for the second-order biases of these estimates. Our formulae are easily computed as ordinary linear regressions and are then used to define bias corrected maximum-likelihood estimates. Some simulation results show that the bias correction scheme yields nearly unbiased estimates without increasing the mean squared errors. Two empirical applications are analysed and discussed. Crown Copyright (C) 2009 Published by Elsevier B.V. All rights reserved.
Resumo:
In this paper, we study the influence of the National Telecom Business Volume by the data in 2008 that have been published in China Statistical Yearbook of Statistics. We illustrate the procedure of modeling “National Telecom Business Volume” on the following eight variables, GDP, Consumption Levels, Retail Sales of Social Consumer Goods Total Renovation Investment, the Local Telephone Exchange Capacity, Mobile Telephone Exchange Capacity, Mobile Phone End Users, and the Local Telephone End Users. The testing of heteroscedasticity and multicollinearity for model evaluation is included. We also consider AIC and BIC criterion to select independent variables, and conclude the result of the factors which are the optimal regression model for the amount of telecommunications business and the relation between independent variables and dependent variable. Based on the final results, we propose several recommendations about how to improve telecommunication services and promote the economic development.
Resumo:
BACKGROUND: People who have suffered a stroke commonly report unfulfilled need for rehabilitation. Using a model of patient satisfaction, we examined characteristics in individuals that at 3 months after stroke predicted, or at 12 months were associated with unmet need for rehabilitation or dissatisfaction with health care services at 12 months after stroke. METHODS: The participants (n = 175) received care at the stroke units at the Karolinska University Hospital, Sweden. The dependent variables "unfulfilled needs for rehabilitation" and "dissatisfaction with care" were collected using a questionnaire. Stroke severity, domains of the Stroke Impact Scale (SIS), the Sense of Coherence scale (SOC) and socio demographic factors were used as independent variables in four logistic regression analyses. RESULTS: Unfulfilled needs for rehabilitation at 12 months were predicted by strength (SIS) (odds ratio (OR) 7.05) at three months, and associated with hand function (SIS) (OR 4.38) and poor self-rated recovery (SIS) (OR 2.46) at 12 months. Dissatisfaction with care was predicted by SOC (OR 4.18) and participation (SIS) (OR 3.78), and associated with SOC (OR 3.63) and strength (SIS) (OR 3.08). CONCLUSIONS: Thirty-three percent of the participants reported unmet needs for rehabilitation and fourteen percent were dissatisfied with the care received. In order to attend to rehabilitation needs when they arise, rehabilitation services may need to be more flexible in terms of when rehabilitation is provided. Long term services with scheduled re-assessments and with more emphasis on understanding the experiences of both the patients and their social networks might better be able to provide services that attend to patients' needs and aid peoples' reorientation; this would apply particularly to those with poor coping capacity.
Resumo:
Introdução: A retinopatia diabética (RD) é a principal causa de novos casos de cegueira entre norte-americanos em idade produtiva. Existe uma associação entre RD e as outras complicações microvasculares do diabete melito. A associação da RD com a fase inicial da nefropatia, a microalbuminúria, não está esclarecida em pacientes com diabete melito (DM) tipo 2. Polimorfismos de genes (ENNP1; FABP2) relacionados à resistência insulínica, entre outros, poderiam estar associados à RD. Objetivo: O objetivo deste estudo foi avaliar fatores genéticos e não genéticos associados à RD avançada em pacientes com DM tipo 2. Métodos: Neste estudo caso-controle foram incluídos pacientes DM tipo 2 submetidos à avaliação clínica, laboratorial e oftalmológica. Foi realizada oftalmoscopia binocular indireta sob midríase e obtidas retinografias coloridas em 7 campos padronizados. Foram classificados como casos os pacientes portadores de RD avançada (formas graves de RD não proliferativa e RD proliferativa) e como controles os pacientes sem RD avançada (fundoscopia normal, e outras formas de RD). Foram estudados os polimorfismos K121Q do gene ENNP1 e A54T do gene FABP2. Na análise estatística foram utilizados testes paramétricos e não paramétricos conforme indicado. Foi realizada análise de regressão logística múltipla para avaliar fatores associados à RD avançada. O nível de significância adotado foi de 0,05%. Resultados: Foram avaliados 240 pacientes com DM tipo 2 com 60,6 ± 8,4 anos de idade e duração conhecida de DM de 14,4 ± 8,4 anos. Destes, 67 pacientes (27,9%) apresentavam RD avançada. Os pacientes com RD avançada apresentaram maior duração conhecida de DM (18,1 ± 8,1 vs. 12,9 ± 8,2 anos; P< 0,001), menor índice de massa corporal (IMC) (27,5 ± 4,2 vs. 29,0 ± 9,6 kg/m2; P= 0,019), além de uso de insulina mais freqüente (70,8% vs 35,3%; P< 0,001) e presença de nefropatia diabética (81,1% vs 34,8%; P< 0,001) quando comparados com os pacientes sem RD avançada. Na avaliação laboratorial os pacientes com RD avançada apresentaram valores mais elevados de creatinina sérica [1,4 (0,6 -13,6) vs 0,8 (0,5-17,9) mg/dl; P<0,001] e de albuminúria [135,0 (3,6-1816,0) vs 11,3 (1,5-5105,0) μg/min; P<0,001] quando comparados com pacientes sem RD avançada. A distribuição dos genótipos dos polimorfismos do ENNP1 e FABP2 não foi diferente entre os grupos. A análise de regressão logística múltipla demonstrou que a presença de nefropatia (OR=6,59; IC95%: 3,01-14,41; P<0,001) e o uso de insulina (OR=3,47; IC95%: 1,60- 7,50; P=0,002) foram os fatores associados à RD avançada, ajustados para a duração de DM, presença de hipertensão arterial, glicohemoglobina e IMC. Quando na análise foram incluídos apenas pacientes normoalbuminúricos e microalbuminúricos, a microalbuminúria (OR=3,8; IC95%: 1,38-10,47; P=0,010), o uso de insulina (OR=5,04; IC95%: 1,67-15,21; P=0,004), a duração do DM (OR=1,06 IC95%: 1,00-1,13; P=0,048) e a glicohemoglobina (OR=1,35; IC95%: 1,02-1,79; P=0,034) foram os fatores associados à RD avançada, ajustados para a presença de hipertensão arterial e IMC. Conclusão: Pacientes com DM tipo 2 portadores de formas avançadas de RD apresentam mais freqüentemente envolvimento renal pelo DM, incluindo o estágio de microalbuminúria. Uma avaliação renal com medida de albuminúria dever ser incorporada como avaliação de rotina nestes pacientes.
Resumo:
Esta pesquisa tem o objetivo de identificar as variáveis e sua influência na propensão à aquisição de crédito pessoal, propondo um modelo estatístico de propensão ao financiamento por cartões de crédito híbridos para maximização de contratação de crédito e otimização dos esforços de marketing. O estudo descritivo pode gerar insights para a compreensão da expansão do crédito ao consumo, sobretudo num contexto de escassez de opções de financiamento e limitação no canal de distribuição. Foram usados dados de uma base de clientes de uma instituição financeira com variáveis sócio demográficas e transacionai, e o modelo matemático foi seguido da validação de sua capacidade preditiva.
Resumo:
The dyslipidemia and excess weight in adolescents, when combined, suggest a progression of risk factors for cardiovascular disease (CVD). Besides these, the dietary habits and lifestyle have also been considered unsuitable impacting the development of chronic diseases. The study objectives were: (1) estimate the prevalence of lipid profile and correlate with body mass index (BMI), waist circumference (WC) and waist / height ratio (WHR) in adolescents, considering the maturation sexual, (2) know the sources of variance in the diet and the number of days needed to estimate the usual diet of adolescents and (3) describe the dietary patterns and lifestyle of adolescents, family history of CVD and age correlates them with the patterns of risk for CVD, adjusted for sexual maturation. A cross-sectional study was performed with 432 adolescents, aged 10-19 years from public schools of the Natal city, Brazil. The dyslipidemias were evaluated considering the lipid profile, the index of I Castelli (TC / HDL) and II (LDL / HDL) and non-HDL cholesterol. Anthropometric indicators were BMI, WC and WHR. The intake of energy, nutrients including fiber, fatty acids and cholesterol was estimated from two 24-hour recalls (24HR). The variables of lipid profile, anthropometric and clinical data were used in the models of Pearson correlation and linear regression, considering the sexual maturation. The variance ratio of the diet was calculated from the component-person variance, determined by analysis of variance (ANOVA). The definition of the number of days to estimate the usual intake of each nutrient was obtained by taking the hypothetical correlation (r) ≥ 0.9, between nutrient intake and the true observed. We used the principal component analysis as a method of extracting factors that 129 accounted for the dependent variables and known cardiovascular risk obtained from the lipid profile, the index for Castelli I and II, non-HDL cholesterol, BMI, and WC the WHR. Dietary patterns and lifestyle were obtained from the independent variables, based on nutrients consumed and physical activity weekly. In the study of principal component analysis (PCA) was investigated associations between the patterns of cardiovascular risk factors in dietary patterns and lifestyle, age and positive family history of CVD, through bivariate and multiple logistic regression adjusted for sexual maturation. The low HDL-C dyslipidemia was most prevalent (50.5%) for adolescents. Significant correlations were observed between hypercholesterolemia and positive family history of CVD (r = 0.19, p <0.01) and hypertriglyceridemia with BMI (r = 0.30, p <0.01), with the CC (r = 0.32, p <0.01) and WHR (r = 0.33, p <0.01). The linear model constructed with sexual maturation, age and BMI explained about 1 to 10.4% of the variation in the lipid profile. The sources of variance between individuals were greater for all nutrients in both sexes. The reasons for variances were 1 for all nutrients were higher in females. The results suggest that to assess the diet of adolescents with greater precision, 2 days would be enough to R24h consumption of energy, carbohydrates, fiber, saturated and monounsaturated fatty acids. In contrast, 3 days would be recommended for protein, lipid, polyunsaturated fatty acids and cholesterol. Two cardiovascular risk factors as have been extracted in the ACP, referring to the dependent variables: the standard lipid profile (HDL-C and non-HDL cholesterol) and "standard anthropometric index (BMI, WC, WHR) with a power explaining 75% of the variance of the original data. The factors are representative of two independent variables led to dietary patterns, "pattern 130 western diet" and "pattern protein diet", and one on the lifestyle, "pattern energy balance". Together, these patterns provide an explanation power of 67%. Made adjustment for sexual maturation in males remained significant variables: the associations between puberty and be pattern anthropometric indicator (OR = 3.32, CI 1.34 to 8.17%), and between family history of CVD and the pattern lipid profile (OR = 2.62, CI 1.20 to 5.72%). In females adolescents, associations were identified between age after the first stage of puberty with anthropometric pattern (OR = 3.59, CI 1.58 to 8.17%) and lipid profile (OR = 0.33, CI 0.15 to 0.75%). Conclusions: The low HDL-C was the most prevalent dyslipidemia independent of sex and nutritional status of adolescents. Hypercholesterolemia was influenced by family history of CVD and sexual maturation, in turn, hypertriglyceridemia was closely associated with anthropometric indicators. The variance between the diets was greater for all nutrients. This fact reflected in a variance ratio less than 1 and consequently in a lower number of days requerid to estimate the usual diet of adolescents considering gender. The two dietary patterns were extracted and the pattern considered unhealthy lifestyle as healthy. The associations were found between the patterns of CVD risk with age and family history of CVD in the studied adolescents
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)