889 results for Heterogeneous regression
Abstract:
Logistic regression is one of the most important tools in the analysis of epidemiological and clinical data. Such data often contain missing values for one or more variables. Common practice is to eliminate all individuals for whom any information is missing. This deletion approach does not make efficient use of the available information and often introduces bias. Two methods were developed to estimate logistic regression coefficients for mixed dichotomous and continuous covariates, including partially observed binary covariates. The data were assumed to be missing at random (MAR). The first method (PD) used the predictive distribution as a weight to average the logistic regressions fitted over all possible values of the missing observations; the second method (RS) used a variant of a resampling technique. Seven additional methods were compared with these two approaches in a simulation study: (1) analysis based on only the complete cases; (2) substituting the mean of the observed values for the missing value; (3) an imputation technique based on the proportions of observed data; (4) regressing the partially observed covariates on the remaining continuous covariates; (5) regressing the partially observed covariates on the remaining continuous covariates conditional on the response variable; (6) regressing the partially observed covariates on the remaining continuous covariates and the response variable; and (7) the EM algorithm. Both proposed methods showed smaller standard errors (s.e.) for the coefficient involving the partially observed covariate and for the other coefficients as well. However, both methods, especially PD, are computationally demanding; thus, for the analysis of large data sets with partially observed covariates, further refinement of these approaches is needed.
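The idea of weighting each possible value of a missing binary covariate can be illustrated with a minimal sketch. This is not the authors' PD method, whose details the abstract does not give. It assumes that each incomplete row is replaced by two rows (covariate set to 1 and to 0), weighted by an assumed predictive probability, and that this probability is simply the observed proportion of ones within the same outcome group. The function names `fit_weighted_logistic` and `expand_missing_binary`, and the simulated data, are hypothetical.

```python
import numpy as np

def fit_weighted_logistic(X, y, w, n_iter=50, tol=1e-10):
    """Weighted logistic regression by Newton-Raphson; X must include an intercept column."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        grad = X.T @ (w * (y - p))                      # weighted score vector
        info = (X * (w * p * (1.0 - p))[:, None]).T @ X  # weighted information matrix
        step = np.linalg.solve(info, grad)
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta

def expand_missing_binary(z, x_bin, y, p_one):
    """Replace each row with a missing (NaN) binary covariate by two weighted rows.

    p_one is the assumed predictive probability that the missing value equals 1
    (a hypothetical input; how it is estimated is a modelling choice).
    """
    rows, resp, wts = [], [], []
    for zi, xi, yi, pi in zip(z, x_bin, y, p_one):
        if np.isnan(xi):
            rows += [[1.0, zi, 1.0], [1.0, zi, 0.0]]
            resp += [yi, yi]
            wts += [pi, 1.0 - pi]
        else:
            rows.append([1.0, zi, xi])
            resp.append(yi)
            wts.append(1.0)
    return np.array(rows), np.array(resp), np.array(wts)

# Simulated data (assumption): one continuous and one binary covariate.
rng = np.random.default_rng(0)
n = 400
z = rng.normal(size=n)
x = (rng.random(n) < 0.4).astype(float)
eta = -0.5 + 0.8 * z + 1.0 * x
y = (rng.random(n) < 1.0 / (1.0 + np.exp(-eta))).astype(float)

# Delete 25% of the binary covariate completely at random (a special case of MAR).
x_obs = x.copy()
x_obs[rng.random(n) < 0.25] = np.nan

# Assumed predictive weight: observed proportion of x = 1 within each outcome group.
seen = ~np.isnan(x_obs)
p_by_y = {v: x_obs[seen & (y == v)].mean() for v in (0.0, 1.0)}
p_one = np.array([p_by_y[v] for v in y])

X_aug, y_aug, w_aug = expand_missing_binary(z, x_obs, y, p_one)
beta_weighted = fit_weighted_logistic(X_aug, y_aug, w_aug)

# Complete-case fit for comparison (method 1 in the list above).
X_cc = np.column_stack([np.ones(seen.sum()), z[seen], x_obs[seen]])
beta_cc = fit_weighted_logistic(X_cc, y[seen], np.ones(seen.sum()))
```

Because the two weights for each incomplete row sum to one, every individual still contributes a total weight of one, so no observation is discarded as it would be under complete-case deletion.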
Abstract:
This study examines Hispanic levels of incorporation and access to health care. Applying the Aday and Andersen framework for the study of access, the study examined the relationship between two levels of Hispanic incorporation into U.S. society, i.e., mainstream versus ethnic, and potential and realized measures of access to health care. Data for the study were drawn from a 1992 telephone survey of 600 randomly selected Hispanics in Houston and Harris County. The hypotheses tested were: (1) Hispanics who are incorporated into mainstream society are more likely to have better potential and realized access to health care than those who are incorporated into ethnic-group enclaves, regardless of their socioeconomic status (SES), health status, and health needs; and (2) there is no interaction between the levels of incorporation (mainstream or ethnic) and SES, health status, and health needs in predicting potential and realized access. The data analysis supported Hypothesis One for the two measures of potential access. The results of bivariate and multiple logistic regression analyses indicated that, for Hispanics in Houston and Harris County, being in the "mainstream" incorporation category increased their potential access to care, measured as having "health insurance" and a "regular place of care". For the selected measure of realized access, having a "regular check-up", the analysis did not show statistically significant differences between Hispanics in the ethnic and mainstream incorporation categories. Hypothesis Two, that there is no interaction between the levels of incorporation and socioeconomic characteristics, health status, and health needs in predicting potential and realized access among Hispanics, was supported by the data.
The results of the logistic regression analysis showed that, after adjusting for socioeconomic status, health status, and health needs, the association between "level of incorporation" and the two measures of potential access ("health insurance" and having a "usual place of care") was not modified by the control variables or by their interaction with level of incorporation. That is, the effect of incorporation on Hispanics' health insurance coverage and on having a usual place of care was homogeneous across Hispanics with different SES and health status. The main research implication of this dissertation is the employment of a theoretical framework for the assessment of cultural factors essential to research on migrating heterogeneous subpopulations. It also provided strategies to solve practical and methodological difficulties in the secondary analysis of data on these populations.
Abstract:
A large number of ridge regression estimators have been proposed and used with little knowledge of their true distributions. Because of this lack of knowledge, these estimators cannot be used to test hypotheses or to form confidence intervals. This paper presents a basic technique for deriving the exact distribution functions for a class of generalized ridge estimators. The technique is applied to five prominent generalized ridge estimators. Graphs of the resulting distribution functions are presented. The actual behavior of these estimators is found to be considerably different from the behavior that is generally assumed for ridge estimators. This paper also uses the derived distributions to examine the mean squared error properties of the estimators. A technique for developing confidence intervals based on the generalized ridge estimators is also presented.
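For orientation, the standard textbook form of a generalized ridge estimator is sketched below. The abstract does not say which five estimators were studied, so the notation here ($Q$, $\Lambda$, $Z$, $\alpha$, $K$) is an assumption taken from the usual canonical-form presentation, not from the paper itself.

```latex
% Linear model in canonical form (assumed notation):
%   y = X\beta + \varepsilon, \quad X'X = Q \Lambda Q', \quad Z = XQ, \quad \alpha = Q'\beta,
%   \Lambda = \mathrm{diag}(\lambda_1, \dots, \lambda_p).
\begin{align}
  \hat{\alpha}(K) &= (\Lambda + K)^{-1} Z'y,
    \qquad K = \mathrm{diag}(k_1, \dots, k_p), \quad k_i \ge 0, \\
  \hat{\alpha}_i(K) &= \frac{\lambda_i}{\lambda_i + k_i}\, \hat{\alpha}_i^{\mathrm{OLS}},
    \qquad i = 1, \dots, p, \\
  \mathrm{MSE}\bigl(\hat{\alpha}(K)\bigr)
    &= \sigma^2 \sum_{i=1}^{p} \frac{\lambda_i}{(\lambda_i + k_i)^2}
     + \sum_{i=1}^{p} \frac{k_i^2 \alpha_i^2}{(\lambda_i + k_i)^2}
    \qquad \text{(for fixed, non-random } K\text{)}.
\end{align}
```

When the $k_i$ are fixed constants, the estimator is linear in $y$ and therefore normally distributed under normal errors. In practice the $k_i$ are usually estimated from the same data, which makes the estimator a nonlinear function of $y$; this is one reason the exact distribution can differ from what is commonly assumed.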
Abstract:
The history of the logistic function since its introduction in 1838 is reviewed, and the logistic model for a polychotomous response variable is presented with a discussion of the assumptions involved in its derivation and use. Following this, the maximum likelihood estimators for the model parameters are derived along with a Newton-Raphson iterative procedure for evaluation. A rigorous mathematical derivation of the limiting distribution of the maximum likelihood estimators is then presented using a characteristic function approach. An appendix with theorems on the asymptotic normality of sample sums when the observations are not identically distributed, with proofs, supports the presentation on asymptotic properties of the maximum likelihood estimators. Finally, two applications of the model are presented using data from the Hypertension Detection and Follow-up Program, a prospective, population-based, randomized trial of treatment for hypertension. The first application compares the risk of five-year mortality from cardiovascular causes with that from noncardiovascular causes; the second application compares risk factors for fatal or nonfatal coronary heart disease with those for fatal or nonfatal stroke.
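A Newton-Raphson fit of a polychotomous logistic model can be sketched as follows. This is a minimal illustration and not the dissertation's own derivation: it assumes the common baseline-category parameterization, in which category 0 is the reference and P(Y = j | x) = exp(x'b_j) / (1 + sum_k exp(x'b_k)). The function name `fit_polychotomous_logit` and the simulated data are hypothetical.

```python
import numpy as np

def fit_polychotomous_logit(X, y, n_classes, n_iter=50, tol=1e-8):
    """Baseline-category logit fitted by Newton-Raphson.

    X: (n, p) design matrix including an intercept column.
    y: integer labels 0..n_classes-1; class 0 is the reference category.
    Returns an (n_classes - 1, p) array of coefficients.
    """
    n, p = X.shape
    m = n_classes - 1
    # Indicator matrix for the non-reference categories.
    Y = np.column_stack([(y == j).astype(float) for j in range(1, n_classes)])
    beta = np.zeros(m * p)
    for _ in range(n_iter):
        B = beta.reshape(m, p)
        expo = np.exp(X @ B.T)                               # (n, m)
        P = expo / (1.0 + expo.sum(axis=1, keepdims=True))   # fitted probabilities
        grad = ((Y - P).T @ X).ravel()                       # score, same layout as beta
        # Information matrix: block (j, k) is X' diag(P_j (delta_jk - P_k)) X.
        info = np.zeros((m * p, m * p))
        for j in range(m):
            for k in range(m):
                w = P[:, j] * ((j == k) - P[:, k])
                info[j * p:(j + 1) * p, k * p:(k + 1) * p] = (X * w[:, None]).T @ X
        step = np.linalg.solve(info, grad)                   # Newton-Raphson update
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta.reshape(m, p)

# Simulated three-category data (assumption, for illustration only).
rng = np.random.default_rng(1)
n = 600
X = np.column_stack([np.ones(n), rng.normal(size=n)])
B_true = np.array([[0.5, 1.0], [-0.5, -1.0]])
expo_true = np.exp(X @ B_true.T)
probs = np.column_stack([np.ones(n), expo_true]) / (1.0 + expo_true.sum(axis=1, keepdims=True))
u = rng.random(n)
# Inverse-CDF draw: count how many of the first two cumulative probabilities u exceeds.
y = (u[:, None] > np.cumsum(probs[:, :2], axis=1)).sum(axis=1)

B_hat = fit_polychotomous_logit(X, y, n_classes=3)
```

At convergence the score vector is numerically zero, and the inverse of the final information matrix gives the usual large-sample covariance estimate for the maximum likelihood estimators.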