937 resultados para multiple linear regression models
Resumo:
Objective. Essential hypertension affects 25% of the US adult population and is a leading contributor to morbidity and mortality. Because BP is a multifactorial phenotype that resists simple genetic analysis, intermediate phenotypes within the complex network of BP regulatory systems may be more accessible to genetic dissection. The Renin-Angiotensin System (RAS) is known to influence intermediate and long-term blood pressure regulation through alterations in vascular tone and renal sodium and fluid resorption. This dissertation examines associations between renin (REN), angiotensinogen (AGT), angiotensin-converting enzyme (ACE) and angiotensin II type 1 receptor (AT1) gene variation and interindividual differences in plasma hormone levels, renal hemodynamics, and BP homeostasis.^ Methods. A total of 150 unrelated men and 150 unrelated women, between 20.0 and 49.9 years of age and free of acute or chronic illness except for a history of hypertension (11 men and 7 women, all off medications), were studied after one week on a controlled sodium diet. RAS plasma hormone levels, renal hemodynamics and BP were determined prior to and during angiotensin II (Ang II) infusion. Individuals were genotyped by PCR for a variable number tandem repeat (VNTR) polymorphism in REN, and for the following restriction fragment length polymorphisms (RFLP): AGT M235T, ACE I/D, and AT1 A1166C. Associations between clinical measurements and allelic variation were examined using multiple linear regression statistical models.^ Results. Women homozygous for the AT1 1166C allele demonstrated higher intracellular levels of sodium (p = 0.044). Men homozygous for the AGT T235 allele demonstrated a blunted decrement in renal plasma flow in response to Ang II infusion (p = 0.0002). There were no significant associations between RAS gene variation and interindividual variation in RAS plasma hormone levels or BP.^ Conclusions. Rather than identifying new BP controlling genes or alleles, the study paradigm employed in this thesis (i.e., measured genes, controlled environments and interventions) may provide mechanistic insight into how candidate genes affect BP homeostasis. ^
Resumo:
robreg provides a number of robust estimators for linear regression models. Among them are the high breakdown-point and high efficiency MM-estimator, the Huber and bisquare M-estimator, and the S-estimator, each supporting classic or robust standard errors. Furthermore, basic versions of the LMS/LQS (least median of squares) and LTS (least trimmed squares) estimators are provided. Note that the moremata package, also available from SSC, is required.
Resumo:
Parameter estimates from commonly used multivariable parametric survival regression models do not directly quantify differences in years of life expectancy. Gaussian linear regression models give results in terms of absolute mean differences, but are not appropriate in modeling life expectancy, because in many situations time to death has a negative skewed distribution. A regression approach using a skew-normal distribution would be an alternative to parametric survival models in the modeling of life expectancy, because parameter estimates can be interpreted in terms of survival time differences while allowing for skewness of the distribution. In this paper we show how to use the skew-normal regression so that censored and left-truncated observations are accounted for. With this we model differences in life expectancy using data from the Swiss National Cohort Study and from official life expectancy estimates and compare the results with those derived from commonly used survival regression models. We conclude that a censored skew-normal survival regression approach for left-truncated observations can be used to model differences in life expectancy across covariates of interest.
Resumo:
The main objective of this preliminary study was to further clarify the association between testosterone (T) levels and depression by investigating symptom-based depression subtypes in a sample of 64 men. The data were taken from the ZInEP epidemiology survey. Gonadal hormones of a melancholic (n = 25) and an atypical (n = 14) depression subtype, derived from latent class analysis, were compared with those of healthy controls (n = 18). Serum T was assayed using an enzyme-linked immunosorbent assay procedure. Analysis of variance, analysis of covariance, non-parametrical tests, and generalized linear regression models were performed to examine group differences. The atypical depressive subtype showed significantly lower T levels compared with the melancholic depressives. While accumulative evidence indicates that, beyond psychosocial characteristics, the melancholic and atypical depressive subtypes are also distinguishable by biological correlates, the current study expanded this knowledge to include gonadal hormones. Further longitudinal research is warranted to disclose causality by linking the multiple processes in pathogenesis of depression.
Resumo:
When considering data from many trials, it is likely that some of them present a markedly different intervention effect or exert an undue influence on the summary results. We develop a forward search algorithm for identifying outlying and influential studies in meta-analysis models. The forward search algorithm starts by fitting the hypothesized model to a small subset of likely outlier-free studies and proceeds by adding studies into the set one-by-one that are determined to be closest to the fitted model of the existing set. As each study is added to the set, plots of estimated parameters and measures of fit are monitored to identify outliers by sharp changes in the forward plots. We apply the proposed outlier detection method to two real data sets; a meta-analysis of 26 studies that examines the effect of writing-to-learn interventions on academic achievement adjusting for three possible effect modifiers, and a meta-analysis of 70 studies that compares a fluoride toothpaste treatment to placebo for preventing dental caries in children. A simple simulated example is used to illustrate the steps of the proposed methodology, and a small-scale simulation study is conducted to evaluate the performance of the proposed method. Copyright © 2016 John Wiley & Sons, Ltd.
Resumo:
Interaction effect is an important scientific interest for many areas of research. Common approach for investigating the interaction effect of two continuous covariates on a response variable is through a cross-product term in multiple linear regression. In epidemiological studies, the two-way analysis of variance (ANOVA) type of method has also been utilized to examine the interaction effect by replacing the continuous covariates with their discretized levels. However, the implications of model assumptions of either approach have not been examined and the statistical validation has only focused on the general method, not specifically for the interaction effect.^ In this dissertation, we investigated the validity of both approaches based on the mathematical assumptions for non-skewed data. We showed that linear regression may not be an appropriate model when the interaction effect exists because it implies a highly skewed distribution for the response variable. We also showed that the normality and constant variance assumptions required by ANOVA are not satisfied in the model where the continuous covariates are replaced with their discretized levels. Therefore, naïve application of ANOVA method may lead to an incorrect conclusion. ^ Given the problems identified above, we proposed a novel method modifying from the traditional ANOVA approach to rigorously evaluate the interaction effect. The analytical expression of the interaction effect was derived based on the conditional distribution of the response variable given the discretized continuous covariates. A testing procedure that combines the p-values from each level of the discretized covariates was developed to test the overall significance of the interaction effect. According to the simulation study, the proposed method is more powerful then the least squares regression and the ANOVA method in detecting the interaction effect when data comes from a trivariate normal distribution. The proposed method was applied to a dataset from the National Institute of Neurological Disorders and Stroke (NINDS) tissue plasminogen activator (t-PA) stroke trial, and baseline age-by-weight interaction effect was found significant in predicting the change from baseline in NIHSS at Month-3 among patients received t-PA therapy.^
Resumo:
This dissertation develops and explores the methodology for the use of cubic spline functions in assessing time-by-covariate interactions in Cox proportional hazards regression models. These interactions indicate violations of the proportional hazards assumption of the Cox model. Use of cubic spline functions allows for the investigation of the shape of a possible covariate time-dependence without having to specify a particular functional form. Cubic spline functions yield both a graphical method and a formal test for the proportional hazards assumption as well as a test of the nonlinearity of the time-by-covariate interaction. Five existing methods for assessing violations of the proportional hazards assumption are reviewed and applied along with cubic splines to three well known two-sample datasets. An additional dataset with three covariates is used to explore the use of cubic spline functions in a more general setting. ^
Resumo:
The performance of the Hosmer-Lemeshow global goodness-of-fit statistic for logistic regression models was explored in a wide variety of conditions not previously fully investigated. Computer simulations, each consisting of 500 regression models, were run to assess the statistic in 23 different situations. The items which varied among the situations included the number of observations used in each regression, the number of covariates, the degree of dependence among the covariates, the combinations of continuous and discrete variables, and the generation of the values of the dependent variable for model fit or lack of fit.^ The study found that the $\rm\ C$g* statistic was adequate in tests of significance for most situations. However, when testing data which deviate from a logistic model, the statistic has low power to detect such deviation. Although grouping of the estimated probabilities into quantiles from 8 to 30 was studied, the deciles of risk approach was generally sufficient. Subdividing the estimated probabilities into more than 10 quantiles when there are many covariates in the model is not necessary, despite theoretical reasons which suggest otherwise. Because it does not follow a X$\sp2$ distribution, the statistic is not recommended for use in models containing only categorical variables with a limited number of covariate patterns.^ The statistic performed adequately when there were at least 10 observations per quantile. Large numbers of observations per quantile did not lead to incorrect conclusions that the model did not fit the data when it actually did. However, the statistic failed to detect lack of fit when it existed and should be supplemented with further tests for the influence of individual observations. Careful examination of the parameter estimates is also essential since the statistic did not perform as desired when there was moderate to severe collinearity among covariates.^ Two methods studied for handling tied values of the estimated probabilities made only a slight difference in conclusions about model fit. Neither method split observations with identical probabilities into different quantiles. Approaches which create equal size groups by separating ties should be avoided. ^
Resumo:
Este trabajo propone una metodología basada en Sistemas de Información Geográfica para estimar la demanda de viajes en estaciones de redes de transporte público, tomando como ejemplo la red de metro de Madrid. Primero se emplea una serie de datos descriptivos para caracterizar la red, clasificar las estaciones y obtener una tipología de las mismas. Luego, con el objetivo de explicar y predecir los viajes (entradas a la red) se generan dos modelos: uno sencillo a partir de las tasas de penetración de uso del metro en función de la distancia (distance decay), y otro más complejo basado en un modelo de regresión lineal múltiple (MRLM) que incorpora variables relativas a la estación y su entorno (densidad, mezcla de usos, diseño urbano, presencia de modos competidores). Su aplicación muestra resultados alentadores, y se plantea como una alternativa a los clásicos modelos de cuatro etapas, más complejos y con un mayor coste económico.
Resumo:
Euphausiids constitute major biomass component in shelf ecosystems and play a fundamental role in the rapid vertical transport of carbon from the ocean surface to the deeper layers during their daily vertical migration (DVM). DVM depth and migration patterns depend on oceanographic conditions with respect to temperature, light and oxygen availability at depth, factors that are highly dependent on season in most marine regions. Changes in the abiotic conditions also shape Euphausiid metabolism including aerobic and anaerobic energy production. Here we introduce a global krill respiration model which includes the effect of latitude (LAT), the day of the year of interest (DoY), and the number of daylight hours on the day of interest (DLh), in addition to the basal variables that determine ectothermal oxygen consumption (temperature, body mass and depth) in the ANN model (Artificial Neural Networks). The newly implemented parameters link space and time in terms of season and photoperiod to krill respiration. The ANN model showed a better fit (r**2=0.780) when DLh and LAT were included, indicating a decrease in respiration with increasing LAT and decreasing DLh. We therefore propose DLh as a potential variable to consider when building physiological models for both hemispheres. We also tested for seasonality the standard respiration rate of the most common species that were investigated until now in a large range of DLh and DoY with Multiple Linear Regression (MLR) or General Additive model (GAM). GAM successfully integrated DLh (r**2= 0.563) and DoY (r**2= 0.572) effects on respiration rates of the Antarctic krill, Euphausia superba, yielding the minimum metabolic activity in mid-June and the maximum at the end of December. Neither the MLR nor the GAM approach worked for the North Pacific krill Euphausia pacifica, and MLR for the North Atlantic krill Meganyctiphanes norvegica remained inconclusive because of insufficient seasonal data coverage. We strongly encourage comparative respiration measurements of worldwide Euphausiid key species at different seasons to improve accuracy in ecosystem modelling.
Resumo:
Aim: To investigate shell size variation among gastropod faunas of fossil and recent long-lived European lakes and discuss potential underlying processes. Location: 23 long-lived lakes of the Miocene to Recent of Europe. Methods: Based on a dataset of 1412 species of both fossil and extant lacustrine gastropods, we assessed differences in shell size in terms of characteristics of the faunas (species richness, degree of endemism, differences in family composition) and the lakes (surface area, latitude and longitude of lake centroid, distance to closest neighbouring lake) using multiple and linear regression models. Because of a strong species-area relationship, we used resampling to determine whether any observed correlation is driven by that relationship. Results: The regression models indicated size range expansion rather than unidirectional increase or decrease as the dominant pattern of size evolution. The multiple regression models for size range and maximum and minimum size were statistically significant, while the model with mean size was not. Individual contributions and linear regressions indicated species richness and lake surface area as best predictors for size changes. Resampling analysis revealed no significant effects of species richness on the observed patterns. The correlations are comparable across families of different size classes, suggesting a general pattern. Main conclusions: Among the chosen variables, species richness and lake surface area are the most robust predictors of shell size in long-lived lake gastropods. Although the most outstanding and attractive examples for size evolution in lacustrine gastropods derive from lakes with extensive durations, shell size appears to be independent of the duration of the lake as well as longevity of a species. The analogue of long-lived lakes as 'evolutionary islands' does not hold for developments of shell size because different sets of parameters predict size changes.
Resumo:
Linear regression is a technique widely used in digital signal processing. It consists on finding the linear function that better fits a given set of samples. This paper proposes different hardware architectures for the implementation of the linear regression method on FPGAs, specially targeting area restrictive systems. It saves area at the cost of constraining the lengths of the input signal to some fixed values. We have implemented the proposed scheme in an Automatic Modulation Classifier, meeting the hard real-time constraints this kind of systems have.
Resumo:
Algorithms for distributed agreement are a powerful means for formulating distributed versions of existing centralized algorithms. We present a toolkit for this task and show how it can be used systematically to design fully distributed algorithms for static linear Gaussian models, including principal component analysis, factor analysis, and probabilistic principal component analysis. These algorithms do not rely on a fusion center, require only low-volume local (1-hop neighborhood) communications, and are thus efficient, scalable, and robust. We show how they are also guaranteed to asymptotically converge to the same solution as the corresponding existing centralized algorithms. Finally, we illustrate the functioning of our algorithms on two examples, and examine the inherent cost-performance tradeoff.
Resumo:
Aplicación de simulación de Monte Carlo y técnicas de Análisis de la Varianza (ANOVA) a la comparación de modelos estocásticos dinámicos para accidentes de tráfico.