941 results for Log-linear model
Abstract:
Background: To analyze longitudinal familial data, we adopted a log-linear form that incorporates heterogeneity in genetic variance components over time, together with a serial correlation term for the genetic effects at different ages. Because multiple measures were available on the same individual, we allowed environmental correlations to change across time. Results: Systolic blood pressure from family members of the first and second cohorts was used in the current analysis. Measures from subjects receiving hypertension treatment were treated as censored values and corrected. An initial check of the variance and covariance functions proposed for analyzing longitudinal familial data, using empirical semi-variogram plots, indicated that the observed trait dispersion pattern follows the assumptions adopted. Conclusion: Correcting censored phenotypes with ordinary linear models may be an appropriately simple approach that retains the original variability in the data. In addition, empirical semi-variogram plots are useful for diagnosing the (co)variance model adopted.
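The semi-variogram check mentioned above can be illustrated with a minimal sketch. It assumes a single subject's residuals observed at integer visit times; the function name `empirical_semivariogram` and the sample numbers are hypothetical and are not taken from the study.

```python
from collections import defaultdict


def empirical_semivariogram(times, residuals):
    """Average half squared difference of residual pairs, grouped by time lag."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    n = len(times)
    for i in range(n):
        for j in range(i + 1, n):
            lag = abs(times[j] - times[i])
            sums[lag] += 0.5 * (residuals[i] - residuals[j]) ** 2
            counts[lag] += 1
    # One semivariance estimate per observed lag, in increasing lag order.
    return {lag: sums[lag] / counts[lag] for lag in sorted(sums)}


# Hypothetical residuals at four equally spaced visits.
gamma = empirical_semivariogram([0, 1, 2, 3], [1.0, 2.0, 4.0, 7.0])
```

Plotting these values against lag gives the empirical semi-variogram; a curve that rises with lag suggests serial correlation that decays over time.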
Abstract:
This study analyzes rural landscape changes. In particular, it focuses on understanding the driving forces acting on the rural built environment, using a statistical spatial model implemented through GIS techniques. The study of landscape changes is well known to be essential for informed decision making in land planning. A literature review reveals a general lack of studies on modeling the rural built environment, so a theoretical modeling approach for this purpose is needed. Advances in technology and modernization in building construction and agriculture have gradually changed the rural built environment. In addition, urbanization has led to the construction of new volumes beside abandoned or derelict rural buildings. Consequently, two types of transformation dynamics mainly affecting the rural built environment can be observed: the conversion of rural buildings and the increase in the number of buildings. The specific aim of this study is to propose a methodology for developing a spatial model that identifies the driving forces that acted on building allocation. Indeed, one of the most worrying dynamics today is the irrational sprawl of buildings across the landscape. The proposed methodology consists of conceptual steps covering different aspects of spatial model development: selecting the response variable that best describes the phenomenon under study, identifying possible driving forces, defining the sampling methodology for data collection, choosing the most suitable algorithm in relation to the statistical theory and method used, and calibrating and evaluating the model.
Different combinations of factors in various parts of the territory generated more or less favourable conditions for building allocation, and the existence of buildings is evidence of such an optimum. Conversely, the absence of buildings expresses a combination of agents that is not suitable for building allocation. The presence or absence of buildings can therefore be adopted as an indicator of such driving conditions, since it expresses the action of driving forces in the land suitability sorting process. A correlation between site selection and hypothetical driving forces, evaluated by means of modeling techniques, provides evidence of which driving forces are involved in the allocation dynamic and insight into their level of influence on the process. Through spatial analysis tools, GIS software makes it possible to associate the concepts of presence and absence with point features, generating a point process. The presence or absence of buildings at given locations expresses the interaction of these driving factors. In the case of presences, points represent the locations of existing buildings; conversely, absences represent locations where buildings do not exist, and so they are generated by a stochastic mechanism. Possible driving forces are selected, and the existence of a causal relationship with building allocation is assessed through a spatial model. Empirical statistical models provide a mechanism for analyzing the explanatory variables and for identifying the key driving variables behind the site selection process for new building allocation. The model developed by following the methodology is applied to a case study to test the validity of the methodology. The study area is the New District of Imola, which is characterized by a prevailing agricultural production vocation and where transformation dynamics have occurred intensively.
The development of the model involved the identification of predictive variables (related to the geomorphologic, socio-economic, structural and infrastructural systems of the landscape) capable of representing the driving forces responsible for landscape changes. The model is calibrated on spatial data for the periurban and rural parts of the study area over the 1975-2005 period by means of a generalized linear model. The output of the model fit is a continuous grid surface whose cells take values from 0 to 1, representing the probability of building occurrence across the rural and periurban parts of the study area. Hence the response variable assesses the changes in the rural built environment that occurred in this time interval and is related to the selected explanatory variables by means of a generalized linear model using logistic regression. Comparing the probability map obtained from the model with the actual rural building distribution in 2005 makes it possible to evaluate the interpretation capability of the model. The proposed model can also be applied to the interpretation of trends in other study areas and other time intervals, depending on the availability of data. The use of suitable data in terms of time, information, and spatial resolution, and the costs related to data acquisition, pre-processing, and survey, are among the most critical aspects of model implementation. Future in-depth studies can focus on using the proposed model to predict short- and medium-range future scenarios for the distribution of the rural built environment in the study area. To predict future scenarios, it is necessary to assume that the driving forces do not change and that their levels of influence within the model are not far from those assessed for the time interval used for the calibration.
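As a compact reminder of the model form described in this abstract, the logistic regression that maps explanatory variables to a cell-level probability can be written as follows. The symbols (cell index i, predictors x_{ik}, coefficients \beta_k, number of predictors K) are notation introduced here for illustration and are not taken from the study.

```latex
\[
\mathrm{logit}(p_i) = \ln\frac{p_i}{1 - p_i} = \beta_0 + \sum_{k=1}^{K} \beta_k x_{ik},
\qquad
p_i = \frac{1}{1 + \exp\!\Bigl(-\beta_0 - \sum_{k=1}^{K} \beta_k x_{ik}\Bigr)}
\]
```

Because the inverse logit always lies strictly between 0 and 1, evaluating p_i for every grid cell yields the kind of continuous probability surface the abstract describes.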
Abstract:
This thesis consists of three self-contained papers. In the first paper I analyze the labor supply behavior of Bologna pizza delivery vendors. Recent influential papers analyze the labor supply behavior of taxi drivers (Camerer et al., 1997; Crawford and Meng, 2011) and suggest that reference-dependent preferences have an important influence on drivers’ labor supply decisions. Unlike previous papers, I am able to identify an exogenous and transitory change in labor demand. Using high-frequency data on orders and rainfall as an exogenous demand shifter, I consistently find that reference-dependent preferences play no role in their labor supply decisions and that the behavior of pizza vendors is perfectly consistent with the predictions of the standard model of labor supply. In the second paper, I investigate how the voting behavior of Members of Parliament is influenced by the Members seated nearby. By exploiting the random seating arrangements in the Icelandic Parliament, I show that being seated next to Members of a different party increases the probability of not being aligned with one’s own party. Using the exact spatial orientation of the peers, I provide evidence supporting the hypothesis that interaction is the main channel that explains these results. In the third paper, I estimate the trade flows that would have occurred between the UK and Europe if the UK had joined the Euro. As an alternative to the standard log-linear gravity equation, I employ the synthetic control method. I show that aggregate trade flows between Britain and Europe would have been 13% higher if the UK had adopted the Euro.
Abstract:
BACKGROUND: The first investigations of the interactions between weather and the incidence of acute myocardial infarctions date back to 1938. The early observation of a higher incidence of myocardial infarctions in the cold season has been confirmed in very different geographical regions and cohorts. While the influence of seasonal variations on the incidence of myocardial infarctions has been extensively documented, the impact of individual meteorological parameters on the disease has so far not been investigated systematically. Hence the present study was intended to assess the impact of the essential variables of weather and climate on the incidence of myocardial infarctions. METHODS: The daily incidence of myocardial infarctions was calculated from a national hospitalization survey. The hourly weather and climate data were provided by the database of the national weather forecast. The epidemiological and meteorological data were correlated by multivariate analysis based on a generalized linear model assuming a log link function and a Poisson distribution. RESULTS: High ambient pressure, high pressure gradients, and heavy wind activity were associated with an increase in the incidence of hospitalization for myocardial infarction (6560 hospitalizations in total), irrespective of the geographical region. Snowfall and rainfall had inconsistent effects. Temperature, Foehn, and lightning showed no statistically significant impact. CONCLUSIONS: Ambient pressure, pressure gradient, and wind activity had a statistical impact on the incidence of myocardial infarctions in Switzerland from 1990 to 1994. To establish a cause-and-effect relationship, more data are needed on the interaction between the pathophysiological mechanisms of the acute coronary syndrome and weather and climate variables.
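For readers less familiar with the model named in the METHODS, a Poisson generalized linear model with a log link has the general form below. The notation (daily count Y_t, weather covariates x_{kt}, coefficients \beta_k) is assumed here for illustration; the abstract does not give the exact specification used.

```latex
\[
Y_t \sim \mathrm{Poisson}(\mu_t), \qquad
\ln \mu_t = \beta_0 + \sum_{k=1}^{K} \beta_k x_{kt}
\quad\Longrightarrow\quad
\frac{\mu_t(x_{kt} + 1)}{\mu_t(x_{kt})} = e^{\beta_k}
\]
```

Under this form, each exponentiated coefficient is a rate ratio: the multiplicative change in expected daily incidence for a one-unit increase in that covariate, holding the others fixed.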
Abstract:
Various inference procedures for linear regression models with censored failure times have been studied extensively. Recent developments on efficient algorithms to implement these procedures enhance the practical usage of such models in survival analysis. In this article, we present robust inferences for certain covariate effects on the failure time in the presence of "nuisance" confounders under a semiparametric, partial linear regression setting. Specifically, the estimation procedures for the regression coefficients of interest are derived from a working linear model and are valid even when the function of the confounders in the model is not correctly specified. The new proposals are illustrated with two examples and their validity for cases with practical sample sizes is demonstrated via a simulation study.
Abstract:
The Receiver Operating Characteristic (ROC) curve is a prominent tool for characterizing the accuracy of a continuous diagnostic test. To account for factors that might influence the test accuracy, various ROC regression methods have been proposed. However, as in any regression analysis, when the assumed models do not fit the data well, these methods may yield invalid and misleading results. To date, practical model checking techniques suitable for validating existing ROC regression models are not yet available. In this paper, we develop cumulative residual based procedures to graphically and numerically assess the goodness-of-fit for some commonly used ROC regression models, and show how specific components of these models can be examined within this framework. We derive asymptotic null distributions for the residual process and discuss resampling procedures to approximate these distributions in practice. We illustrate our methods with a dataset from the Cystic Fibrosis registry.
Abstract:
Objective: There is an ongoing debate concerning how outcome variables change during the course of psychotherapy. We compared the dose–effect model, which posits diminishing effects of additional sessions in later treatment phases, against a model that assumes a linear and steady treatment progress through termination. Method: Session-by-session outcome data of 6,375 outpatients were analyzed, and participants were categorized according to treatment length. Linear and log-linear (i.e., negatively accelerating) latent growth curve models (LGCMs) were estimated and compared for different treatment length categories. Results: When comparing the fit of the various models, the log-linear LGCMs assuming negatively accelerating treatment progress consistently outperformed the linear models irrespective of treatment duration. The rate of change was found to be inversely related to the length of treatment. Conclusion: As proposed by the dose–effect model, the expected course of improvement in psychotherapy appears to follow a negatively accelerated pattern of change, irrespective of the duration of the treatment. However, our results also suggest that the rate of change is not constant across various treatment lengths. As proposed by the “good enough level” model, longer treatments are associated with less rapid rates of change.
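The contrast between a linear and a log-linear (negatively accelerating) time trend can be sketched with ordinary least squares. This is a deliberate simplification: the study fitted latent growth curve models, not the single-trajectory regression shown here, and the scores below are synthetic values generated from a logarithmic curve purely for illustration.

```python
import math


def ols(x, y):
    """Simple least-squares fit of y = intercept + slope * x; returns (intercept, slope, sse)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    return intercept, slope, sse


# Synthetic (hypothetical) symptom scores over 10 sessions, improving quickly at first.
sessions = list(range(1, 11))
scores = [10.0 - 2.0 * math.log(s) for s in sessions]

# Linear trend: score regressed on session number.
_, _, sse_linear = ols(sessions, scores)
# Log-linear trend: score regressed on the logarithm of session number.
_, _, sse_loglinear = ols([math.log(s) for s in sessions], scores)
```

With data of this shape the log-linear fit leaves a much smaller residual sum of squares than the linear fit, which is the pattern of model comparison the abstract reports at the group level.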
Abstract:
BACKGROUND Genome-wide association studies have linked CYP17A1 coding for the steroid hormone synthesizing enzyme 17α-hydroxylase (CYP17A1) to blood pressure (BP). We hypothesized that the genetic signal may translate into a correlation of ambulatory BP (ABP) with apparent CYP17A1 activity in a family-based population study and estimated the heritability of CYP17A1 activity. METHODS In the Swiss Kidney Project on Genes in Hypertension, day and night urinary excretions of steroid hormone metabolites were measured in 518 participants (220 men, 298 women), randomly selected from the general population. CYP17A1 activity was assessed by 2 ratios of urinary steroid metabolites: one estimating the combined 17α-hydroxylase/17,20-lyase activity (ratio 1) and the other predominantly 17α-hydroxylase activity (ratio 2). A mixed linear model was used to investigate the association of ABP with log-transformed CYP17A1 activities exploring effect modification by urinary sodium excretion. RESULTS Daytime ABP was positively associated with ratio 1 under conditions of high, but not low urinary sodium excretion (P interaction <0.05). Ratio 2 was not associated with ABP. Heritability estimates (SE) for day and night CYP17A1 activities were 0.39 (0.10) and 0.40 (0.09) for ratio 1, and 0.71 (0.09) and 0.55 (0.09) for ratio 2 (P values <0.001). CYP17A1 activities, assessed with ratio 1, were lower in older participants. CONCLUSIONS Low apparent CYP17A1 activity (assessed with ratio 1) is associated with elevated daytime ABP when salt intake is high. CYP17A1 activity is heritable and diminished in the elderly. These observations highlight the modifying effect of salt intake on the association of CYP17A1 with BP.
Abstract:
Unlike infections occurring during periods of chemotherapy-induced neutropenia, postoperative infections in patients with solid malignancy remain largely understudied. The purpose of this population-based study was to evaluate the clinical and economic burden, as well as the relationship of hospital surgical volume and outcomes associated with serious postoperative infection (SPI) – i.e., bacteremia/sepsis, pneumonia, and wound infection – following resection of common solid tumors. From the Texas Discharge Data Research File, we identified all Texas residents who underwent resection of cancer of the lung, esophagus, stomach, pancreas, colon, or rectum between 2002 and 2006. From their billing records, we identified ICD-9 codes indicating SPI and also subsequent SPI-related readmissions occurring within 30 days of surgery. Random-effects logistic regression was used to calculate the impact of SPI on mortality, as well as the association between surgical volume and SPI, adjusting for case-mix, hospital characteristics, and clustering of multiple surgical admissions within the same patient and patients within the same hospital. Excess bed days and costs were calculated by subtracting values for patients without infections from those with infections, computed using a multilevel mixed-effects generalized linear model with a gamma distribution and a log link. Serious postoperative infection occurred following 9.4% of the 37,582 eligible tumor resections and was independently associated with an 11-fold increase in the odds of in-hospital mortality (95% Confidence Interval [95% CI], 6.7-18.5, P < 0.001). Patients with SPI required 6.3 additional hospital days (95% CI, 6.1 - 6.5) at an incremental cost of $16,396 (95% CI, $15,927–$16,875). There was a significant trend toward lower overall rates of SPI with higher surgical volume (P=0.037).
Due to the substantial morbidity, mortality, and excess costs associated with SPI following solid tumor resections and given that, under current reimbursement practices, most of this heavy burden is borne by acute care providers, it is imperative for hospitals to identify more effective prophylactic measures, so that these potentially preventable infections and their associated expenditures can be averted. Additional volume-outcomes research is also needed to identify infection prevention processes that can be transferred from higher- to lower-volume providers.
Abstract:
Objectives. This paper seeks to assess the effect on statistical power of regression model misspecification in a variety of situations. Methods and results. The effect of misspecification in regression can be approximated by evaluating the correlation between the correct specification and the misspecification of the outcome variable (Harris 2010). In this paper, three misspecified models (linear, categorical and fractional polynomial) were considered. In the first section, the mathematical method of calculating the correlation between correct and misspecified models with simple mathematical forms was derived and demonstrated. In the second section, data from the National Health and Nutrition Examination Survey (NHANES 2007-2008) were used to examine such correlations. Our study shows that, compared with linear or categorical models, the fractional polynomial models, with the higher correlations, provided a better approximation of the true relationship, which was illustrated by LOESS regression. In the third section, we present the results of simulation studies that demonstrate that, overall, misspecification in regression can produce marked decreases in power with small sample sizes. However, the categorical model had the greatest power, ranging from 0.877 to 0.936 depending on sample size and outcome variable used. The power of the fractional polynomial model was close to that of the linear model, which ranged from 0.69 to 0.83, and appeared to be affected by the increased degrees of freedom of this model. Conclusion. Correlations between alternative model specifications can be used to provide a good approximation of the effect on statistical power of misspecification when the sample size is large. When model specifications have known simple mathematical forms, such correlations can be calculated mathematically. Actual public health data from NHANES 2007-2008 were used as examples to demonstrate the situations with unknown or complex correct model specification.
Simulation of power for misspecified models confirmed the results based on correlation methods but also illustrated the effect of model degrees of freedom on power.
Abstract:
A Bayesian approach to estimation of the regression coefficients of a multinomial logit model with ordinal scale response categories is presented. A Monte Carlo method is used to construct the posterior distribution of the link function. The link function is treated as an arbitrary scalar function. Then the Gauss-Markov theorem is used to determine a function of the link which produces a random vector of coefficients. The posterior distribution of the random vector of coefficients is used to estimate the regression coefficients. The method described is referred to as a Bayesian generalized least squares (BGLS) analysis. Two cases involving multinomial logit models are described. Case I involves a cumulative logit model and Case II involves a proportional-odds model. All inferences about the coefficients for both cases are described in terms of the posterior distribution of the regression coefficients. The results from the BGLS method are compared to maximum likelihood estimates of the regression coefficients. The BGLS method avoids the nonlinear problems encountered when estimating the regression coefficients of a generalized linear model. The method is not complex or computationally intensive. The BGLS method offers several advantages over Bayesian approaches.
Abstract:
Scholars have found that socioeconomic status was one of the key factors that influenced early-stage lung cancer incidence rates in a variety of regions. This thesis examined the association between median household income and lung cancer incidence rates in Texas counties. Lung cancer incidence rates from 2004 to 2008 and median household incomes in 2006 were collected for 254 individual Texas counties from the National Cancer Institute Surveillance System. A simple linear model and spatial linear models with two structures, Simultaneous Autoregressive Structure (SAR) and Conditional Autoregressive Structure (CAR), were used to link median household income and lung cancer incidence rates in Texas. The residuals of the spatial linear models were analyzed with Moran's I and Geary's C statistics, and the statistical results were used to detect similar lung cancer incidence rate clusters and disease patterns in Texas.
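Moran's I, one of the two residual diagnostics named in this abstract, can be computed directly from its definition. The sketch below is illustrative only: the function name `morans_i`, the four-region chain, and its binary adjacency weights are assumptions made here, not the Texas county data or the software used in the thesis.

```python
def morans_i(values, weights):
    """Global Moran's I for values observed on regions with a spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    total_weight = sum(sum(row) for row in weights)
    # Cross-products of deviations for every pair of regions, weighted by proximity.
    numerator = sum(
        weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n)
    )
    denominator = sum(d * d for d in dev)
    return (n / total_weight) * numerator / denominator


# Hypothetical chain of four regions: each region neighbours the next one.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

smooth = morans_i([1.0, 2.0, 3.0, 4.0], adjacency)       # similar neighbours
alternating = morans_i([1.0, 4.0, 1.0, 4.0], adjacency)  # dissimilar neighbours
```

Positive values indicate that neighbouring regions tend to have similar values (clustering), while negative values indicate that neighbours tend to differ.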
Abstract:
Assessing wind conditions becomes harder as terrain complexity increases. There is therefore a need to extrapolate reliably the wind parameters that determine wind farm viability, such as annual average wind speed at all hub heights and turbulence intensity. Work on these tasks began in the early 1990s with the widely used linear models WAsP and WAsP Engineering, which were designed for simple terrain and give remarkable results there but perform less well over complex orography. Meanwhile, non-linearized Navier-Stokes solvers have developed rapidly over the last decade in CFD (Computational Fluid Dynamics) codes, allowing atmospheric boundary layer flows over steep complex terrain to be simulated more accurately and with lower uncertainty. This paper describes the features of these models and validates them against meteorological masts installed in highly complex terrain. The study compares the results of these models in terms of wind speed and turbulence intensity.
Abstract:
Over the last decade, scientific studies have indicated an association between the air pollution to which people are exposed and a wide range of adverse health outcomes. We have developed a tool based on a model (MM5-CMAQ) run over Europe at 50 km spatial resolution, using EMEP annual emissions, to produce a short-term forecast of the impact on health. To estimate the mortality change (forecast for the next 24 hours), we chose a log-linear (Poisson) regression form for the concentration-response (C-R) function. The parameters of the C-R function were estimated from published epidemiological studies. Finally, we derived the relationship between concentration change and mortality change from the C-R function; this relationship is the final health impact function.
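A log-linear C-R function of the kind described here has the form y = B·exp(β·C), so a concentration change ΔC changes mortality by y0·(exp(β·ΔC) − 1), where y0 is the baseline rate. The sketch below applies that standard form; the function name, the coefficient, the baseline rate, and the population are hypothetical values chosen for illustration and are not the parameters used by the tool.

```python
import math


def mortality_change(baseline_rate, population, beta, delta_c):
    """Change in deaths implied by a log-linear concentration-response function.

    baseline_rate: baseline deaths per person over the forecast period
    population:    exposed population
    beta:          C-R coefficient per unit of concentration
    delta_c:       change in pollutant concentration
    """
    return baseline_rate * population * (math.exp(beta * delta_c) - 1.0)


# Hypothetical coefficient: a 1% mortality increase per 10 units of concentration.
beta = math.log(1.01) / 10.0

# Hypothetical scenario: baseline rate 0.01, 100,000 people, +10 units of concentration.
extra_deaths = mortality_change(0.01, 100_000, beta, 10.0)
no_change = mortality_change(0.01, 100_000, beta, 0.0)
```

In this hypothetical scenario the 1,000 baseline deaths rise by 1%, and a zero concentration change gives zero mortality change, as the functional form requires.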
Abstract:
Naproxen (C14H14O3) is a nonsteroidal anti-inflammatory drug which has been found at detectable concentrations in wastewater, surface water, and groundwater. Naproxen is relatively hydrophilic and is in anionic form at pH between 6 and 8. In this study, column experiments were performed using an unconsolidated aquifer material from an area near Barcelona (Spain) to assess transport and reaction mechanisms of Naproxen in the aquifer matrix under different pore water fluxes. Results were evaluated using HYDRUS-1D, which was used to estimate transport parameters. Batch sorption isotherms for Naproxen conformed to the linear model with a sorption coefficient of 0.42 cm3 g−1, suggesting a low sorption affinity. Naproxen breakthrough curves (BTCs) measured in soil columns under steady-state, saturated water flow conditions displayed similar behavior, with no apparent hysteresis in sorption or dependence of retardation (R, 3.85-4.24) on pore water velocities. Soil sorption did not show any significant decrease for increasing flow rates, as observed from Naproxen recovery in the effluent. Sorption parameters estimated by the model suggest that Naproxen has a low sorption affinity to the aquifer matrix. Most sorption of Naproxen occurred on the instantaneous sorption sites, with the kinetic sorption sites representing only about 10 to 40% of total sorption.
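For linear equilibrium sorption, the retardation factor is commonly written R = 1 + ρb·Kd/θ, with ρb the bulk density, Kd the sorption coefficient, and θ the volumetric water content. The sketch below evaluates that textbook relation with the Kd reported in the abstract; the bulk density and water content are hypothetical placeholders, since the abstract does not report them, and the R values in the study came from fitting breakthrough curves with HYDRUS-1D rather than from this formula alone.

```python
def retardation_factor(bulk_density, water_content, kd):
    """Retardation factor for linear equilibrium sorption: R = 1 + rho_b * Kd / theta.

    bulk_density:  dry bulk density in g/cm3
    water_content: volumetric water content (cm3/cm3)
    kd:            linear sorption coefficient in cm3/g
    """
    return 1.0 + bulk_density * kd / water_content


KD_NAPROXEN = 0.42  # cm3/g, batch sorption coefficient reported in the abstract

# Hypothetical column properties (not reported in the abstract).
r_example = retardation_factor(bulk_density=1.5, water_content=0.3, kd=KD_NAPROXEN)
```

A non-sorbing tracer has Kd = 0 and therefore R = 1; any positive Kd delays the solute front relative to the water by the factor R.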