927 resultados para LINEAR-REGRESSION MODELS
Resumo:
OBJECTIVE: Anemia is a common comorbid condition in various inflammatory states and an established predictor of mortality in patients with chronic heart failure, ischemic heart disease, and end-stage renal disease. The present study of patients with abdominal aortic aneurysm (AAA) undergoing endovascular repair (EVAR) assessed the relationships between baseline hemoglobin concentration and AAA size, as well as anemia and long-term survival. METHODS: Between March 1994 and November 2006, 711 patients (65 women, mean age 75.8 +/- 7.8 years) underwent elective EVAR. Anemia was defined as a hemoglobin level <13 g/dL in men and <12 g/dL in women. Post-EVAR mean follow-up was 48.3 +/- 32.0 months. Association of hemoglobin level with AAA size was assessed with multiple linear regression. Mortality was determined with use of the internet-based Social Security Death Index and the electronic hospital record. Kaplan-Meier survival curves of anemic and nonanemic patient groups were compared by the log-rank method. Multivariable logistic regression models were used to determine the influence of anemia on vital status after EVAR. RESULTS: A total of 218/711 (30.7%) of AAA patients undergoing EVAR had anemia at baseline. After adjustment for various risk factors, hemoglobin level was inversely related to maximum AAA diameter (beta: - .144, 95%-CI: -1.482 - .322, P = .002). Post-EVAR survival was 65.5% at 5 years and 44.4% at 10 years. In long-term follow-up, survival was significantly lower in patients with anemia as compared to patients without anemia (P < .0001 by log-rank). Baseline hemoglobin levels were independently related to long-term mortality in multivariable Cox regression analysis adjusted for various risk factors (adjusted HR: 0.866, 95% CI: .783 to .958, P = .005). Within this model, statin use (adjusted HR: .517, 95% CI: .308 to .868, P = .013) was independently related to long-term survival, whereas baseline AAA diameter (adjusted HR: 1.022, 95% CI: 1.009 to 1.036, P = .001) was an independently associated with increased mortality. CONCLUSIONS: Baseline hemoglobin concentration is independently associated with AAA size and reduced long-term survival following EVAR. Thus, the presence or absence of anemia offers a potential refinement of existing risk stratification instruments.
Resumo:
Soil degradation is a major problem in the agriculturally dominated country of Tajikistan, which makes it necessary to determine and monitor the state of soils. For this purpose a soil spectral library was established as it enables the determination of soil properties with relatively low costs and effort. A total of 1465 soil samples were collected from three 10x10 km test sites in western Tajikistan. The diffuse reflectance of the samples was measured with a FieldSpec PRO FR from ASD in the spectral range from 380 to 2500 nm in laboratory. 166 samples were finally selected based on their spectral information and analysed on total C and N, organic C, pH, CaCO₃, extractable P, exchangeable Ca, Mg and K, and the fractions clay, silt and sand. Multiple linear regression was used to set up the models. Two third of the chemically analysed samples were used to calibrate the models, one third was used for hold-out validation. Very good prediction accuracy was obtained for total C (R² = 0.76, RMSEP = 4.36 g kg⁻¹), total N (R² = 0.83, RMSEP = 0.30 g kg⁻¹) and organic C (R² = 0.81, RMSEP = 3.30 g kg⁻¹), good accuracy for pH (R² = 0.61, RMSEP = 0.157) and CaCO3(R² = 0.72, RMSEP = 4.63 %). No models could be developed for extractable P, exchangeable Ca, Mg and K, and the fractions clay, silt and sand. It can be concluded that the spectral library approach has a high potential to substitute standard laboratory methods where rapid and inexpensive analysis is required.
Resumo:
Background mortality is an essential component of any forest growth and yield model. Forecasts of mortality contribute largely to the variability and accuracy of model predictions at the tree, stand and forest level. In the present study, I implement and evaluate state-of-the-art techniques to increase the accuracy of individual tree mortality models, similar to those used in many of the current variants of the Forest Vegetation Simulator, using data from North Idaho and Montana. The first technique addresses methods to correct for bias induced by measurement error typically present in competition variables. The second implements survival regression and evaluates its performance against the traditional logistic regression approach. I selected the regression calibration (RC) algorithm as a good candidate for addressing the measurement error problem. Two logistic regression models for each species were fitted, one ignoring the measurement error, which is the “naïve” approach, and the other applying RC. The models fitted with RC outperformed the naïve models in terms of discrimination when the competition variable was found to be statistically significant. The effect of RC was more obvious where measurement error variance was large and for more shade-intolerant species. The process of model fitting and variable selection revealed that past emphasis on DBH as a predictor variable for mortality, while producing models with strong metrics of fit, may make models less generalizable. The evaluation of the error variance estimator developed by Stage and Wykoff (1998), and core to the implementation of RC, in different spatial patterns and diameter distributions, revealed that the Stage and Wykoff estimate notably overestimated the true variance in all simulated stands, but those that are clustered. Results show a systematic bias even when all the assumptions made by the authors are guaranteed. I argue that this is the result of the Poisson-based estimate ignoring the overlapping area of potential plots around a tree. Effects, especially in the application phase, of the variance estimate justify suggested future efforts of improving the accuracy of the variance estimate. The second technique implemented and evaluated is a survival regression model that accounts for the time dependent nature of variables, such as diameter and competition variables, and the interval-censored nature of data collected from remeasured plots. The performance of the model is compared with the traditional logistic regression model as a tool to predict individual tree mortality. Validation of both approaches shows that the survival regression approach discriminates better between dead and alive trees for all species. In conclusion, I showed that the proposed techniques do increase the accuracy of individual tree mortality models, and are a promising first step towards the next generation of background mortality models. I have also identified the next steps to undertake in order to advance mortality models further.
Analysis of spring break-up and its effects on a biomass feedstock supply chain in northern Michigan
Resumo:
Demand for bio-fuels is expected to increase, due to rising prices of fossil fuels and concerns over greenhouse gas emissions and energy security. The overall cost of biomass energy generation is primarily related to biomass harvesting activity, transportation, and storage. With a commercial-scale cellulosic ethanol processing facility in Kinross Township of Chippewa County, Michigan about to be built, models including a simulation model and an optimization model have been developed to provide decision support for the facility. Both models track cost, emissions and energy consumption. While the optimization model provides guidance for a long-term strategic plan, the simulation model aims to present detailed output for specified operational scenarios over an annual period. Most importantly, the simulation model considers the uncertainty of spring break-up timing, i.e., seasonal road restrictions. Spring break-up timing is important because it will impact the feasibility of harvesting activity and the time duration of transportation restrictions, which significantly changes the availability of feedstock for the processing facility. This thesis focuses on the statistical model of spring break-up used in the simulation model. Spring break-up timing depends on various factors, including temperature, road conditions and soil type, as well as individual decision making processes at the county level. The spring break-up model, based on the historical spring break-up data from 27 counties over the period of 2002-2010, starts by specifying the probability distribution of a particular county’s spring break-up start day and end day, and then relates the spring break-up timing of the other counties in the harvesting zone to the first county. In order to estimate the dependence relationship between counties, regression analyses, including standard linear regression and reduced major axis regression, are conducted. Using realizations (scenarios) of spring break-up generated by the statistical spring breakup model, the simulation model is able to probabilistically evaluate different harvesting and transportation plans to help the bio-fuel facility select the most effective strategy. For early spring break-up, which usually indicates a longer than average break-up period, more log storage is required, total cost increases, and the probability of plant closure increases. The risk of plant closure may be partially offset through increased use of rail transportation, which is not subject to spring break-up restrictions. However, rail availability and rail yard storage may then become limiting factors in the supply chain. Rail use will impact total cost, energy consumption, system-wide CO2 emissions, and the reliability of providing feedstock to the bio-fuel processing facility.
Resumo:
Despite widespread use of species-area relationships (SARs), dispute remains over the most representative SAR model. Using data of small-scale SARs of Estonian dry grassland communities, we address three questions: (1) Which model describes these SARs best when known artifacts are excluded? (2) How do deviating sampling procedures (marginal instead of central position of the smaller plots in relation to the largest plot; single values instead of average values; randomly located subplots instead of nested subplots) influence the properties of the SARs? (3) Are those effects likely to bias the selection of the best model? Our general dataset consisted of 16 series of nested-plots (1 cm(2)-100 m(2), any-part system), each of which comprised five series of subplots located in the four corners and the centre of the 100-m(2) plot. Data for the three pairs of compared sampling designs were generated from this dataset by subsampling. Five function types (power, quadratic power, logarithmic, Michaelis-Menten, Lomolino) were fitted with non-linear regression. In some of the communities, we found extremely high species densities (including bryophytes and lichens), namely up to eight species in 1 cm(2) and up to 140 species in 100 m(2), which appear to be the highest documented values on these scales. For SARs constructed from nested-plot average-value data, the regular power function generally was the best model, closely followed by the quadratic power function, while the logarithmic and Michaelis-Menten functions performed poorly throughout. However, the relative fit of the latter two models increased significantly relative to the respective best model when the single-value or random-sampling method was applied, however, the power function normally remained far superior. These results confirm the hypothesis that both single-value and random-sampling approaches cause artifacts by increasing stochasticity in the data, which can lead to the selection of inappropriate models.
Resumo:
The human immunodeficiency virus-1 reverse transcriptase inhibitory activity of 2-(2,6-disubstituted phenyl)-3-(substituted pyrimidin-2-yl)-thiazolidin-4-ones have been analyzed using combinatorial protocol in multiple linear regression (CP-MLR) with several electronic and molecular surface area features of the compounds obtained from Molecular Operating Environment (MOE) software. The study has indicated the role of different charged molecular surface areas in modeling the inhibitory activity of the compounds. The derived models collectively suggested that the compounds should be compact without bulky substitutions on its peripheries for better HIV-1 RT inhibitory activity. It also emphasized the necessity of hydrophobicity and compact structural features for their activity. The scope of the descriptors identified for these analogues have been verified by extending the dataset with different 2-(disubstituted phenyl)-3-(substituted pyridin-2-yl)-thiazolidin-4-ones. The joint analysis of extended dataset highlighted the information content of identified descriptors in modeling the HIV-1 RT inhibitory activity of the compounds.
Resumo:
Two series of closely related antimalarial agents, 7-chloro-4-(3’,5’-disubstitutedanilino) quinolines, have been analyzed using Combinatorial Protocol in Multiple Linear Regression (CP-MLR) for the structure-activity relations with more than 450 topological descriptors for each set. The study clearly suggested that 3’- and 5’- substituents of the anilino moiety map different domains in the activity space. While one domain favors the compact structural frames having aromatic, heterocyclic ring(s) substituted with closely spaced F, NO2 and O functional groups, the other prefers structural frames enriched with unsaturation, loops, branches, electronic content and devoid of carbonyl function. Also, this study gives an indication in favour of the electron rich centres in the aniline substituent groups for better antimalarial activity; an observation in line with several of the previous reports too. The models developed and the participating descriptors suggest that the substituent groups of the 4-anilino moiety of the 4-(3’, 5’-disubstitutedanilino)quinolines hold scope for further modification in the optimisation of the antimalarial activity.
Resumo:
BACKGROUND The possible impact of coinfection with the Kaposi sarcoma-associated herpes virus (KSHV) on the response to antiretroviral therapy (ART) is unknown. Prospective studies are rare, particularly in Africa. METHODS We enrolled a prospective cohort of HIV-infected adults initiating ART in Johannesburg, South Africa. The subjects were defined as seropositive to KSHV if they were reactive to either KSHV lytic K8.1 or latent Orf73 antigen or to both. The subjects were followed from ART initiation until 18 months of treatment. HIV viral load and CD4 counts were tested 6 monthly. Linear generalized estimating and log-binomial regression models were used to estimate the effect of KSHV infection on immunologic recovery and response and HIV viral load suppression within 18 months after ART initiation. RESULTS Three hundred eighty-five subjects initiating ART from November 2008 to March 2009 were considered to be eligible including 184 (48%) KSHV+. The KSHV+ group was similar to the KSHV- in terms of age, gender, initiating CD4 count, body mass index, tuberculosis, and hemoglobin levels. The KSHV+ group gained a similar number of cells at 6 [difference of 10 cells per cubic millimeter, 95% confidence interval (CI): -11 to 31], 12 (3 cells per cubic millimeter, 95% CI: -19 to 25), and 18 months (24 cells per cubic millimeter, 95% CI: -13 to 61) compared with that gained by the KSHV- group. Adjusted relative risk of failure to suppress viral load to <400 copies per milliliter (1.03; 95% CI: 0.90 to 1.17) were similar for KSHV+ and KSHV- by 6 months on treatment. CONCLUSIONS In a population with a high KSHV prevalence, HIV-positive adults coinfected with KSHV achieved similar immunologic and virologic responses to ART early after treatment initiation compared with those with KSHV-.
Resumo:
BACKGROUND Muscle strength greatly influences gait kinematics. The question was whether this association is similar in different diseases. METHODS Data from instrumented gait analysis of 716 patients were retrospectively assessed. The effect of muscle strength on gait deviations, namely the gait profile score (GPS) was evaluated by means of generalised least square models. This was executed for seven different patient groups. The groups were formed according to the type of disease: orthopaedic/neurologic, uni-/bilateral affection, and flaccid/spastic muscles. RESULTS Muscle strength had a negative effect on GPS values, which did not significantly differ amongst the different patient groups. However, an offset of the GPS regression line was found, which was mostly dependent on the basic disease. Surprisingly, spastic patients, who have reduced strength and additionally spasticity in clinical examination, and flaccid neurologic patients showed the same offset. Patients with additional lack of trunk control (Tetraplegia) showed the largest offset. CONCLUSION Gait kinematics grossly depend on muscle strength. This was seen in patients with very different pathologies. Nevertheless, optimal correction of biomechanics and muscle strength may still not lead to a normal gait, especially in that of neurologic patients. The basic disease itself has an additional effect on gait deviations expressed as a GPS-offset of the linear regression line.
Resumo:
The purpose of this study was to compare the validity and output of the biaxial ActiGraph GT1M and the triaxial GT3X (ActiGraph, LLC, Pensacola, FL, USA)accelerometer in 5- to 9-year-old children. Thirty-two children wore the two monitors while their energy expenditure was measured with indirect calorimetry. They performed four locomotor and four play activities in an exercise laboratory and were further measured during 12 minutes of a sports lesson. Validity evidence in relation to indirect calorimetry was examined with linear regression equations applied to the laboratory data. During the sports lessons predicted energy expenditure according to the regression equations was compared to measured energy expenditure with the Wilcoxon-signed rank test and the Spearman correlation. To compare the output, agreement between counts of the two monitors during the laboratory activities was assessed with Bland-Altman plots. The evidence of validity was similar for both monitors. Agreement between the output of the two monitors was good for vertical counts (mean bias = −14 ± 22 counts) but not for horizontal counts (−17 ± 32 counts). The current results indicate that the two accelerometer models are able to estimate energy expenditure of a range of physical activities equally well in young children. However, they show output differences for movement in the horizontal direction.
Resumo:
Background Non-adherence is one of the strongest predictors of therapeutic failure in HIV-positive patients. Virologic failure with subsequent emergence of resistance reduces future treatment options and long-term clinical success. Methods Prospective observational cohort study including patients starting new class of antiretroviral therapy (ART) between 2003 and 2010. Participants were naïve to ART class and completed ≥1 adherence questionnaire prior to resistance testing. Outcomes were development of any IAS-USA, class-specific, or M184V mutations. Associations between adherence and resistance were estimated using logistic regression models stratified by ART class. Results Of 314 included individuals, 162 started NNRTI and 152 a PI/r regimen. Adherence was similar between groups with 85% reporting adherence ≥95%. Number of new mutations increased with increasing non-adherence. In NNRTI group, multivariable models indicated a significant linear association in odds of developing IAS-USA (odds ratio (OR) 1.66, 95% confidence interval (CI): 1.04-2.67) or class-specific (OR 1.65, 95% CI: 1.00-2.70) mutations. Levels of drug resistance were considerably lower in PI/r group and adherence was only significantly associated with M184V mutations (OR 8.38, 95% CI: 1.26-55.70). Adherence was significantly associated with HIV RNA in PI/r but not NNRTI regimens. Conclusion Therapies containing PI/r appear more forgiving to incomplete adherence compared with NNRTI regimens, which allow higher levels of resistance, even with adherence above 95%. However, in failing PI/r regimens good adherence may prevent accumulation of further resistance mutations and therefore help to preserve future drug options. In contrast, adherence levels have little impact on NNRTI treatments once the first mutations have emerged.
Resumo:
Tables of estimated regression coefficients, usually accompanied by additional information such as standard errors, t-statistics, p-values, confidence intervals or significance stars, have long been the preferred way of communicating results from statistical models. In recent years, however, the limits of this form of exposition have been increasingly recognized. For example, interpretation of regression tables can be very challenging in the presence of complications such as interaction effects, categorical variables, or nonlinear functional forms. Furthermore, while these issues might still be manageable in the case of linear regression, interpretational difficulties can be overwhelming in nonlinear models such as logistic regression. To facilitate sensible interpretation of such models it is often necessary to compute additional results such as marginal effects, predictive margins, or contrasts. Moreover, smart graphical displays of results can be very valuable in making complex relations accessible. A number of helpful commands geared at supporting these tasks have been recently introduced in Stata, making elaborate interpretation and communication of regression results possible without much extra effort. Examples of such commands are -margins-, -contrasts-, and -marginsplot-. In my talk, I will discuss the capabilities of these commands and present a range of examples illustrating their use.
Resumo:
Objective. Essential hypertension affects 25% of the US adult population and is a leading contributor to morbidity and mortality. Because BP is a multifactorial phenotype that resists simple genetic analysis, intermediate phenotypes within the complex network of BP regulatory systems may be more accessible to genetic dissection. The Renin-Angiotensin System (RAS) is known to influence intermediate and long-term blood pressure regulation through alterations in vascular tone and renal sodium and fluid resorption. This dissertation examines associations between renin (REN), angiotensinogen (AGT), angiotensin-converting enzyme (ACE) and angiotensin II type 1 receptor (AT1) gene variation and interindividual differences in plasma hormone levels, renal hemodynamics, and BP homeostasis.^ Methods. A total of 150 unrelated men and 150 unrelated women, between 20.0 and 49.9 years of age and free of acute or chronic illness except for a history of hypertension (11 men and 7 women, all off medications), were studied after one week on a controlled sodium diet. RAS plasma hormone levels, renal hemodynamics and BP were determined prior to and during angiotensin II (Ang II) infusion. Individuals were genotyped by PCR for a variable number tandem repeat (VNTR) polymorphism in REN, and for the following restriction fragment length polymorphisms (RFLP): AGT M235T, ACE I/D, and AT1 A1166C. Associations between clinical measurements and allelic variation were examined using multiple linear regression statistical models.^ Results. Women homozygous for the AT1 1166C allele demonstrated higher intracellular levels of sodium (p = 0.044). Men homozygous for the AGT T235 allele demonstrated a blunted decrement in renal plasma flow in response to Ang II infusion (p = 0.0002). There were no significant associations between RAS gene variation and interindividual variation in RAS plasma hormone levels or BP.^ Conclusions. Rather than identifying new BP controlling genes or alleles, the study paradigm employed in this thesis (i.e., measured genes, controlled environments and interventions) may provide mechanistic insight into how candidate genes affect BP homeostasis. ^
Resumo:
BACKGROUND Rising levels of overweight and obesity are important public-health concerns worldwide. The purpose of this study is to elucidate their prevalence and trends in Switzerland by analyzing variations in Body Mass Index (BMI) of Swiss conscripts. METHODS The conscription records were provided by the Swiss Army. This study focussed on conscripts 18.5-20.5 years of age from the seven one-year birth cohorts spanning the period 1986-1992. BMI across professional status, area-based socioeconomic position (abSEP), urbanicity and regions was analyzed. Two piecewise quantile regression models with linear splines for three birth-cohort groups were used to examine the association of median BMI with explanatory variables and to determine the extent to which BMI has varied over time. RESULTS The study population consisted of 188,537 individuals. Median BMI was 22.51 kg/m2 (22.45-22.57 95% confidence interval (CI)). BMI was lower among conscripts of high professional status (-0.46 kg/m2; 95% CI: -0.50, -0.42, compared with low), living in areas of high abSEP (-0.11 kg/m2; 95% CI: -0.16, -0.07 compared to medium) and from urban communities (-0.07 kg/m2; 95% CI: -0.11, -0.03, compared with peri-urban). Comparing with Midland, median BMI was highest in the North-West (0.25 kg/m2; 95% CI: 0.19-0.30) and Central regions (0.11 kg/m2; 95% CI: 0.05-0.16) and lowest in the East (-0.19 kg/m2; 95% CI: -0.24, -0.14) and Lake Geneva regions (-0.15 kg/m2; 95% CI: -0.20, -0.09). Trajectories of regional BMI growth varied across birth cohorts, with median BMI remaining high in the Central and North-West regions, whereas stabilization and in some cases a decline were observed elsewhere. CONCLUSIONS BMI of Swiss conscripts is associated with individual and abSEP and urbanicity. Results show regional variation in the levels and temporal trajectories of BMI growth and signal their possible slowdown among recent birth cohorts.
Resumo:
The comparison of radiotherapy techniques regarding secondary cancer risk has yielded contradictory results possibly stemming from the many different approaches used to estimate risk. The purpose of this study was to make a comprehensive evaluation of different available risk models applied to detailed whole-body dose distributions computed by Monte Carlo for various breast radiotherapy techniques including conventional open tangents, 3D conformal wedged tangents and hybrid intensity modulated radiation therapy (IMRT). First, organ-specific linear risk models developed by the International Commission on Radiological Protection (ICRP) and the Biological Effects of Ionizing Radiation (BEIR) VII committee were applied to mean doses for remote organs only and all solid organs. Then, different general non-linear risk models were applied to the whole body dose distribution. Finally, organ-specific non-linear risk models for the lung and breast were used to assess the secondary cancer risk for these two specific organs. A total of 32 different calculated absolute risks resulted in a broad range of values (between 0.1% and 48.5%) underlying the large uncertainties in absolute risk calculation. The ratio of risk between two techniques has often been proposed as a more robust assessment of risk than the absolute risk. We found that the ratio of risk between two techniques could also vary substantially considering the different approaches to risk estimation. Sometimes the ratio of risk between two techniques would range between values smaller and larger than one, which then translates into inconsistent results on the potential higher risk of one technique compared to another. We found however that the hybrid IMRT technique resulted in a systematic reduction of risk compared to the other techniques investigated even though the magnitude of this reduction varied substantially with the different approaches investigated. Based on the epidemiological data available, a reasonable approach to risk estimation would be to use organ-specific non-linear risk models applied to the dose distributions of organs within or near the treatment fields (lungs and contralateral breast in the case of breast radiotherapy) as the majority of radiation-induced secondary cancers are found in the beam-bordering regions.