134 resultados para Linear models (Statistics)
em Universit
Resumo:
An important statistical development of the last 30 years has been the advance in regression analysis provided by generalized linear models (GLMs) and generalized additive models (GAMs). Here we introduce a series of papers prepared within the framework of an international workshop entitled: Advances in GLMs/GAMs modeling: from species distribution to environmental management, held in Riederalp, Switzerland, 6-11 August 2001.We first discuss some general uses of statistical models in ecology, as well as provide a short review of several key examples of the use of GLMs and GAMs in ecological modeling efforts. We next present an overview of GLMs and GAMs, and discuss some of their related statistics used for predictor selection, model diagnostics, and evaluation. Included is a discussion of several new approaches applicable to GLMs and GAMs, such as ridge regression, an alternative to stepwise selection of predictors, and methods for the identification of interactions by a combined use of regression trees and several other approaches. We close with an overview of the papers and how we feel they advance our understanding of their application to ecological modeling.
Resumo:
BACKGROUND: We sought to improve upon previously published statistical modeling strategies for binary classification of dyslipidemia for general population screening purposes based on the waist-to-hip circumference ratio and body mass index anthropometric measurements. METHODS: Study subjects were participants in WHO-MONICA population-based surveys conducted in two Swiss regions. Outcome variables were based on the total serum cholesterol to high density lipoprotein cholesterol ratio. The other potential predictor variables were gender, age, current cigarette smoking, and hypertension. The models investigated were: (i) linear regression; (ii) logistic classification; (iii) regression trees; (iv) classification trees (iii and iv are collectively known as "CART"). Binary classification performance of the region-specific models was externally validated by classifying the subjects from the other region. RESULTS: Waist-to-hip circumference ratio and body mass index remained modest predictors of dyslipidemia. Correct classification rates for all models were 60-80%, with marked gender differences. Gender-specific models provided only small gains in classification. The external validations provided assurance about the stability of the models. CONCLUSIONS: There were no striking differences between either the algebraic (i, ii) vs. non-algebraic (iii, iv), or the regression (i, iii) vs. classification (ii, iv) modeling approaches. Anticipated advantages of the CART vs. simple additive linear and logistic models were less than expected in this particular application with a relatively small set of predictor variables. CART models may be more useful when considering main effects and interactions between larger sets of predictor variables.
Resumo:
STUDY DESIGN: Prospective, controlled, observational outcome study using clinical, radiographic, and patient/physician-based questionnaire data, with patient outcomes at 12 months follow-up. OBJECTIVE: To validate appropriateness criteria for low back surgery. SUMMARY OF BACKGROUND DATA: Most surgical treatment failures are attributed to poor patient selection, but no widely accepted consensus exists on detailed indications for appropriate surgery. METHODS: Appropriateness criteria for low back surgery have been developed by a multispecialty panel using the RAND appropriateness method. Based on panel criteria, a prospective study compared outcomes of patients appropriately and inappropriately treated at a single institution with 12 months follow-up assessment. Included were patients with low back pain and/or sciatica referred to the neurosurgical department. Information about symptoms, neurologic signs, the health-related quality of life (SF-36), disability status (Roland-Morris), and pain intensity (VAS) was assessed at baseline, at 6 months, and at 12 months follow-up. The appropriateness criteria were administered prospectively to each clinical situation and outside of the clinical setting, with the surgeon and patients blinded to the results of the panel decision. The patients were further stratified into 2 groups: appropriate treatment group (ATG) and inappropriate treatment group (ITG). RESULTS: Overall, 398 patients completed all forms at 12 months. Treatment was considered appropriate for 365 participants and inappropriate for 33 participants. The mean improvement in the SF-36 physical component score at 12 months was significantly higher in the ATG (mean: 12.3 points) than in the ITG (mean: 6.8 points) (P = 0.01), as well as the mean improvement in the SF-36 mental component score (ATG mean: 5.0 points; ITG mean: -0.5 points) (P = 0.02). Improvement was also significantly higher in the ATG for the mean VAS back pain (ATG mean: 2.3 points; ITG mean: 0.8 points; P = 0.02) and Roland-Morris disability score (ATG mean: 7.7 points; ITG mean: 4.2 points; P = 0.004). The ATG also had a higher improvement in mean VAS for sciatica (4.0 points) than the ITG (2.8 points), but the difference was not significant (P = 0.08). The SF-36 General Health score declined in both groups after 12 months, however, the decline was worse in the ITG (mean decline: 8.2 points) than in the ATG (mean decline: 1.2 points) (P = 0.04). Overall, in comparison to ITG patients, ATG patients had significantly higher improvement at 12 months, both statistically and clinically. CONCLUSION: In comparison to previously reported literature, our study is the first to assess the utility of appropriateness criteria for low back surgery at 1-year follow-up with multiple outcome dimensions. Our results confirm the hypothesis that application of appropriateness criteria can significantly improve patient outcomes.
Resumo:
BACKGROUND: To date, there is no quality assurance program that correlates patient outcome to perfusion service provided during cardiopulmonary bypass (CPB). A score was devised, incorporating objective parameters that would reflect the likelihood to influence patient outcome. The purpose was to create a new method for evaluating the quality of care the perfusionist provides during CPB procedures and to deduce whether it predicts patient morbidity and mortality. METHODS: We analysed 295 consecutive elective patients. We chose 10 parameters: fluid balance, blood transfused, Hct, ACT, PaO2, PaCO2, pH, BE, potassium and CPB time. Distribution analysis was performed using the Shapiro-Wilcoxon test. This made up the PerfSCORE and we tried to find a correlation to mortality rate, patient stay in the ICU and length of mechanical ventilation. Univariate analysis (UA) using linear regression was established for each parameter. Statistical significance was established when p < 0.05. Multivariate analysis (MA) was performed with the same parameters. RESULTS: The mean age was 63.8 +/- 12.6 years with 70% males. There were 180 CABG, 88 valves, and 27 combined CABG/valve procedures. The PerfSCORE of 6.6 +/- 2.4 (0-20), mortality of 2.7% (8/295), CPB time 100 +/- 41 min (19-313), ICU stay 52 +/- 62 hrs (7-564) and mechanical ventilation of 10.5 +/- 14.8 hrs (0-564) was calculated. CPB time, fluid balance, PaO2, PerfSCORE and blood transfused were significantly correlated to mortality (UA, p < 0.05). Also, CPB time, blood transfused and PaO2 were parameters predicting mortality (MA, p < 0.01). Only pH was significantly correlated for predicting ICU stay (UA). Ultrafiltration (UF) and CPB time were significantly correlated (UA, p < 0.01) while UF (p < 0.05) was the only parameter predicting mechanical ventilation duration (MA). CONCLUSIONS: CPB time, blood transfused and PaO2 are independent risk factors of mortality. Fluid balance, blood transfusion, PaO2, PerfSCORE and CPB time are independent parameters for predicting morbidity. PerfSCORE is a quality of perfusion measure that objectively quantifies perfusion performance.
Resumo:
1. Model-based approaches have been used increasingly in conservation biology over recent years. Species presence data used for predictive species distribution modelling are abundant in natural history collections, whereas reliable absence data are sparse, most notably for vagrant species such as butterflies and snakes. As predictive methods such as generalized linear models (GLM) require absence data, various strategies have been proposed to select pseudo-absence data. However, only a few studies exist that compare different approaches to generating these pseudo-absence data. 2. Natural history collection data are usually available for long periods of time (decades or even centuries), thus allowing historical considerations. However, this historical dimension has rarely been assessed in studies of species distribution, although there is great potential for understanding current patterns, i.e. the past is the key to the present. 3. We used GLM to model the distributions of three 'target' butterfly species, Melitaea didyma, Coenonympha tullia and Maculinea teleius, in Switzerland. We developed and compared four strategies for defining pools of pseudo-absence data and applied them to natural history collection data from the last 10, 30 and 100 years. Pools included: (i) sites without target species records; (ii) sites where butterfly species other than the target species were present; (iii) sites without butterfly species but with habitat characteristics similar to those required by the target species; and (iv) a combination of the second and third strategies. Models were evaluated and compared by the total deviance explained, the maximized Kappa and the area under the curve (AUC). 4. Among the four strategies, model performance was best for strategy 3. Contrary to expectations, strategy 2 resulted in even lower model performance compared with models with pseudo-absence data simulated totally at random (strategy 1). 5. Independent of the strategy model, performance was enhanced when sites with historical species presence data were not considered as pseudo-absence data. Therefore, the combination of strategy 3 with species records from the last 100 years achieved the highest model performance. 6. Synthesis and applications. The protection of suitable habitat for species survival or reintroduction in rapidly changing landscapes is a high priority among conservationists. Model-based approaches offer planning authorities the possibility of delimiting priority areas for species detection or habitat protection. The performance of these models can be enhanced by fitting them with pseudo-absence data relying on large archives of natural history collection species presence data rather than using randomly sampled pseudo-absence data.
Resumo:
PURPOSE: Bioaerosols and their constituents, such as endotoxins, are capable of causing an inflammatory reaction at the level of the lung-blood barrier, which becomes more permeable. Thus, it was hypothesized that occupational exposure to bioaerosols can increase leakage of surfactant protein-D (SP-D), a lung-specific protein, into the bloodstream. METHODS: SP-D was determined by ELISA in 316 wastewater workers, 67 garbage collectors, and 395 control subjects. Exposure was assessed with four interview-based indicators and by preliminary endotoxin measurements using the Limulus amoebocyte lysate assay. Influence of exposure on serum SP-D was assessed by multiple linear regression considering smoking, glomerular function, lung diseases, obesity, and other confounders. RESULTS: Overall, mean exposure levels to endotoxins were below 100 EU/m(3). However, special tasks of wastewater workers caused higher endotoxin exposure. SP-D concentration was slightly increased in this occupational group and associated with the occurrence of splashes and contact to raw sewage. No effect was found in garbage collectors. Smoking increased serum SP-D. No clinically relevant correlation between spirometry results and SP-D concentrations appeared. CONCLUSIONS: These results support the hypothesis that inhalation of bioaerosols, even at low concentrations, has a subclinical effect on the lung-blood barrier, the permeability of which increases without associated spirometric changes.
Resumo:
OBJECTIVES: Age- and height-adjusted spirometric lung function of South Asian children is lower than those of white children. It is unclear whether this is purely genetic, or partly explained by the environment. In this study, we assessed whether cultural factors, socioeconomic status, intrauterine growth, environmental exposures, or a family and personal history of wheeze contribute to explaining the ethnic differences in spirometric lung function. METHODS: We studied children aged 9 to 14 years from a population-based cohort, including 1088 white children and 275 UK-born South Asians. Log-transformed spirometric data were analyzed using multiple linear regressions, adjusting for anthropometric factors. Five different additional models adjusted for (1) cultural factors, (2) indicators of socioeconomic status, (3) perinatal data reflecting intrauterine growth, (4) environmental exposures, and (5) personal and family history of wheeze. RESULTS: Height- and gender-adjusted forced vital capacity (FVC) and forced expired volume in 1 second (FEV1) were lower in South Asian than white children (relative difference -11% and -9% respectively, P < .001), but PEF and FEF50 were similar (P ≥ .5). FEV1/FVC was higher in South Asians (1.8%, P < .001). These differences remained largely unchanged in all 5 alternative models. CONCLUSIONS: Our study confirmed important differences in lung volumes between South Asian and white children. These were not attenuated after adjustment for cultural and socioeconomic factors and intrauterine growth, neither were they explained by differences in environmental exposures nor a personal or family history of wheeze. This suggests that differences in lung function may be mainly genetic in origin. The implication is that ethnicity-specific predicted values remain important specifically for South Asian children.
Resumo:
Western European landscapes have drastically changed since the 1950s, with agricultural intensifications and the spread of urban settlements considered the most important drivers of this land-use/land-cover change. Losses of habitat for fauna and flora have been a direct consequence of this development. In the present study, we relate butterfly occurrence to land-use/land-cover changes over five decades between 1951 and 2000. The study area covers the entire Swiss territory. The 10 explanatory variables originate from agricultural statistics and censuses. Both state as well as rate was used as explanatory variables. Species distribution data were obtained from natural history collections. We selected eight butterfly species: four species occur on wetlands and four occur on dry grasslands. We used cluster analysis to track land-use/land-cover changes and to group communes based on similar trajectories of change. Generalized linear models were applied to identify factors that were significantly correlated with the persistence or disappearance of butterfly species. Results showed that decreasing agricultural areas and densities of farms with more than 10 ha of cultivated land are significantly related with wetland species decline, and increasing densities of livestock seem to have favored disappearance of dry grassland species. Moreover, we show that species declines are not only dependent on land-use/land-cover states but also on the rates of change; that is, the higher the transformation rate from small to large farms, the higher the loss of dry grassland species. We suggest that more attention should be paid to the rates of landscape change as feasible drivers of species change and derive some management suggestions.
Multimodel inference and multimodel averaging in empirical modeling of occupational exposure levels.
Resumo:
Empirical modeling of exposure levels has been popular for identifying exposure determinants in occupational hygiene. Traditional data-driven methods used to choose a model on which to base inferences have typically not accounted for the uncertainty linked to the process of selecting the final model. Several new approaches propose making statistical inferences from a set of plausible models rather than from a single model regarded as 'best'. This paper introduces the multimodel averaging approach described in the monograph by Burnham and Anderson. In their approach, a set of plausible models are defined a priori by taking into account the sample size and previous knowledge of variables influent on exposure levels. The Akaike information criterion is then calculated to evaluate the relative support of the data for each model, expressed as Akaike weight, to be interpreted as the probability of the model being the best approximating model given the model set. The model weights can then be used to rank models, quantify the evidence favoring one over another, perform multimodel prediction, estimate the relative influence of the potential predictors and estimate multimodel-averaged effects of determinants. The whole approach is illustrated with the analysis of a data set of 1500 volatile organic compound exposure levels collected by the Institute for work and health (Lausanne, Switzerland) over 20 years, each concentration having been divided by the relevant Swiss occupational exposure limit and log-transformed before analysis. Multimodel inference represents a promising procedure for modeling exposure levels that incorporates the notion that several models can be supported by the data and permits to evaluate to a certain extent model selection uncertainty, which is seldom mentioned in current practice.
Resumo:
OBJECTIVE: Renal cytochrome P450 3A5 (CYP3A5) activity has been associated with blood pressure and salt sensitivity in humans. We determined whether CYP3A5 polymorphisms are associated with ambulatory blood pressure (ABP) and with glomerular filtration rate (GFR) in African families. METHODS: Using a cross-sectional design, 375 individuals from 72 families, each with at least two hypertensive siblings, were recruited through a hypertension register in the Seychelles (Indian Ocean). We analyzed the association between the CYP3A5 alleles (*1, *3, *6 and *7) and ABP, GFR and renal sodium handling (fractional excretion of lithium), from pedigree data, allowing for other covariates and familial correlations. RESULTS: CYP3A5*1 carriers increased their daytime systolic and diastolic ABP with age (0.55 and 0.23 mmHg/year) more than non-carriers (0.21 and 0.04 mmHg/year). CYP3A5*1 had a significant main effect on daytime systolic/diastolic ABP [regression coefficient (SE): -29.6 (10.0)/-8.2 (4.1) mmHg, P = 0.003/0.045, respectively] and this effect was modified by age (CYP3A5*1 x age interactions, P = 0.017/0.018). For night-time ABP, the effect of CYP3A5*1 was modified by urinary sodium excretion, not by age. For renal function, CYP3A5*1 carriers had a 7.6(3.8) ml/min lower GFR (P = 0.045) than non-carriers. Proximal sodium reabsorption decreased with age in non-carriers, but not in CYP3A5*1 carriers (P for interaction = 0.02). CONCLUSIONS: These data demonstrate that CYP3A5 polymorphisms are associated with ambulatory BP, CYP3A5*1 carriers showing a higher age- and sodium- related increase in ABP than non-carriers. The age effect may be due, in part, to the action of CYP3A5 on renal sodium handling.
Resumo:
Models predicting species spatial distribution are increasingly applied to wildlife management issues, emphasising the need for reliable methods to evaluate the accuracy of their predictions. As many available datasets (e.g. museums, herbariums, atlas) do not provide reliable information about species absences, several presence-only based analyses have been developed. However, methods to evaluate the accuracy of their predictions are few and have never been validated. The aim of this paper is to compare existing and new presenceonly evaluators to usual presence/absence measures. We use a reliable, diverse, presence/absence dataset of 114 plant species to test how common presence/absence indices (Kappa, MaxKappa, AUC, adjusted D-2) compare to presenceonly measures (AVI, CVI, Boyce index) for evaluating generalised linear models (GLM). Moreover we propose a new, threshold-independent evaluator, which we call "continuous Boyce index". All indices were implemented in the B10MAPPER software. We show that the presence-only evaluators are fairly correlated (p > 0.7) to the presence/absence ones. The Boyce indices are closer to AUC than to MaxKappa and are fairly insensitive to species prevalence. In addition, the Boyce indices provide predicted-toexpected ratio curves that offer further insights into the model quality: robustness, habitat suitability resolution and deviation from randomness. This information helps reclassifying predicted maps into meaningful habitat suitability classes. The continuous Boyce index is thus both a complement to usual evaluation of presence/absence models and a reliable measure of presence-only based predictions.
Resumo:
PURPOSE: Not in Education, Employment, or Training (NEET) youth are youth disengaged from major social institutions and constitute a worrying concern. However, little is known about this subgroup of vulnerable youth. This study aimed to examine if NEET youth differ from other contemporaries in terms of personality, mental health, and substance use and to provide longitudinal examination of NEET status, testing its stability and prospective pathways with mental health and substance use. METHODS: As part of the Cohort Study on Substance Use Risk Factors, 4,758 young Swiss men in their early 20s answered questions concerning their current professional and educational status, personality, substance use, and symptomatology related to mental health. Descriptive statistics, generalized linear models for cross-sectional comparisons, and cross-lagged panel models for longitudinal associations were computed. RESULTS: NEET youth were 6.1% at baseline and 7.4% at follow-up with 1.4% being NEET at both time points. Comparisons between NEET and non-NEET youth showed significant differences in substance use and depressive symptoms only. Longitudinal associations showed that previous mental health, cannabis use, and daily smoking increased the likelihood of being NEET. Reverse causal paths were nonsignificant. CONCLUSIONS: NEET status seemed to be unlikely and transient among young Swiss men, associated with differences in mental health and substance use but not in personality. Causal paths presented NEET status as a consequence of mental health and substance use rather than a cause. Additionally, this study confirmed that cannabis use and daily smoking are public health problems. Prevention programs need to focus on these vulnerable youth to avoid them being disengaged.
Resumo:
OBJECTIVE: To investigate the association between fear of falling and gait performance in well-functioning older persons. DESIGN: Survey. SETTING: Community. PARTICIPANTS: Subjects (N=860, aged 65-70y) were a subsample of participants enrolled in a cohort study who underwent gait measurements. INTERVENTIONS: Not applicable. MAIN OUTCOME MEASURES: Fear of falling and its severity were assessed by 2 questions about fear and related activity restriction. Gait performance, including gait variability, was measured using body-fixed sensors. RESULTS: Overall, 29.6% (210/860) of the participants reported fear of falling, with 5.2% (45/860) reporting activity restriction. Fear of falling was associated with reduced gait performance, including increased gait variability. A gradient in gait performance was observed from participants without fear to those reporting fear without activity restriction and those reporting both fear and activity restriction. For instance, stride velocity decreased from 1.15+/-.15 to 1.11+/-.17 to 1.00+/-.19 m/s (P<.001) in participants without fear, with fear but no activity restriction and with fear and activity restriction, respectively. In multivariate analysis, fear of falling with activity restriction remained associated with reduced gait performance, independent of sex, comorbidity, functional status, falls history, and depressive symptoms. CONCLUSIONS: In these well-functioning older people, those reporting fear of falling with activity restriction had reduced gait performance and increased gait variability, independent of health and functional status. These relationships suggest that early interventions targeting fear of falling might potentially help to prevent its adverse consequences on mobility and function in similar populations.
Resumo:
We assessed decision-making capacity and emotional reactivity in 20 patients with multiple sclerosis (MS) and in 16 healthy subjects using the Gambling Task (GT), a model of real-life decision making, and the skin conductance response (SCR). Demographic, neurological, affective, and cognitive parameters were analyzed in MS patients for their effect on decision-making performance. MS patients persisted longer (slope, -3.6%) than the comparison group (slope, -6.4%) in making disadvantageous choices as the GT progressed (p < 0.001), suggesting significant slower learning in MS. Patients with higher Expanded Disability Status Scale scores (EDSS >2.0) showed a different pattern of impairment in the learning process compared with patients with lower functional impairment (EDSS </=2.0). This slower learning was associated with impaired emotional reactivity (anticipatory SCR 3.9 vs 6.1 microSiemens [microS] for patients vs the comparison group, p < 0.0001; post-choice SCR 3.9 vs 6.2 microS, p < 0.0001), but not with executive dysfunction. Impaired emotional dimensions of behavior (assessed using the Dysexecutive Questionnaire, p < 0.002) also correlated with slower learning. Given the considerable consequences that impaired decision making can have on daily life, we suggest that this factor may contribute to handicap and altered quality of life secondary to MS and is dependent on emotional experience. Ann Neurol 2004.