Digital Library

920 results for predictive regression model

Hybrid Water Demand Forecasting Model Associating Artificial Neural Network with Fourier Series

Relevance: 90.00%

Abstract:

This paper addressed the problem of water-demand forecasting for real-time operation of water supply systems. The present study was conducted to identify the best-fit model using hourly consumption data from the water supply system of Araraquara, São Paulo, Brazil. Artificial neural networks (ANNs) were used in view of their enhanced capability to match or even improve on the regression model forecasts. The ANNs used were the multilayer perceptron with the back-propagation algorithm (MLP-BP), the dynamic neural network (DAN2), and two hybrid ANNs. The hybrid models used the error produced by the Fourier series forecasting as input to the MLP-BP and DAN2, called ANN-H and DAN2-H, respectively. The tested inputs for the neural network were selected on the basis of the literature and correlation analysis. The results from the hybrid models were promising, with DAN2 performing better than the tested MLP-BP models. DAN2-H, identified as the best model, produced a mean absolute error (MAE) of 3.3 L/s and 2.8 L/s for the training and test sets, respectively, for the prediction of the next hour, which represented about 12% of the average consumption. The best forecasting model for the next 24 hours was again DAN2-H, which outperformed the other compared models and produced an MAE of 3.1 L/s and 3.0 L/s for the training and test sets, respectively, which represented about 12% of the average consumption. DOI: 10.1061/(ASCE)WR.1943-5452.0000177. (C) 2012 American Society of Civil Engineers.

Why Are Women With Cervical Cancer Not Being Diagnosed in Preinvasive Phase? An Analysis of Risk Factors Using a Hierarchical Model

Relevance: 90.00%

Abstract:

Objective: To assess the risk factors for delayed diagnosis of uterine cervical lesions. Materials and Methods: This is a case-control study that recruited 178 women at 2 Brazilian hospitals. The cases (n = 74) were women with a late diagnosis of a lesion in the uterine cervix (invasive carcinoma in any stage). The controls (n = 104) were women with cervical lesions diagnosed early on (low- or high-grade intraepithelial lesions). The analysis was performed by means of a logistic regression model using a hierarchical model. The socioeconomic and demographic variables were included at level I (distal). Level II (intermediate) included the personal and family antecedents and knowledge about the Papanicolaou test and human papillomavirus. Level III (proximal) encompassed the variables relating to individuals' care for their own health, gynecologic symptoms, and variables relating to access to the health care system. Results: The risk factors for late diagnosis of uterine cervical lesions were age older than 40 years (odds ratio [OR], 10.4; 95% confidence interval [CI], 2.3-48.4), not knowing the difference between the Papanicolaou test and gynecological pelvic examinations (OR, 2.5; 95% CI, 1.3-4.9), not thinking that the Papanicolaou test was important (OR, 4.2; 95% CI, 1.3-13.4), and abnormal vaginal bleeding (OR, 15.0; 95% CI, 6.5-35.0). Previous treatment for sexually transmissible disease was a protective factor (OR, 0.3; 95% CI, 0.1-0.8) for delayed diagnosis. Conclusions: Deficiencies in cervical cancer prevention programs in developing countries are not simply a matter of better provision and coverage of Papanicolaou tests. The misconception about the Papanicolaou test is a serious educational problem, as demonstrated by the present study.

On the impact of disproportional samples in credit scoring models: An application to a Brazilian bank data

Relevance: 90.00%

Abstract:

Statistical methods have been widely employed to assess the capabilities of credit scoring classification models in order to reduce the risk of wrong decisions when granting credit facilities to clients. The predictive quality of a classification model can be evaluated based on measures such as sensitivity, specificity, predictive values, accuracy, correlation coefficients and information-theoretic measures, such as relative entropy and mutual information. In this paper we analyze the performance of a naive logistic regression model (Hosmer & Lemeshow, 1989) and a logistic regression with state-dependent sample selection model (Cramer, 2004) applied to simulated data. Also, as a case study, the methodology is illustrated on a data set extracted from a Brazilian bank portfolio. Our simulation results so far reveal no statistically significant difference in predictive capacity between the naive logistic regression models and the logistic regression with state-dependent sample selection models. However, there is a strong difference between the distributions of the estimated default probabilities from these two statistical modeling techniques, with the naive logistic regression models always underestimating such probabilities, particularly in the presence of balanced samples. (C) 2012 Elsevier Ltd. All rights reserved.

Morphometric analysis of fetal development of Cavia porcellus (Linnaeus, 1758) by ultrasonography: pilot study

Relevance: 90.00%

Abstract:

Measurements of the growth process and placental development of the embryos and fetuses of Cavia porcellus were carried out using ultrasonography. Embryo, fetus, and placenta were monitored from Day 15 after mating to the end of gestation. Based on linear and quadratic regressions, the following morphometric measurements were good indicators of gestational age: placental diameter, biparietal diameter, renal length, and crown-rump length. The embryonic heartbeat was first detected at an average of 22.5 days. The placental diameter showed constant increase from the beginning of gestation then remained to term, and presented a quadratic correlation with gestational age (r² = 0.89). Mean placental diameter at the end of pregnancy was 3.5 ± 0.23 cm. By Day 30, it was possible to measure biparietal diameter, which followed a linear pattern of increase up to the end of gestation (r² = 0.95). Mean biparietal diameter at the end of pregnancy was 1.94 ± 0.03 cm. Kidneys were first observed on Day 35 as hyperechoic structures without distinction between medullary and cortical layers; the regression model equation between kidney length and gestational age presents a quadratic relationship (r² = 0.7). The crown-rump length presented simple linear growth, starting from 15 days of gestation, and displayed a high correlation with gestational age (r² = 0.9). The offspring were born after an average gestation of 61.3 days. We conclude that biparietal diameter, placental diameter, and crown-rump length are adequate predictive parameters of gestational age in guinea pigs because they present a high correlation index.
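The comparison of linear and quadratic regressions by r² described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names `polyfit` and `r_squared` and the sample measurements are hypothetical, and the fit uses ordinary least squares via the normal equations, which is an assumption about the method.

```python
def polyfit(xs, ys, degree):
    """Least-squares polynomial coefficients [c0, c1, ...] via the normal equations."""
    m = degree + 1
    # Augmented matrix [X^T X | X^T y] for the polynomial design matrix X.
    a = [[sum(x ** (i + j) for x in xs) for j in range(m)]
         + [sum(y * x ** i for x, y in zip(xs, ys))]
         for i in range(m)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(m):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [v - f * p for v, p in zip(a[r], a[col])]
    return [a[i][m] / a[i][i] for i in range(m)]


def r_squared(xs, ys, coeffs):
    """Coefficient of determination of the fitted polynomial."""
    mean_y = sum(ys) / len(ys)
    preds = [sum(c * x ** k for k, c in enumerate(coeffs)) for x in xs]
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Hypothetical data: gestational age (days) and a measurement (cm) that
# follows an exact quadratic curve, purely for illustration.
days = [15, 20, 25, 30, 35, 40, 45, 50, 55, 60]
size = [0.5 + 0.1 * d - 0.001 * d ** 2 for d in days]

r2_linear = r_squared(days, size, polyfit(days, size, 1))
r2_quadratic = r_squared(days, size, polyfit(days, size, 2))
```

On data of this shape the quadratic fit yields the higher r², which is the kind of evidence the abstract cites for choosing a quadratic relationship for placental diameter and kidney length.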

A Multilevel Model with Time Series Components for the Analysis of Tribal Art Prices

Relevance: 90.00%

Abstract:

In the present work we perform an econometric analysis of the Tribal art market. To this aim, we use a unique and original database that includes information on Tribal art market auctions worldwide from 1998 to 2011. In the literature, art prices are modelled through the hedonic regression model, a classic fixed-effect model. The main drawback of the hedonic approach is the large number of parameters, since, in general, art data include many categorical variables. In this work, we propose a multilevel model for the analysis of Tribal art prices that takes into account the influence of time on artwork prices. In fact, it is natural to assume that time exerts an influence over the price dynamics in various ways. Nevertheless, since the set of objects changes at every auction date, we do not have repeated measurements of the same items over time. Hence, the dataset does not constitute a proper panel; rather, it has a two-level structure in that items (level-1 units) are grouped in time points (level-2 units). The main theoretical contribution is the extension of classical multilevel models to cope with the case described above. In particular, we introduce a model with time-dependent random effects at the second level. We propose a novel specification of the model, derive the maximum likelihood estimators and implement them through the E-M algorithm. We test the finite sample properties of the estimators and the validity of our own R code by means of a simulation study. Finally, we show that the new model considerably improves the fit of the Tribal art data with respect to both the hedonic regression model and the classic multilevel model.

Predicting adverse events in children with fever and chemotherapy-induced neutropenia: the prospective multicenter SPOG 2003 FN study

Relevance: 90.00%

Abstract:

PURPOSE: To develop a score predicting the risk of adverse events (AEs) in pediatric patients with cancer who experience fever and neutropenia (FN) and to evaluate its performance. PATIENTS AND METHODS: Pediatric patients with cancer presenting with FN induced by nonmyeloablative chemotherapy were observed in a prospective multicenter study. A score predicting the risk of future AEs (ie, serious medical complication, microbiologically defined infection, radiologically confirmed pneumonia) was developed from a multivariate mixed logistic regression model. Its cross-validated predictive performance was compared with that of published risk prediction rules. RESULTS: An AE was reported in 122 (29%) of 423 FN episodes. In 57 episodes (13%), the first AE was known only after reassessment after 8 to 24 hours of inpatient management. Predicting AE at reassessment was better than prediction at presentation with FN. A differential leukocyte count did not increase the predictive performance. The score predicting future AE in 358 episodes without known AE at reassessment used the following four variables: preceding chemotherapy more intensive than acute lymphoblastic leukemia maintenance (weight = 4), hemoglobin ≥ 90 g/L (weight = 5), leukocyte count less than 0.3 G/L (weight = 3), and platelet count less than 50 G/L (weight = 3). A score (sum of weights) ≥ 9 predicted future AEs. The cross-validated performance of this score exceeded the performance of published risk prediction rules. At an overall sensitivity of 92%, 35% of the episodes were classified as low risk, with a specificity of 45% and a negative predictive value of 93%. CONCLUSION: This score, based on four routinely accessible characteristics, accurately identifies pediatric patients with cancer with FN at risk for AEs after reassessment.
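The weighted score in this abstract amounts to a short sum with a cut-off, which the following minimal Python sketch illustrates. The weights and thresholds are taken from the abstract; the `Episode` class, its field names, and the function names are hypothetical, and the sketch is only an illustration of the arithmetic, not the study's implementation.

```python
from dataclasses import dataclass


@dataclass
class Episode:
    """One FN episode at reassessment (field names are hypothetical)."""
    intensive_chemo: bool       # chemotherapy more intensive than ALL maintenance
    hemoglobin_g_per_l: float   # g/L
    leukocytes_g_per_l: float   # G/L
    platelets_g_per_l: float    # G/L


def risk_score(ep: Episode) -> int:
    """Sum of the four weights reported in the abstract."""
    score = 0
    if ep.intensive_chemo:
        score += 4
    if ep.hemoglobin_g_per_l >= 90:
        score += 5
    if ep.leukocytes_g_per_l < 0.3:
        score += 3
    if ep.platelets_g_per_l < 50:
        score += 3
    return score


def predicts_adverse_event(ep: Episode, threshold: int = 9) -> bool:
    """A score at or above the threshold (9 in the abstract) predicts a future AE."""
    return risk_score(ep) >= threshold


# Hypothetical episodes for illustration only.
high = Episode(True, 95.0, 0.2, 40.0)    # 4 + 5 + 3 + 3 = 15
low = Episode(True, 80.0, 0.2, 100.0)    # 4 + 0 + 3 + 0 = 7
```

With these inputs, `predicts_adverse_event(high)` is true and `predicts_adverse_event(low)` is false, since only the first score reaches the cut-off of 9.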

Comparison of two scoring systems for evaluation of treatment outcome in patients with complete bilateral cleft lip and palate

Relevance: 90.00%

Abstract:

Objective: To compare two scoring systems, the Huddart/Bodenham system (HB system) and the Bauru-BCLP yardstick (BCLP yardstick), which classify treatment outcome in terms of dental arch relationships in patients with complete bilateral cleft lip and palate (CBCLP). The predictive value of these scoring systems for treatment outcome was also evaluated. Design: Retrospective longitudinal study. Patients: Dental arch relationships of 43 CBCLP patients were evaluated at 6, 9, and 12 years. Setting: Treatment outcome in BCLP patients using two scoring systems. Main Outcome Measures: For each age group, the HB scores were correlated with the BCLP yardstick scores using Spearman's correlation coefficient. The predictive value of the two scoring systems was evaluated by backward regression analysis. Results: Intraobserver Kappa values for the BCLP yardstick scoring for the two observers were .506 and .627, respectively, and the interobserver reliability ranged from .427 to .581. The intraobserver reliability for the HB system ranged from .92 to .97 and the interobserver reliability from .88 to .96. The BCLP yardstick scores at 6 and 9 years together were predictors for the outcome at 12 years (explained variance 41.3%). Adding the incisor and lateral HB scores to the regression model increased the explained variance to 67%. Conclusions: The BCLP yardstick and the HB system are reliable scoring systems for evaluation of dental arch relationships of CBCLP patients. The HB system categorizes treatment outcome into similar categories as the BCLP yardstick. If a more sensitive measure of treatment outcome is needed, both scoring systems should be used selectively.

Prediction of Psychosis by Mismatch Negativity

Relevance: 90.00%

Abstract:

BACKGROUND: To develop risk-adapted prevention of psychosis, an accurate estimation of the individual risk of psychosis at a given time is needed. Inclusion of biological parameters into multilevel prediction models is thought to improve predictive accuracy of models on the basis of clinical variables. To this aim, mismatch negativity (MMN) was investigated in a sample clinically at high risk, comparing individuals with and without subsequent conversion to psychosis. METHODS: At baseline, an auditory oddball paradigm was used in 62 subjects meeting criteria of a late at-risk state who remained antipsychotic-naive throughout the study. Median follow-up period was 32 months (minimum of 24 months in nonconverters, n = 37). Repeated-measures analysis of covariance was employed to analyze the MMN recorded at frontocentral electrodes; additional comparisons with healthy controls (HC, n = 67) and first-episode schizophrenia patients (FES, n = 33) were performed. Predictive value was evaluated by a Cox regression model. RESULTS: Compared with nonconverters, duration MMN in converters (n = 25) showed significantly reduced amplitudes across the six frontocentral electrodes; the same applied in comparison with HC, but not FES, whereas the duration MMN in nonconverters was comparable to HC and larger than in FES. A prognostic score was calculated based on a Cox regression model and stratified into two risk classes, which showed significantly different survival curves. CONCLUSIONS: Our findings demonstrate that the duration MMN is significantly reduced in at-risk subjects converting to first-episode psychosis compared with nonconverters and may contribute not only to the prediction of conversion but also to a more individualized risk estimation and thus risk-adapted prevention.

A prediction model for assessing residential radon concentration in Switzerland

Relevance: 90.00%

Abstract:

Indoor radon is regularly measured in Switzerland. However, a nationwide model to predict residential radon levels has not been developed. The aim of this study was to develop a prediction model to assess indoor radon concentrations in Switzerland. The model was based on 44,631 measurements from the nationwide Swiss radon database collected between 1994 and 2004. Of these, a randomly selected 80% of measurements were used for model development and the remaining 20% for an independent model validation. A multivariable log-linear regression model was fitted and relevant predictors selected according to evidence from the literature, the adjusted R², Akaike's information criterion (AIC), and the Bayesian information criterion (BIC). The prediction model was evaluated by calculating the Spearman rank correlation between measured and predicted values. Additionally, the predicted values were categorised into three categories (<50th, 50th-90th and >90th percentile) and compared with measured categories using a weighted Kappa statistic. The most relevant predictors for indoor radon levels were tectonic units and year of construction of the building, followed by soil texture, degree of urbanisation, floor of the building where the measurement was taken and housing type (P-values <0.001 for all). Mean predicted radon values (geometric mean) were 66 Bq/m³ (interquartile range 40-111 Bq/m³) in the lowest exposure category, 126 Bq/m³ (69-215 Bq/m³) in the medium category, and 219 Bq/m³ (108-427 Bq/m³) in the highest category. Spearman correlation between predictions and measurements was 0.45 (95%-CI: 0.44; 0.46) for the development dataset and 0.44 (95%-CI: 0.42; 0.46) for the validation dataset. Kappa coefficients were 0.31 for the development and 0.30 for the validation dataset, respectively. The model explained 20% of the overall variability (adjusted R²).
In conclusion, this residential radon prediction model, based on a large number of measurements, was demonstrated to be robust through validation with an independent dataset. The model is appropriate for predicting radon level exposure of the Swiss population in epidemiological research. Nevertheless, some exposure misclassification and regression to the mean are unavoidable and should be taken into account in future applications of the model.
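Two steps of the evaluation described in this abstract, the random 80%/20% split and the Spearman rank correlation between measured and predicted values, can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; the function names are hypothetical, the seed is an arbitrary assumption, and the sketch does not reproduce the study's regression model or data.

```python
import random


def split_80_20(records, seed=0):
    """Randomly assign 80% of records to development and 20% to validation."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


def ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / (var_a * var_b) ** 0.5


def spearman(measured, predicted):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(measured), ranks(predicted))


development, validation = split_80_20(range(100))
```

In a workflow like the one described, the model would be fitted on `development` only, and `spearman` would then be computed separately on each subset to check that the two correlations are similar.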

Model Evaluation Based on the Distribution of Estimated Absolute Prediction Error

Relevance: 90.00%

Abstract:

The construction of a reliable, practically useful prediction rule for future response is heavily dependent on the "adequacy" of the fitted regression model. In this article, we consider the absolute prediction error, the expected value of the absolute difference between the future and predicted responses, as the model evaluation criterion. This prediction error is easier to interpret than the average squared error and is equivalent to the mis-classification error for the binary outcome. We show that the distributions of the apparent error and its cross-validation counterparts are approximately normal even under a misspecified fitted model. When the prediction rule is "unsmooth", the variance of the above normal distribution can be estimated well via a perturbation-resampling method. We also show how to approximate the distribution of the difference of the estimated prediction errors from two competing models. With two real examples, we demonstrate that the resulting interval estimates for prediction errors provide much more information about model adequacy than the point estimates alone.
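The two point estimates discussed in this abstract, the apparent absolute prediction error and its cross-validated counterpart, can be illustrated with a minimal Python sketch. It assumes a simple one-predictor least-squares line as the fitted model and an interleaved K-fold scheme; both are illustrative assumptions, the function names are hypothetical, and the perturbation-resampling interval estimates from the article are not implemented here.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least-squares intercept and slope for one predictor."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return my - slope * mx, slope


def apparent_error(xs, ys):
    """Mean absolute error on the same data used to fit the model."""
    intercept, slope = fit_line(xs, ys)
    return mean(abs(y - (intercept + slope * x)) for x, y in zip(xs, ys))


def cv_error(xs, ys, k=5):
    """K-fold cross-validated mean absolute prediction error."""
    n = len(xs)
    errors = []
    for fold in range(k):
        held_out = set(range(fold, n, k))  # every k-th observation
        train_x = [x for i, x in enumerate(xs) if i not in held_out]
        train_y = [y for i, y in enumerate(ys) if i not in held_out]
        intercept, slope = fit_line(train_x, train_y)
        errors.extend(abs(ys[i] - (intercept + slope * xs[i])) for i in held_out)
    return mean(errors)


# Hypothetical data: a noisy linear response, for illustration only.
xs = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
ys = [1.0, 3.5, 4.5, 7.5, 9.0, 10.5, 13.5, 15.0, 16.5, 19.5]

apparent = apparent_error(xs, ys)
cross_validated = cv_error(xs, ys, k=5)
```

Both quantities are in the units of the response, which is the interpretability advantage over squared error that the abstract mentions.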

A Diagnostic Test for the Mixing Distribution in a Generalised Linear Mixed Model

Relevance: 90.00%

Abstract:

We introduce a diagnostic test for the mixing distribution in a generalised linear mixed model. The test is based on the difference between the marginal maximum likelihood and conditional maximum likelihood estimates of a subset of the fixed effects in the model. We derive the asymptotic variance of this difference, and propose a test statistic that has a limiting chi-square distribution under the null hypothesis that the mixing distribution is correctly specified. For the important special case of the logistic regression model with random intercepts, we evaluate via simulation the power of the test in finite samples under several alternative distributional forms for the mixing distribution. We illustrate the method by applying it to data from a clinical trial investigating the effects of hormonal contraceptives in women.

Checking Assumptions in Latent Class Regression Models via a Markov Chain Monte Carlo Estimation Approach: An Application to Depression and Socio-Economic Status

Relevance: 90.00%

Abstract:

Latent class regression models are useful tools for assessing associations between covariates and latent variables. However, evaluation of key model assumptions cannot be performed using methods from standard regression models due to the unobserved nature of latent outcome variables. This paper presents graphical diagnostic tools to evaluate whether or not latent class regression models adhere to standard assumptions of the model: conditional independence and non-differential measurement. An integral part of these methods is the use of a Markov Chain Monte Carlo estimation procedure. Unlike standard maximum likelihood implementations for latent class regression model estimation, the MCMC approach allows us to calculate posterior distributions and point estimates of any functions of parameters. It is this convenience that allows us to provide the diagnostic methods that we introduce. As a motivating example we present an analysis focusing on the association between depression and socioeconomic status, using data from the Epidemiologic Catchment Area study. We consider a latent class regression analysis investigating the association between depression and socioeconomic status measures, where the latent variable depression is regressed on education and income indicators, in addition to age, gender, and marital status variables. While the fitted latent class regression model yields interesting results, the model parameters are found to be invalid due to the violation of model assumptions. The violation of these assumptions is clearly identified by the presented diagnostic plots. These methods can be applied to standard latent class and latent class regression models, and the general principle can be extended to evaluate model assumptions in other types of models.

CD4+ T-cell count increase in HIV-1-infected patients with suppressed viral load within 1 year after start of antiretroviral therapy

Relevance: 90.00%

Abstract:

BACKGROUND: CD4+ T-cell recovery in patients with continuous suppression of plasma HIV-1 viral load (VL) is highly variable. This study aimed to identify predictive factors for long-term CD4+ T-cell increase in treatment-naive patients starting combination antiretroviral therapy (cART). METHODS: Treatment-naive patients in the Swiss HIV Cohort Study reaching two VL measurements <50 copies/ml >3 months apart during the 1st year of cART were included (n = 1816 patients). We studied CD4+ T-cell dynamics until the end of suppression or up to 5 years, subdivided into three periods: 1st year, years 2-3 and years 4-5 of suppression. Multiple median regression adjusted for repeated CD4+ T-cell measurements was used to study the dependence of CD4+ T-cell slopes on clinical covariates and drug classes. RESULTS: Median CD4+ T-cell increases following VL suppression were 87, 52 and 19 cells/µl per year in the three periods. In the multiple regression model, median CD4+ T-cell increases over all three periods were significantly higher for female gender, lower age, higher VL at cART start, CD4+ T-cell <650 cells/µl at start of the period and low CD4+ T-cell increase in the previous period. Patients on tenofovir showed significantly lower CD4+ T-cell increases compared with stavudine. CONCLUSIONS: In our observational study, long-term CD4+ T-cell increase in drug-naive patients with suppressed VL was higher in regimens without tenofovir. The clinical relevance of these findings must be confirmed in, ideally, clinical trials or large, collaborative cohort projects but could influence treatment of older patients and those starting cART at low CD4+ T-cell levels.

Towards greater accuracy in individual-tree mortality regression

Relevance: 90.00%

Abstract:

Background mortality is an essential component of any forest growth and yield model. Forecasts of mortality contribute largely to the variability and accuracy of model predictions at the tree, stand and forest level. In the present study, I implement and evaluate state-of-the-art techniques to increase the accuracy of individual tree mortality models, similar to those used in many of the current variants of the Forest Vegetation Simulator, using data from North Idaho and Montana. The first technique addresses methods to correct for bias induced by measurement error typically present in competition variables. The second implements survival regression and evaluates its performance against the traditional logistic regression approach. I selected the regression calibration (RC) algorithm as a good candidate for addressing the measurement error problem. Two logistic regression models for each species were fitted, one ignoring the measurement error, which is the "naïve" approach, and the other applying RC. The models fitted with RC outperformed the naïve models in terms of discrimination when the competition variable was found to be statistically significant. The effect of RC was more obvious where measurement error variance was large and for more shade-intolerant species. The process of model fitting and variable selection revealed that past emphasis on DBH as a predictor variable for mortality, while producing models with strong metrics of fit, may make models less generalizable. The evaluation of the error variance estimator developed by Stage and Wykoff (1998), which is core to the implementation of RC, in different spatial patterns and diameter distributions revealed that the Stage and Wykoff estimate notably overestimated the true variance in all simulated stands except those that are clustered. Results show a systematic bias even when all the assumptions made by the authors are guaranteed.
I argue that this is the result of the Poisson-based estimate ignoring the overlapping area of potential plots around a tree. The effects of the variance estimate, especially in the application phase, justify future efforts to improve its accuracy. The second technique implemented and evaluated is a survival regression model that accounts for the time-dependent nature of variables, such as diameter and competition variables, and the interval-censored nature of data collected from remeasured plots. The performance of the model is compared with the traditional logistic regression model as a tool to predict individual tree mortality. Validation of both approaches shows that the survival regression approach discriminates better between dead and alive trees for all species. In conclusion, I showed that the proposed techniques do increase the accuracy of individual tree mortality models, and are a promising first step towards the next generation of background mortality models. I have also identified the next steps to undertake in order to advance mortality models further.

Heart rate response determines long term exercise capacity after heart transplantation

Relevance: 90.00%

Abstract:

BACKGROUND: Exercise capacity after heart transplantation (HTx) remains limited despite normal left ventricular systolic function of the allograft. Various clinical and haemodynamic parameters are predictive of exercise capacity following HTx. However, the predictive significance of chronotropic competence has not been demonstrated unequivocally despite its immediate relevance for cardiac output. AIMS: This study assesses the predictive value of various clinical and haemodynamic parameters for exercise capacity in HTx recipients with complete chronotropic competence evolving within the first 6 postoperative months. METHODS: 51 patients were enrolled in this exercise study. Patients were included if they were more than 6 months post-HTx and had no negative chronotropic medication or factors limiting exercise capacity such as significant transplant vasculopathy or allograft rejection. Clinical parameters were obtained by chart review, haemodynamic parameters from current cardiac catheterisation, and exercise capacity was assessed by treadmill stress testing. A stepwise multiple regression model analysed the proportion of the variance explained by the predictive parameters. RESULTS: The mean age of these 51 HTx recipients was 55.4 ± 13.2 years on inclusion, 42 patients were male and the mean time interval after cardiac transplantation was 5.1 ± 2.8 years. Five independent predictors explained 47.5% of the variance observed for peak exercise capacity (adjusted R² = 0.475). In detail, heart rate response explained 31.6%, male gender 5.2%, age 4.1%, pulmonary vascular resistance 3.7%, and body-mass index 2.9%. CONCLUSION: Heart rate response is one of the most important predictors of exercise capacity in HTx recipients with complete chronotropic competence and without relevant transplant vasculopathy or acute allograft rejection.
