851 results for Area under the ROC curve
Abstract:
In active learning, a machine learning algorithm is given an unlabeled set of examples U, and is allowed to request labels for a relatively small subset of U to use for training. The goal is then to judiciously choose which examples in U to have labeled in order to optimize some performance criterion, e.g. classification accuracy. We study how active learning affects the area under the ROC curve (AUC). We examine two existing algorithms from the literature and present our own active learning algorithms designed to maximize the AUC of the hypothesis. One of our algorithms was consistently the top performer, and Closest Sampling from the literature often came in second behind it. When good posterior probability estimates were available, our heuristics were by far the best.
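As an illustration of the criterion these algorithms target, the AUC of a scoring classifier can be read as the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one. The sketch below computes it by direct pair counting; the function name, labels, and scores are hypothetical and do not come from the study.

```python
def auc_from_scores(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical labels (1 = positive) and classifier scores, for illustration only.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
auc = auc_from_scores(labels, scores)
print(f"AUC = {auc:.3f}")
```

The quadratic pair loop is chosen for readability; a rank-based computation gives the same value more efficiently on large samples.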
Abstract:
Background: Renal transplant recipients were noted to appear cushingoid while on low doses of steroid as part of a triple therapy immunosuppression of cyclosporin A (CsA), prednisolone, and azathioprine. Methods: The study group comprised adult renal transplant recipients with stable graft function who had received their renal allograft a minimum of 1 year previously (43 studies undertaken in 22 men and 20 women) with median daily prednisone dose of 7 mg (range 3-10). The control group was healthy nontransplant subjects [median dose 10 mg (10-30)]. Prednisolone bioavailability was measured using a limited 6-hour area under the curve (AUC), with prednisolone measured using specific HPLC assay. Results: The median prednisolone AUC/mg dose for all transplant recipients was significantly greater than the control group by approximately 50% (316 nmol·h/L/mg prednisolone versus 218). AUC was significantly higher in female recipients (median 415 versus 297 for men) and in recipients receiving cyclosporin (348 versus 285). The highest AUC was in women on estrogen supplements who were receiving cyclosporin (median 595). A significantly higher proportion of patients on triple therapy had steroid side effects compared with those on steroid and azathioprine (17/27 versus 4/15), more women than men had side effects (14/16 versus 7/22), and the AUC/mg prednisone was greater in those with side effects than without (median 377 versus 288 nmol·h/L/mg). Discussion: The results are consistent with the hypothesis that CsA increases the bioavailability of prednisolone, most likely through inhibition of beta-glycoprotein. The increased exposure to steroid increased the side-effect profile of steroids in the majority of patients. Because the major contributor to AUC is the maximum postdose concentration, it may be possible to use single-point monitoring (2 hours postdose) for routine clinical studies.
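Note that the AUC in this abstract is a pharmacokinetic concentration-time area, not an ROC area. The abstract does not state how the limited 6-hour AUC was integrated; the sketch below assumes the linear trapezoidal rule, and the sampling times, concentrations, and dose are hypothetical values chosen only to show the arithmetic behind a dose-normalised AUC in nmol·h/L/mg.

```python
def trapezoid_auc(times_h, conc_nmol_per_l):
    """Area under a concentration-time curve by the linear trapezoidal rule."""
    if len(times_h) != len(conc_nmol_per_l):
        raise ValueError("times and concentrations must have the same length")
    area = 0.0
    for i in range(1, len(times_h)):
        dt = times_h[i] - times_h[i - 1]
        if dt <= 0:
            raise ValueError("sampling times must be strictly increasing")
        # Each interval contributes its width times the mean concentration.
        area += dt * (conc_nmol_per_l[i] + conc_nmol_per_l[i - 1]) / 2.0
    return area


# Hypothetical sampling schedule and concentrations (assumptions, not study data).
times_h = [0, 1, 2, 4, 6]            # hours after the dose
conc = [50, 600, 700, 400, 200]      # prednisolone, nmol/L
dose_mg = 7                          # daily dose, mg

auc_0_6 = trapezoid_auc(times_h, conc)   # nmol·h/L
auc_per_mg = auc_0_6 / dose_mg           # nmol·h/L/mg
print(f"AUC(0-6 h) = {auc_0_6:.0f} nmol·h/L, {auc_per_mg:.1f} nmol·h/L/mg")
```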
Abstract:
The high morbidity and mortality associated with atherosclerotic coronary vascular disease (CVD) and its complications are being lessened by the increased knowledge of risk factors, effective preventative measures and proven therapeutic interventions. However, significant CVD morbidity remains and sudden cardiac death continues to be a presenting feature for some subsequently diagnosed with CVD. Coronary vascular disease is also the leading cause of anaesthesia related complications. Stress electrocardiography/exercise testing is predictive of 10 year risk of CVD events and the cardiovascular variables used to score this test are monitored peri-operatively. Similar physiological time-series datasets are being subjected to data mining methods for the prediction of medical diagnoses and outcomes. This study aims to find predictors of CVD using anaesthesia time-series data and patient risk factor data. Several pre-processing and predictive data mining methods are applied to this data. Physiological time-series data related to anaesthetic procedures are subjected to pre-processing methods for removal of outliers, calculation of moving averages as well as data summarisation and data abstraction methods. Feature selection methods of both wrapper and filter types are applied to derived physiological time-series variable sets alone and to the same variables combined with risk factor variables. The ability of these methods to identify subsets of highly correlated but non-redundant variables is assessed. The major dataset is derived from the entire anaesthesia population and subsets of this population are considered to be at increased anaesthesia risk based on their need for more intensive monitoring (invasive haemodynamic monitoring and additional ECG leads). 
Because of the unbalanced class distribution in the data, majority class under-sampling and the Kappa statistic, together with misclassification rate and area under the ROC curve (AUC), are used for evaluation of models generated using different prediction algorithms. The performance of models derived from feature-reduced datasets reveals the filter method, Cfs subset evaluation, to be most consistently effective, although Consistency-derived subsets tended to yield slightly increased accuracy but markedly increased complexity. The use of misclassification rate (MR) for model performance evaluation is influenced by class distribution. This influence could be eliminated by consideration of the AUC or Kappa statistic as well as by evaluation of subsets with under-sampled majority class. The noise and outlier removal pre-processing methods produced models with MR ranging from 10.69 to 12.62, with the lowest value being for data from which both outliers and noise were removed (MR 10.69). For the raw time-series dataset, MR is 12.34. Feature selection results in reduction in MR to 9.8 to 10.16, with time segmented summary data (dataset F) MR being 9.8 and raw time-series summary data (dataset A) being 9.92. However, for all time-series only based datasets, the complexity is high. For most pre-processing methods, Cfs could identify a subset of correlated and non-redundant variables from the time-series alone datasets but models derived from these subsets are of one leaf only. MR values are consistent with class distribution in the subset folds evaluated in the n-fold cross-validation method. For models based on Cfs selected time-series derived and risk factor (RF) variables, the MR ranges from 8.83 to 10.36 with dataset RF_A (raw time-series data and RF) being 8.85 and dataset RF_F (time segmented time-series variables and RF) being 9.09. 
The models based on counts of outliers and counts of data points outside normal range (Dataset RF_E) and derived variables based on time series transformed using Symbolic Aggregate Approximation (SAX) with associated time-series pattern cluster membership (Dataset RF_G) perform the least well with MR of 10.25 and 10.36 respectively. For coronary vascular disease prediction, nearest neighbour (NNge) and the support vector machine based method, SMO, have the highest MR of 10.1 and 10.28 while logistic regression (LR) and the decision tree (DT) method, J48, have MR of 8.85 and 9.0 respectively. DT rules are most comprehensible and clinically relevant. The predictive accuracy increase achieved by addition of risk factor variables to time-series variable based models is significant. The addition of time-series derived variables to models based on risk factor variables alone is associated with a trend to improved performance. Data mining of feature reduced, anaesthesia time-series variables together with risk factor variables can produce compact and moderately accurate models able to predict coronary vascular disease. Decision tree analysis of time-series data combined with risk factor variables yields rules which are more accurate than models based on time-series data alone. The limited additional value provided by electrocardiographic variables when compared to use of risk factors alone is similar to recent suggestions that exercise electrocardiography (exECG) under standardised conditions has limited additional diagnostic value over risk factor analysis and symptom pattern. The pre-processing used in this study had limited effect when time-series variables and risk factor variables are used as model input. 
In the absence of risk factor input, the use of time-series variables after outlier removal and time series variables based on physiological variable values’ being outside the accepted normal range is associated with some improvement in model performance.
Abstract:
PURPOSE. Scanning laser tomography with the Heidelberg retina tomograph (HRT; Heidelberg Engineering, Heidelberg, Germany) has been proposed as a useful diagnostic test for glaucoma. This study was conducted to evaluate the quality of reporting of published studies using the HRT for diagnosing glaucoma. METHODS. A validated Medline and hand search of English-language articles reporting on measures of diagnostic accuracy of the HRT for glaucoma was performed. Two reviewers selected and appraised the papers independently. The Standards for Reporting of Diagnostic Accuracy (STARD) checklist was used to evaluate the quality of each publication. RESULTS. A total of 29 articles were included. Interobserver rating agreement was observed in 83% of items (κ = 0.76). The number of STARD items properly reported ranged from 5 to 18. Less than a third of studies (7/29) explicitly reported more than half of the STARD items. Descriptions of key aspects of the methodology were frequently missing. For example, the design of the study (prospective or retrospective) was reported in 6 of 29 studies, and details of participant sampling (e.g., consecutive or random selection) were described in 5 of 29 publications. The commonest description of diagnostic accuracy was sensitivity and specificity (25/29) followed by area under the ROC curve (13/29), with 9 of 29 publications reporting both. CONCLUSIONS. The quality of reporting of diagnostic accuracy tests for glaucoma with HRT is suboptimal. The STARD initiative may be a useful tool for appraising the strengths and weaknesses of diagnostic accuracy studies. Copyright © Association for Research in Vision and Ophthalmology.
American Society of Anesthesiologists Score: Still Useful After 60 Years? Results of the EuSOS Study
Abstract:
OBJECTIVE: The European Surgical Outcomes Study described mortality following in-patient surgery. Several factors were identified that were able to predict poor outcomes in a multivariate analysis. These included age, procedure urgency, severity and type and the American Society of Anesthesiologists (ASA) score. This study describes in greater detail the relationship between the ASA score and postoperative mortality. METHODS: Patients in this 7-day cohort study were enrolled in April 2011. Consecutive patients aged 16 years and older undergoing inpatient non-cardiac surgery with a recorded ASA score in 498 hospitals across 28 European nations were included and followed up for a maximum of 60 days. The primary endpoint was in-hospital mortality. Decision tree analysis with the CHAID (SPSS) system was used to delineate nodes associated with mortality. RESULTS: The study enrolled 46,539 patients. Due to missing values, 873 patients were excluded, resulting in the analysis of 45,666 patients. Increasing ASA scores were associated with increased admission rates to intensive care and higher mortality rates. Despite a progressive relationship with mortality, discrimination was poor, with an area under the ROC curve of 0.658 (95% CI 0.642 - 0.6775). Using regression trees (CHAID), we identified four discrete ASA nodes associated with mortality, with ASA 1 and ASA 2 compressed into the same node. CONCLUSION: The ASA score can be used to determine higher risk groups of surgical patients, but clinicians cannot use the score to discriminate between grades 1 and 2. Overall, the discriminatory power of the model was less than acceptable for widespread use.
Abstract:
QUESTIONS UNDER STUDY: The diagnostic significance of clinical symptoms/signs of influenza has mainly been assessed in the context of controlled studies with stringent inclusion criteria. There was a need to extend the evaluation of these predictors not only in the context of general practice but also according to the duration of symptoms and to the dynamics of the epidemic. PRINCIPLES: A prospective study conducted in the Medical Outpatient Clinic in the winter season 1999-2000. Patients with influenza-like syndrome were included, as long as the primary care physician envisaged the diagnosis of influenza. The physician administered a questionnaire, a throat swab was performed and a culture acquired to document the diagnosis of influenza. RESULTS: 201 patients were included in the study. 52% were culture positive for influenza. By univariate analysis, temperature >37.8 degrees C (OR 4.2; 95% CI 2.3-7.7), duration of symptoms <48 hours (OR 3.2; 1.8-5.7), cough (OR 3.2; 1-10.4) and myalgia (OR 2.8; 1.0-7.5) were associated with a diagnosis of influenza. In a multivariable logistic analysis, the best model predicting influenza was the association of a duration of symptom <48 hours, medical attendance at the beginning of the epidemic (weeks 49-50), fever >37.8 and cough, with a sensitivity of 79%, specificity of 69%, positive predictive value of 67%, negative predictive value of 73% and an area under the ROC curve of 0.74. CONCLUSIONS: Besides relevant symptoms and signs, the physician should also consider the duration of symptoms and the epidemiological context (start, peak or end of the epidemic) in his appraisal, since both parameters considerably modify the value of the clinical predictors when assessing the probability of a patient having influenza.
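The accuracy measures reported for the multivariable model (sensitivity, specificity, positive and negative predictive value) all derive from a single 2x2 table of predicted against culture-confirmed status. The following sketch shows that arithmetic; the function name and the counts are hypothetical and are not reconstructed from the study data.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard accuracy measures from a 2x2 table of counts.

    tp/fn: diseased patients classified positive/negative.
    fp/tn: non-diseased patients classified positive/negative.
    """
    return {
        "sensitivity": tp / (tp + fn),   # diseased correctly flagged
        "specificity": tn / (tn + fp),   # non-diseased correctly cleared
        "ppv": tp / (tp + fp),           # positives that are truly diseased
        "npv": tn / (tn + fn),           # negatives that are truly non-diseased
    }


# Hypothetical counts for illustration only (not the study's data).
metrics = diagnostic_metrics(tp=80, fp=30, fn=20, tn=70)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Unlike sensitivity and specificity, the predictive values depend on disease prevalence in the sample, which is one reason the timing within the epidemic matters when interpreting them.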
Abstract:
Objectives: The aim of this in vitro study was to assess the inter- and intra-examiner reproducibility and the accuracy of the International Caries Detection and Assessment System-II (ICDAS-II) in detecting occlusal caries. Methods: One hundred and sixty-three molars were independently assessed twice by two experienced dentists using the 0- to 6-graded ICDAS-II. The teeth were histologically prepared and classified using two different histological systems [Ekstrand et al. (1997) Caries Research vol. 31, pp. 224-231; Lussi et al. (1999) Caries Research vol. 33, pp. 261-266] and assessed for caries extension. Sensitivity, specificity, accuracy and area under the ROC curve (A(z)) were obtained at D(2) and D(3) thresholds. Unweighted kappa coefficient was used to assess inter- and intra-examiner reproducibility. Results: For the Ekstrand et al. histological classification the sensitivity was 0.99 and 1.00, specificity 1.00 and 0.69 and accuracy 0.99 and 0.76 at D(2) and D(3), respectively. For the Lussi et al. histological classification the sensitivity was 0.91 and 0.75, specificity 0.47 and 0.62 and accuracy 0.86 and 0.68 at D(2) and D(3), respectively. The A(z) varied from 0.54 to 0.73. The inter- and intra-examiner kappa values were 0.51 and 0.58, respectively. Conclusions: ICDAS-II presented good reproducibility and accuracy in detecting occlusal caries, especially caries lesions in the outer half of the enamel.
Abstract:
Objective: To determine the ability of the reduced form of a screening instrument, the Patient Health Questionnaire-2 (PHQ-2), to assess the presence of depressive disorders in patients admitted to a general hospital. Method: A sample of 227 patients admitted to the clinical wards of a Brazilian general university hospital were assessed with Module A of the Diagnostic Structured Interview for the DSM-IV (SCID-IV) and filled out the PHQ-9 and PHQ-2. Results: The PHQ-2 demonstrated an area under the ROC curve of 0.89 (p < 0.0001), with a cutoff point of three or more being the one that best balanced the sensitivity (0.86) and specificity (0.75) values. The agreement index between the PHQ-2 and Module A of SCID-IV was 78.4% and the Kappa value was 0.51. Regarding reliability, the Cronbach alpha value obtained was 0.64 and the intraclass correlation coefficient was 0.52. Conclusion: PHQ-2 proved to be an instrument with good psychometric properties comparable to those of PHQ-9, being superior to the latter regarding the rate of false-positive results. In addition, it is a brief instrument that elicits little resistance on the part of the patient, is inexpensive and requires little time. It can therefore be of important help to treatment teams in detecting depressive disorder, and is suitable for incorporation into hospital admission protocols, possibly favoring more immediate interventions. (Int'l J. Psychiatry in Medicine 2012;44:141-148)
Abstract:
Objective: To validate the 2000 Bernstein Parsonnet (2000BP) and additive EuroSCORE (ES) to predict mortality in patients who underwent coronary bypass surgery and/or heart valve surgery at the Heart Institute, University of São Paulo (InCor/HC-FMUSP). Methods: A prospective observational design was used. We analyzed 3000 consecutive patients who underwent coronary bypass surgery and/or heart valve surgery between May 2007 and July 2009 at the InCor/HC-FMUSP. Mortality was calculated with the 2000BP and ES models. The correlation between estimated mortality and observed mortality was validated by calibration and discrimination tests. Results: There were significant differences in the prevalence of risk factors between the study population, 2000BP and ES. Patients were stratified into five groups for 2000BP and three for the ES. In the validation of models, the ES showed good calibration (P = 0.396); however, the 2000BP (P = 0.047) proved inadequate. In discrimination, the area under the ROC curve proved to be good for both models, ES (0.79) and 2000BP (0.80). Conclusion: In the validation, 2000BP proved questionable and ES appropriate to predict mortality in patients who underwent coronary bypass surgery and/or heart valve surgery at the InCor/HC-FMUSP.
Abstract:
Introduction: An epidemiological study was undertaken to identify determinant factors in the occurrence of American cutaneous leishmaniasis in areas under the influence of hydroelectric plants in Paranapanema river, State of Paraná, Brazil. The ecological aspects of the phlebotomine fauna were investigated. Methods: Sandflies were sampled with automatic light traps from February 2004 to June 2006 at 25 sites in the urban and rural areas of Itambaracá, and in Porto Almeida and São Joaquim do Pontal. Results: A total of 3,187 sandflies of 15 species were captured. Nyssomyia neivai predominated (34.4%), followed by Pintomyia pessoai (32.6%), Migonemyia migonei (11.6%), Nyssomyia whitmani (8.8%), and Pintomyia fischeri (2.7%), all implicated in the transmission of Leishmania. Males predominated for Ny. neivai, and females for the other vector species, with significant statistical differences (p < 0.001). Nyssomyia neivai, Pi. pessoai, Ny. whitmani, Brumptomyia brumpti, Mg. migonei, and Pi. fischeri presented the highest values for the Standardized Species Abundance Index (SSAI). The highest frequencies and diversities were found in the preserved forest in Porto Almeida, followed by forests with degradation in São Joaquim do Pontal and Vila Rural. Conclusions: Sandflies were captured in all localities, with the five vectors predominating. Ny. neivai had its highest frequencies in nearby peridomestic environments and Pi. pessoai in areas of preserved forests. The highest SSAI values of Ny. neivai and Pi. pessoai reflect their wider dispersion and higher frequencies compared with other species, which seems to indicate that these two species may be transmitting leishmaniasis in the area.
Abstract:
Background: To identify the most appropriate cut-off points of fasting glycemia for the screening of diabetes mellitus type 2 (DM2) by comparing the properties of capillary glycemia (CG) and venous blood plasma glycemia (PG) in a population of Japanese origin from the community of Mombuca, Guatapará - SP, Brazil. Methods: This was a population-based descriptive cross-sectional study conducted on a sample of 131 individuals of both genders aged 20 years or more (66.8% of the target population). CG was measured with a glucometer in a blood sample obtained from the fingertip and PG was determined by an enzymatic method (hexokinase) in venous blood plasma, after a 10-14 hour fast in both cases. Data were analyzed by the receiver operating characteristic (ROC) curve in order to identify the best cut-off point for fasting glycemia (CG and PG) for the diagnosis of DM, using the 2-hour plasma glycemia > 200 mg/dl as gold standard. Results: The ROC curve revealed that the best cut-off point for the screening of DM was 110 mg/dl for CG and 105 mg/dl for PG, values that would optimize the relation between individuals with positive and false-positive results. The area under the ROC curve was 0.814 for CG (p < 0.01) and 0.836 for PG (p < 0.01). Conclusions: The cut-off points of 105 mg/dl (5.8 mmol/l) for PG and of 110 mg/dl (6.1 mmol/l) for CG appear to be the most appropriate for the screening of DM2 in the population under study, with emphasis on the fact that the value recommended for CG is 5 mg/dl higher than that for PG, in contrast to WHO recommendations.
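The abstract does not say which criterion was used to read the best cut-off from the ROC curve. One common choice, assumed here purely for illustration, is Youden's index J = sensitivity + specificity - 1, maximised over candidate thresholds. The function name and all glycemia values and reference diagnoses below are hypothetical.

```python
def best_cutoff_youden(values, has_disease):
    """Return (cutoff, sensitivity, specificity) maximising Youden's J.

    A subject is classified positive when value >= cutoff. Candidate cutoffs
    are the observed values; on ties in J the lowest cutoff is kept.
    """
    n_pos = sum(1 for d in has_disease if d)
    n_neg = len(has_disease) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need both diseased and non-diseased subjects")
    best = None
    for cutoff in sorted(set(values)):
        tp = sum(1 for v, d in zip(values, has_disease) if d and v >= cutoff)
        tn = sum(1 for v, d in zip(values, has_disease) if not d and v < cutoff)
        sens = tp / n_pos
        spec = tn / n_neg
        j = sens + spec - 1.0
        if best is None or j > best[0]:
            best = (j, cutoff, sens, spec)
    return best[1], best[2], best[3]


# Hypothetical fasting glycemia (mg/dl) and reference-standard status.
glycemia = [88, 92, 97, 101, 104, 108, 112, 118, 126, 140]
diabetic = [0, 0, 0, 0, 0, 1, 0, 1, 1, 1]

cutoff, sens, spec = best_cutoff_youden(glycemia, diabetic)
print(f"cut-off {cutoff} mg/dl: sensitivity {sens:.2f}, specificity {spec:.2f}")
```

For a screening test, a criterion that weights sensitivity more heavily than specificity may be preferred over the unweighted index used in this sketch.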