819 resultados para rater reliability
Resumo:
Background: Current guidelines underline the limitations of existing instruments to assess fitness to drive and the poor adaptability of batteries of neuropsychological tests in primary care settings. Aims: To provide a free, reliable, transparent computer based instrument capable of detecting effects of age or drugs on visual processing and cognitive functions. Methods: Relying on systematic reviews of neuropsychological tests and driving performances, we conceived four new computed tasks measuring: visual processing (Task1), movement attention shift (Task2), executive response, alerting and orientation gain (Task3), and spatial memory (Task4). We then planned five studies to test MedDrive's reliability and validity. Study-1 defined instructions and learning functions collecting data from 105 senior drivers attending an automobile club course. Study-2 assessed concurrent validity for detecting minor cognitive impairment (MCI) against useful field of view (UFOV) on 120 new senior drivers. Study-3 collected data from 200 healthy drivers aged 20-90 to model age related normal cognitive decline. Study-4 measured MedDrive's reliability having 21 healthy volunteers repeat tests five times. Study-5 tested MedDrive's responsiveness to alcohol in a randomised, double-blinded, placebo, crossover, dose-response validation trial including 20 young healthy volunteers. Results: Instructions were well understood and accepted by all senior drivers. Measures of visual processing (Task1) showed better performances than the UFOV in detecting MCI (ROC 0.770 vs. 0.620; p=0.048). MedDrive was capable of explaining 43.4% of changes occurring with natural cognitive decline. In young healthy drivers, learning effects became negligible from the third session onwards for all tasks except for dual tasking (ICC=0.769). All measures except alerting and orientation gain were affected by blood alcohol concentrations. Finally, MedDrive was able to explain 29.3% of potential causes of swerving on the driving simulator. Discussion and conclusions: MedDrive reveals improved performances compared to existing computed neuropsychological tasks. It shows promising results both for clinical and research purposes.
Resumo:
Many patients with Chagas disease live in remote communities that lack both equipment and trained personnel to perform a diagnosis by conventional serology (CS). Thus, reliable tests suitable for use under difficult conditions are required. In this study, we evaluated the ability of personnel with and without laboratory skills to perform immunochromatographic (IC) tests to detect Chagas disease at a primary health care centre (PHCC). We examined whole blood samples from 241 patients and serum samples from 238 patients. Then, we calculated the percentage of overall agreement (POA) between the two groups of operators for the sensitivity (S), specificity (Sp) and positive (PPV) and negative (NPV) predictive values of IC tests compared to CS tests. We also evaluated the level of agreement between ELISAs and indirect haemagglutination (IHA) tests. The readings of the IC test results showed 100% agreement (POA = 1). The IC test on whole blood showed the following values: S = 87.3%; Sp = 98.8%; PPV = 96.9% and NPV = 95.9%. Additionally, the IC test on serum displayed the following results: S = 95.7%; Sp = 100%; PPV = 100% and NPV = 98.2%. Using whole blood, the agreement with ELISA was 96.3% and the agreement with IHA was 94.1%. Using serum, the agreement with ELISA was 97.8% and the agreement with IHA was 96.6%. The IC test performance with serum samples was excellent and demonstrated its usefulness in a PHCC with minimal equipment. If the IC test S value and NPV with whole blood are improved, then this test could also be used in areas lacking laboratories or specialised personnel.
Resumo:
BACKGROUND Some controversy remains about the potential applicability of cognitive potentials for evaluating the cerebral activity associated with cognitive capacity. A fundamental requirement is that these neurophysiological parameters show a high level of stability over time. Previous studies have shown that the reliability of diverse parameters of the P3 component (latency and amplitude) ranges between moderate and high. However, few studies have paid attention to the retest reliability of the P3 topography in groups or individuals. Considering that changes in P3 topography have been related to different pathologies and healthy aging, the main objective of this article was to evaluate in a longitudinal study (two sessions) the reliability of P3 topography in a group and at the individual level. RESULTS The correlation between sessions for P3 topography in the grand average of groups was high (r = 0.977, p<0.001). The within-subject correlation values ranged from 0.626 to 0.981 (mean: 0.888). In the between-subjects topography comparisons, the correlation was always lower for comparisons between different subjects than for within-subjects correlations in the first session but not in the second session. CONCLUSIONS The present study shows that P3 topography is highly reliable for group analysis (comprising the same subjects) in different sessions. The results also confirmed that retest reliability for individual P3 maps is suitable for follow-up studies for a particular subject. Moreover, P3 topography appears to be a specific marker considering that the between-subjects correlations were lower than the within-subject correlations. However, P3 topography appears more similar between subjects in the second session, demonstrating that is modulated by experience. Possible clinical applications of all these results are discussed.
Resumo:
The use of observer-rated scales requires that raters be trained until they have become reliable in using the scales. However, few studies properly report how training in using a given rating scale is conducted or indeed how it should be conducted. This study examined progress in interrater reliability over 6 months of training with two observer-rated scales, the Cognitive Errors Rating Scale and the Coping Action Patterns Rating Scale. The evolution of the intraclass correlation coefficients was modeled using hierarchical linear modeling. Results showed an overall training effect as well as effects of the basic training phase and of the rater calibration phase, the latter being smaller than the former. The results are discussed in terms of implications for rater training in psychotherapy research.
Resumo:
Aims and objectives This study aimed to determine the discriminant validity and the test-retest reliability of a questionnaire testing the impact of evidence-based medicine (EBM) training on doctors' knowledge and skills. Methods Questionnaires were sent electronically to all doctors working as residents and chief residents in two French speaking hospital networks in Switzerland. Participants completed the questionnaire twice, within a 4-week interval. The discriminant validity was examined in comparing doctors' performance according to their reported EBM previous training. Proportion of agreement between both sessions of the questionnaire, Cohen's kappa and 'uniform kappa' determined its test-retest reliability. Results The participation rate was 9.8%/7.1% to first/second session. Performance increased according to the level of doctors' previous training in EBM. The observed proportion of agreement between both sessions was over 70% for 14/19 questions, and the 'uniform kappa' was superior to 0.60 for 15/19 questions. Conclusion The discriminant validity and test-retest reliability of the questionnaire were satisfying. The low participation rate did not prevent the study from achieving its aims.
Resumo:
Given the important role of the shoulder sensorimotor system in shoulder stability, its assessment appears of interest. Force platform monitoring of centre of pressure (CoP) in upper-limb weight-bearing positions is of interest as it allows integration of all aspects of shoulder sensorimotor control. This study aimed to determine the feasibility and reliability of shoulder sensorimotor control assessment by force platform. Forty-five healthy subjects performed two sessions of CoP measurement using Win-Posturo(®) Medicapteurs force platform in an upper-limb weight-bearing position with the lower limbs resting on a table to either the anterior superior iliac spines (P1) or upper patellar poles (P2). Four different conditions were tested in each position in random order: eyes open or eyes closed with trunk supported by both hands and eyes open with trunk supported on the dominant or non-dominant side. P1 reliability values were globally moderate to high for CoP length, CoP velocity and CoP standard deviation (SD), standard error of measurement ranged from 6·0% to 26·5%, except for CoP area. P2 reliability values were globally low and not clinically acceptable. Our results suggest that shoulder sensorimotor control assessment by force platform is feasible and has good reliability in upper-limb weight-bearing positions when the lower limbs are resting on a table to the anterior superior iliac spines. CoP length, CoP velocity and CoP SD velocity appear to be the most reliable variables.
Resumo:
In 58 newborn infants a new iridium oxide sensor was evaluated for transcutaneous carbon dioxide (tcPCO2) monitoring at 42 degrees C with a prolonged fixation time of 24 hours. The correlation of tcPCO2 (y; mm Hg) v PaCO2 (x; mm Hg) for 586 paired values was: y = 4.6 + 1.45x; r = .89; syx = 6.1 mm Hg. The correlation was not influenced by the duration of fixation. The transcutaneous sensor detected hypocapnia (PaCO2 less than 35 mm Hg) in 74% and hypercapnia (PCO2 greater than 45 mm Hg) in 74% of all cases. After 24 hours, calibration shifts were less than 4 mm Hg in 90% of the measuring periods. In 86% of the infants, no skin changes were observed; in 12% of infants, there were transitional skin erythemas and in 2% a blister which disappeared without scarring. In newborn infants with normal BPs, continuous tcPCO2 monitoring at 42 degrees C can be extended for as many as 24 hours without loss of reliability or increased risk for skin burns.
Resumo:
The purpose of this paper is to describe the development and to test the reliability of a new method called INTERMED, for health service needs assessment. The INTERMED integrates the biopsychosocial aspects of disease and the relationship between patient and health care system in a comprehensive scheme and reflects an operationalized conceptual approach to case mix or case complexity. The method is developed to enhance interdisciplinary communication between (para-) medical specialists and to provide a method to describe case complexity for clinical, scientific, and educational purposes. First, a feasibility study (N = 21 patients) was conducted which included double scoring and discussion of the results. This led to a version of the instrument on which two interrater reliability studies were performed. In study 1, the INTERMED was double scored for 14 patients admitted to an internal ward by a psychiatrist and an internist on the basis of a joint interview conducted by both. In study 2, on the basis of medical charts, two clinicians separately double scored the INTERMED in 16 patients referred to the outpatient psychiatric consultation service. Averaged over both studies, in 94.2% of all ratings there was no important difference between the raters (more than 1 point difference). As a research interview, it takes about 20 minutes; as part of the whole process of history taking it takes about 15 minutes. In both studies, improvements were suggested by the results. Analyses of study 1 revealed that on most items there was considerable agreement; some items were improved. Also, the reference point for the prognoses was changed so that it reflected both short- and long-term prognoses. Analyses of study 2 showed that in this setting, less agreement between the raters was obtained due to the fact that the raters were less experienced and the scoring procedure was more susceptible to differences. Some improvements--mainly of the anchor points--were specified which may further enhance interrater reliability. The INTERMED proves to be a reliable method for classifying patients' care needs, especially when used by experienced raters scoring by patient interview. It can be a useful tool in assessing patients' care needs, as well as the level of needed adjustment between general and mental health service delivery. The INTERMED is easily applicable in the clinical setting at low time-costs.
Resumo:
BACKGROUND: The Foot and Ankle Ability Measure (FAAM) is a self reported questionnaire for patients with foot and ankle disorders available in English, German, and Persian. This study plans to translate the FAAM from English to French (FAAM-F) and assess the validity and reliability of this new version.METHODS: The FAAM-F Activities of Daily Living (ADL) and sports subscales were completed by 105 French-speaking patients (average age 50.5 years) presenting various chronic foot and ankle disorders. Convergent and divergent validity was assessed by Pearson's correlation coefficients between the FAAM-F subscales and the SF-36 scales: Physical Functioning (PF), Physical Component Summary (PCS), Mental Health (MH) and Mental Component Summary (MCS). Internal consistency was calculated by Cronbach's Alpha (CA). To assess test re-test reliability, 22 patients filled out the questionnaire a second time to estimate minimal detectable changes (MDC) and intraclass correlation coefficients (ICC).RESULTS: Correlations for FAAM-F ADL subscale were 0.85 with PF, 0.81 with PCS, 0.26 with MH, 0.37 with MCS. Correlations for FAAM-F Sports subscale were 0.72 with PF, 0.72 with PCS, 0.21 with MH, 0.29 with MCS. CA estimates were 0.97 for both subscales. Respectively for the ADL and Sports subscales, ICC were 0.97 and 0.94, errors for a single measure were 8 and 10 points at 95% confidence and the MDC values at 95% confidence were 7 and 18 points.CONCLUSION: The FAAM-F is valid and reliable for the self-assessment of physical function in French-speaking patients with a wide range of chronic foot and ankle disorders.
Resumo:
Communication is an indispensable component of animal societies, yet many open questions remain regarding the factors affecting the evolution and reliability of signalling systems. A potentially important factor is the level of genetic relatedness between signallers and receivers. To quantitatively explore the role of relatedness in the evolution of reliable signals, we conducted artificial evolution over 500 generations in a system of foraging robots that can emit and perceive light signals. By devising a quantitative measure of signal reliability, and comparing independently evolving populations differing in within-group relatedness, we show a strong positive correlation between relatedness and reliability. Unrelated robots produced unreliable signals, whereas highly related robots produced signals that reliably indicated the location of the food source and thereby increased performance. Comparisons across populations also revealed that the frequency for signal production-which is often used as a proxy of signal reliability in empirical studies on animal communication-is a poor predictor of signal reliability and, accordingly, is not consistently correlated with group performance. This has important implications for our understanding of signal evolution and the empirical tools that are used to investigate communication.
Resumo:
PURPOSE: To conduct a cross-cultural adaptation of the Core Outcome Measures Index (COMI) into French according to established guidelines. METHODS: Seventy outpatients with chronic low back pain were recruited from six spine centres in Switzerland and France. They completed the newly translated COMI, and the Roland Morris disability (RMQ), Dallas Pain (DPQ), adjectival pain rating scale, WHO Quality of Life, and EuroQoL-5D questionnaires. After ~14 days RMQ and COMI were completed again to assess reproducibility; a transition question (7-point Likert scale; "very much worse" through "no change" to "very much better") indicated any change in status since the first questionnaire. RESULTS: COMI whole scores displayed no floor effects and just 1.5% ceiling effects. The scores for the individual COMI items correlated with their corresponding full-length reference questionnaire with varying strengths of correlation (0.33-0.84, P < 0.05). COMI whole scores showed a very good correlation with the "multidimensional" DPQ global score (Rho = 0.71). 55 patients (79%) returned a second questionnaire with no/minimal change in their back status. The reproducibility of individual COMI 5-point items was good, with test-retest differences within one grade ranging from 89% for 'social/work disability' to 98% for 'symptom-specific well-being'. The intraclass correlation coefficient for the COMI whole score was 0.85 (95% CI 0.76-0.91). CONCLUSIONS: In conclusion, the French version of this short, multidimensional questionnaire showed good psychometric properties, comparable to those reported for German and Spanish versions. The French COMI represents a valuable tool for future multicentre clinical studies and surgical registries (e.g. SSE Spine Tango) in French-speaking countries.
Resumo:
When facing age-related cerebral decline, older adults are unequally affected by cognitive impairment without us knowing why. To explore underlying mechanisms and find possible solutions to maintain life-space mobility, there is a need for a standardized behavioral test that relates to behaviors in natural environments. The aim of the project described in this paper was therefore to provide a free, reliable, transparent, computer-based instrument capable of detecting age-related changes on visual processing and cortical functions for the purposes of research into human behavior in computational transportation science. After obtaining content validity, exploring psychometric properties of the developed tasks, we derived (Study 1) the scoring method for measuring cerebral decline on 106 older drivers aged ≥70 years attending a driving refresher course organized by the Swiss Automobile Association to test the instrument's validity against on-road driving performance (106 older drivers). We then validated the derived method on a new sample of 182 drivers (Study 2). We then measured the instrument's reliability having 17 healthy, young volunteers repeat all tests included in the instrument five times (Study 3) and explored the instrument's psychophysical underlying functions on 47 older drivers (Study 4). Finally, we tested the instrument's responsiveness to alcohol and effects on performance on a driving simulator in a randomized, double-blinded, placebo, crossover, dose-response, validation trial including 20 healthy, young volunteers (Study 5). The developed instrument revealed good psychometric properties related to processing speed. It was reliable (ICC = 0.853) and showed reasonable association to driving performance (R (2) = 0.053), and responded to blood alcohol concentrations of 0.5 g/L (p = 0.008). Our results suggest that MedDrive is capable of detecting age-related changes that affect processing speed. These changes nevertheless do not necessarily affect driving behavior.