790 resultados para test retest reliability
Resumo:
PURPOSE:
To determine the test-retest variability in perimetric, optic disc, and macular thickness parameters in a cohort of treated patients with established glaucoma.
PATIENTS AND METHODS:
In this cohort study, the authors analyzed the imaging studies and visual field tests at the baseline and 6-month visits of 162 eyes of 162 participant in the Glaucoma Imaging Longitudinal Study (GILS). They assessed the difference, expressed as the standard error of measurement, of Humphrey field analyzer II (HFA) Swedish Interactive Threshold Algorithm fast, Heidelberg retinal tomograph (HRT) II, and retinal thickness analyzer (RTA) parameters between the two visits and assumed that this difference was due to measurement variability, not pathologic change. A statistically significant change was defined as twice the standard error of measurement.
RESULTS:
In this cohort of treated glaucoma patients, it was found that statistically significant changes were 3.2 dB for mean deviation (MD), 2.2 for pattern standard deviation (PSD), 0.12 for cup shape measure, 0.26 mm for rim area, and 32.8 microm and 31.8 microm for superior and inferior macular thickness, respectively. On the basis of these values, it was estimated that the number of potential progression events detectable in this cohort by the parameters of MD, PSD, cup shape measure, rim area, superior macular thickness, and inferior macular thickness was 7.5, 6.0, 2.3, 5.7, 3.1, and 3.4, respectively.
CONCLUSIONS:
The variability of the measurements of MD, PSD, and rim area, relative to the range of possible values, is less than the variability of cup shape measure or macular thickness measurements. Therefore, the former measurements may be more useful global measurements for assessing progressive glaucoma damage.
Resumo:
This study aimed to explore the reliability of self-reported trauma histories in a population with a diagnosis of Bipolar Disorder using the Childhood Trauma Questionnaire. Previous studies in other populations suggest high reliability of trauma histories over time and it was postulated that a similar high reliability would be demonstrated in this population. Thirty-nine patients with a confirmed diagnosis (DSM-IV criteria) were followed-up and re-administered the Childhood Trauma Questionnaire after 18 months. Cohen's kappa scores and intraclass correlations suggest reasonable test-retest reliability over the 18-month time period of the study for all types of childhood abuse, namely emotional, physical, sexual, and physical abuse and emotional neglect. Intraclass correlations ranged from r = .50 to (sexual abuse) to r = .96 (physical abuse). Cohen's kappas ranged from .44 (sexual abuse) to .76 (physical abuse). Retrospective reports of childhood trauma can be seen as reliable and are in keeping with results found with other mental health populations.
Resumo:
This study aimed to evaluate the reliability of Neupsilin Brief Neuropsychological Assessment Instrument, a brief battery developed in Brazil. Hundred two Brazilian man and women participated, from 18 to 40 years of age. It was evaluated the test-retest reliability of the Neupsilin tasks and the reliability of the correction of the constructional praxis task by different evaluators. The data were analyzed by Spearman’s correlation, intraclass correlation and Cronbach’s alpha. Language, memory, praxis and executive functions presented the highest correlations in the test-retest analyses. The agreement in the correction of the constructional praxis task was moderate to high. The results indicate temporal reliability of Neupsilin tasks and inter-rater agreement in the correction of the constructional praxis task. Suggestions to improve the tasks, the validity and reliability of Neupsilin were presented.
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)
Resumo:
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
Resumo:
BACKGROUND: Only few standardized apraxia scales are available and they do not cover all domains and semantic features of gesture production. Therefore, the objective of the present study was to evaluate the reliability and validity of a newly developed test of upper limb apraxia (TULIA), which is comprehensive and still short to administer. METHODS: The TULIA consists of 48 items including imitation and pantomime domain of non-symbolic (meaningless), intransitive (communicative) and transitive (tool related) gestures corresponding to 6 subtests. A 6-point scoring method (0-5) was used (score range 0-240). Performance was assessed by blinded raters based on videos in 133 stroke patients, 84 with left hemisphere damage (LHD) and 49 with right hemisphere damage (RHD), as well as 50 healthy subjects (HS). RESULTS: The clinimetric findings demonstrated mostly good to excellent internal consistency, inter- and intra-rater (test-retest) reliability, both at the level of the six subtests and at individual item level. Criterion validity was evaluated by confirming hypotheses based on the literature. Construct validity was demonstrated by a high correlation (r = 0.82) with the De Renzi-test. CONCLUSION: These results show that the TULIA is both a reliable and valid test to systematically assess gesture production. The test can be easily applied and is therefore useful for both research purposes and clinical practice.
Resumo:
In this study, three experiments are presented that investigate the reliability of memory measures. In Experiment 1, the well-known dissociation between explicit (recall, recognition) and implicit memory (picture clarification) as a function of age in a sample of 335 persons aged between 65 and 95 was replicated. Test-retest reliability was significantly lower in implicit than in explicit measures. In Experiment 2, parallel-test reliabilities in a student sample confirmed the finding of Experiment 1. In Experiment 3, the reliability of cued recall and word stem completion was investigated. There were significant priming effects and a dissociation between explicit and implicit memory as a function of levels of processing. However, the reliability of implicit memory measures was again substantially lower than in explicit tests in all test conditions. As a consequence, differential reliabilities of direct and indirect memory tests should be considered as a possible determinant of dissociations between explicit and implicit memory as a function of experimental or quasi-experimental manipulations.
Resumo:
Need for cognition (NFC) reflects a relatively stable trait regarding the degree to which one enjoys and engages in cognitive endeavors. We examined whether the previously demonstrated one-dimensional structure of the German NFC Scale could be replicated in three samples of undergraduates and secondary school students. Moreover, we investigated the test-retest reliability of the German NFC Scale, which has not yet been tested. Further, we investigated whether the scale would be valid in a sample of secondary school students. Multigroup confirmatory factor analyses established the one-dimensional factor structure of the long form as well as the short form of the German NFC Scale for undergraduates (N = 559), students of academic track secondary schools (German Gymnasium; N = 555), and students of vocational track secondary schools (German Realschule; N = 486). The scale proved to have a high test-retest reliability in a university student sample (N = 43). For secondary school students, we again found a high test-retest reliability (N = 157), and also found the scale to be valid (N = 181).
Resumo:
OBJECTIVE To provide guidance on standards for reporting studies of diagnostic test accuracy for dementia disorders. METHODS An international consensus process on reporting standards in dementia and cognitive impairment (STARDdem) was established, focusing on studies presenting data from which sensitivity and specificity were reported or could be derived. A working group led the initiative through 4 rounds of consensus work, using a modified Delphi process and culminating in a face-to-face consensus meeting in October 2012. The aim of this process was to agree on how best to supplement the generic standards of the STARD statement to enhance their utility and encourage their use in dementia research. RESULTS More than 200 comments were received during the wider consultation rounds. The areas at most risk of inadequate reporting were identified and a set of dementia-specific recommendations to supplement the STARD guidance were developed, including better reporting of patient selection, the reference standard used, avoidance of circularity, and reporting of test-retest reliability. CONCLUSION STARDdem is an implementation of the STARD statement in which the original checklist is elaborated and supplemented with guidance pertinent to studies of cognitive disorders. Its adoption is expected to increase transparency, enable more effective evaluation of diagnostic tests in Alzheimer disease and dementia, contribute to greater adherence to methodologic standards, and advance the development of Alzheimer biomarkers.
Resumo:
Physical activity is a key component of life-style modification process which helps to reduce the risk of developing chronic diseases. It is important to have accurate estimates of physical activity to identify sedentary populations where interventions might be helpful. The International Physical Activity Questionnaire (IPAQ) short version has been used to estimate physical activity in diverse populations. However, there is little literature depicting the use of the IPAQ short version in Mexican America population. This study addressed the predictive validity and test-retest reliability of the IPAQ short version in Mexican American adults. The analysis was performed on 97 participants enrolled in the Cameron County Hispanic Cohort. Individuals selected in this study were 18 years of age or older. The predictive validity was evaluated by studying the relationship between physical activity and biomarkers known to be correlated with physical activity, namely, TNF-α, Adiponectin, and HDL. Multiple linear regression analysis was performed to delineate predictive validity. To assess test-retest reliability, two IPAQ-short last seven days questionnaires were interviewer administered to the participants on the same day, approximately two hours apart. Test-Retest reliability of IPAQ was estimated by performing intraclass correlations between the readings at two different time periods. The study showed that the IPAQ – short version used in the above study had acceptable test-retest reliability in the Mexican American population. This study showed that the IPAQ – short version did not have acceptable predictive validity when looking at physical activity and TNF-α, Adiponectin, and HDL in this sample.^
Resumo:
The reliability of measurement refers to unsystematic error in observed responses. Investigations of the prevalence of random error in stated estimates of willingness to pay (WTP) are important to an understanding of why tests of validity in CV can fail. However, published reliability studies have tended to adopt empirical methods that have practical and conceptual limitations when applied to WTP responses. This contention is supported in a review of contingent valuation reliability studies that demonstrate important limitations of existing approaches to WTP reliability. It is argued that empirical assessments of the reliability of contingent values may be better dealt with by using multiple indicators to measure the latent WTP distribution. This latent variable approach is demonstrated with data obtained from a WTP study for stormwater pollution abatement. Attitude variables were employed as a way of assessing the reliability of open-ended WTP (with benchmarked payment cards) for stormwater pollution abatement. The results indicated that participants' decisions to pay were reliably measured, but not the magnitude of the WTP bids. This finding highlights the need to better discern what is actually being measured in VVTP studies, (C) 2003 Elsevier B.V. All rights reserved.
Resumo:
An adaptation of the traditional Stroop test, the California Older Adult Stroop Test (COAST) (Pachana, Marcopulos, Yoash-Gantz & Thompson, 1995), has been developed specifically for use with a geriatric population, utilizing larger typeface, fewer items (50) per task, and more easily distinguished colors (red, yellow and green). Test-retest reliability and validity data are reviewed for both control and clinical populations. Increased error rates on the Stroop test compared to the COAST were found for the color and color/word interference tasks. These results are discussed in terms of changes in the visual system with increasing age. The implications for better test sensitivity with the COAST for older adult populations are discussed.