66 resultados para reliability and validity
Resumo:
Osteoarticular allograft is one possible treatment in wide surgical resections with large defects. Performing best osteoarticular allograft selection is of great relevance for optimal exploitation of the bone databank, good surgery outcome and patient’s recovery. Current approaches are, however, very time consuming hindering these points in practice. We present a validation study of a software able to perform automatic bone measurements used to automatically assess the distal femur sizes across a databank. 170 distal femur surfaces were reconstructed from CT data and measured manually using a size measure protocol taking into account the transepicondyler distance (A), anterior-posterior distance in medial condyle (B) and anterior-posterior distance in lateral condyle (C). Intra- and inter-observer studies were conducted and regarded as ground truth measurements. Manual and automatic measures were compared. For the automatic measurements, the correlation coefficients between observer one and automatic method, were of 0.99 for A measure and 0.96 for B and C measures. The average time needed to perform the measurements was of 16 h for both manual measurements, and of 3 min for the automatic method. Results demonstrate the high reliability and, most importantly, high repeatability of the proposed approach, and considerable speed-up on the planning.
Resumo:
The objectives of this study were to develop and validate a tool for assessing pain in population-based observational studies and to develop three subscales for back/neck, upper extremity and lower extremity pain. Based on a literature review, items were extracted from validated questionnaires and reviewed by an expert panel. The initial questionnaire consisted of a pain manikin and 34 items relating to (i) intensity of pain in different body regions (7 items), (ii) pain during activities of daily living (18 items) and (iii) various pain modalities (9 items). Psychometric validation of the initial questionnaire was performed in a random sample of the German-speaking Swiss population. Analyses included tests for reliability, correlation analysis, principal components factor analysis, tests for internal consistency and validity. Overall, 16,634 of 23,763 eligible individuals participated (70%). Test-retest reliability coefficients ranged from 0.32 to 0.97, but only three coefficients were below 0.60. Subscales were constructed combining four items for each of the subscales. Item-total coefficients ranged from 0.76 to 0.86 and Cronbach's alpha were 0.75 or higher for all subscales. Correlation coefficients between subscales and three validated instruments (WOMAC, SPADI and Oswestry) ranged from 0.62 to 0.79. The final Pain Standard Evaluation Questionnaire (SEQ Pain) included 28 items and the pain manikin and accounted for the multidimensionality of pain by assessing pain location and intensity, pain during activity, triggers and time of onset of pain and frequency of pain medication. It was found to be reliable and valid for the assessment of pain in population-based observational studies.
Resumo:
Synaesthesia is a heterogeneous phenomenon, even when considering one particular sub-type. The purpose of this study was to design a reliable and valid questionnaire for grapheme-colour synaesthesia that captures this heterogeneity. By the means of a large sample of 628 synaesthetes and a factor analysis, we created the Coloured Letters and Numbers (CLaN) questionnaire with 16 items loading on 4 different factors (i.e., localisation, automaticity/attention, deliberate use, and longitudinal changes). These factors were externally validated with tests which are widely used in the field of synaesthesia research. The questionnaire showed good test–retest reliability and construct validity (i.e., internally and externally). Our findings are discussed in the light of current theories and new ideas in synaesthesia research. More generally, the questionnaire is a useful tool which can be widely used in synaesthesia research to reveal the influence of individual differences on various performance measures and will be useful in generating new hypotheses.
Resumo:
OBJECTIVE Visual hallucinations (VHs) are a very personal experience, and it is not clear whether information about them is best provided by informants or patients. Some patients may not share their hallucinatory experiences with caregivers to avoid distress or for fear of being labeled insane, and others do not have informants at all, which limits the use of informant-based questionnaires. The aim of this study was to compare patient and caregiver views about VHs in Parkinson disease (PD), using the North-East Visual Hallucinations Interview (NEVHI). METHODS Fifty-nine PD patient-informant pairs were included. PD patients and informants were interviewed separately about VHs using the NEVHI. Informants were additionally interviewed using the four-item version of the Neuropsychiatric Inventory. Inter-reliability and concurrent validity of the different measures were compared. RESULTS VHs were more commonly reported by patients than informants. The inter-rater agreement between NEVHI-patient and NEVHI-informant was moderate for complex VHs (Cohen's kappa = 0.44; 95% confidence interval [CI]: 0.13-0.75; t = 3.43, df = 58, p = 0.001) and feeling of presence (Cohen's kappa = 0.35; 95% CI: 0.00-0.70; t = 2.75, df = 58, p = 0.006), but agreement was poor for illusions (Cohen's kappa = 0.25; 95% CI: -0.07-0.57; t = 2.36, df = 58, p = 0.018) and passage hallucinations (Cohen's kappa = 0.16; 95% CI: -0.04-0.36; t = 2.26, df = 58, p = 0.024). CONCLUSION When assessing VHs in PD patients, it is best to rely on patient information, because not all patients share the details of their hallucinations with their caregivers.
Resumo:
Objective: Since 2011, the new national final examination in human medicine has been implemented in Switzerland, with a structured clinical-practical part in the OSCE format. From the perspective of the national Working Group, the current article describes the essential steps in the development, implementation and evaluation of the Federal Licensing Examination Clinical Skills (FLE CS) as well as the applied quality assurance measures. Finally, central insights gained from the last years are presented. Methods: Based on the principles of action research, the FLE CS is in a constant state of further development. On the foundation of systematically documented experiences from previous years, in the Working Group, unresolved questions are discussed and resulting solution approaches are substantiated (planning), implemented in the examination (implementation) and subsequently evaluated (reflection). The presented results are the product of this iterative procedure. Results: The FLE CS is created by experts from all faculties and subject areas in a multistage process. The examination is administered in German and French on a decentralised basis and consists of twelve interdisciplinary stations per candidate. As important quality assurance measures, the national Review Board (content validation) and the meetings of the standardised patient trainers (standardisation) have proven worthwhile. The statistical analyses show good measurement reliability and support the construct validity of the examination. Among the central insights of the past years, it has been established that the consistent implementation of the principles of action research contributes to the successful further development of the examination. Conclusion: The centrally coordinated, collaborative-iterative process, incorporating experts from all faculties, makes a fundamental contribution to the quality of the FLE CS. The processes and insights presented here can be useful for others planning a similar undertaking. Keywords: national final examination, licensing examination, summative assessment, OSCE, action research
Resumo:
Clinicians believe that psychosocial factors play a causal role in the etiology of many forms of functional dysphonia (FD). But for decades, all attempts to confirm such causation have failed. This paper aims to show the logic of this failure, to discuss the possibilities of employing psychology in therapy nonetheless, and to encourage clinicians to use their psychosocial knowledge and skills. The failure to confirm psychic and social factors as causal in the etiology of FD is basically a consequence of a principal shortcoming of evidence-based medicine (EBM). As the gold standard for validity, reliability, and objectivity in medical research, EBM is based on calculability and hence the processing of quantitative data. But life paths and life situations are best or sometimes only expressible in qualitative, experiential, and idiographic terms. Thus EBM-guided evaluation undervalues most psychosocial studies. This report of an experienced multidisciplinary voice team proposes alternative pathways for integrating psychosocial knowledge into the diagnosis and the treatment of FD. The difference between the fields of activity of psychotherapists and speech-language pathologists is discussed, and the latter group is shown the potential benefits of using more of their psychosocial knowledge and skills.
Resumo:
Hip dysplasia is characterized by insufficient femoral head coverage (FHC). Quantification of FHC is of importance as the underlying goal of the surgery to treat hip dysplasia is to restore a normal acetabular morphology and thereby to improve FHC. Unlike a pure 2D X-ray radiograph-based measurement method or a pure 3D CT-based measurement method, previously we presented a 2.5D method to quantify FHC from a single anteriorposterior (AP) pelvic radiograph. In this study, we first quantified and compared 3D FHC between a normal control group and a patient group using a CT-based measurement method. Taking the CT-based 3D measurements of FHC as the gold standard, we further quantified the bias, precision and correlation between the 2.5D measurements and the 3D measurements on both the control group and the patient group. Based on digitally reconstructed radiographs (DRRs), we investigated the influence of the pelvic tilt on the 2.5D measurements of FHC. The intraclass correlation coefficients (ICCs) for absolute agreement was used to quantify interobserver reliability and intraobserver reproducibility of the 2.5D measurement technique. The Pearson correlation coefficient, r, was used to determine the strength of the linear association between the 2.5D and the 3D measurements. Student's t-test was used to determine whether the differences between different measurements were statistically significant. Our experimental results demonstrated that both the interobserver reliability and the intraobserver reproducibility of the 2.5D measurement technique were very good (ICCs > 0.8). Regression analysis indicated that the correlation was very strong between the 2.5D and the 3D measurements (r = 0.89, p < 0.001). Student's t-test showed that there were no statistically significant differences between the 2.5D and the 3D measurements of FHC on the patient group (p > 0.05). The results of this study provided convincing evidence demonstrating the validity of the 2.5D measurements of FHC from a single AP pelvic radiograph and proved that it could serve as a surrogate for 3D CT-based measurements. Thus it may be possible to use this method to avoid a CT scan for the purpose of estimating 3D FHC in diagnosis and post-operative treatment evaluation of patients with hip dysplasia.
Resumo:
PURPOSE Stress urinary incontinence (SUI) affects women of all ages including young athletes, especially those involved in high-impact sports. To date, hardly any studies are available testing pelvic floor muscles (PFM) during sports activities. The aim of this study was the description and reliability test of six PFM electromyography (EMG) variables during three different running speeds. The secondary objective was to evaluate whether there was a speed-dependent difference between the PFM activity variables. METHODS This trial was designed as an exploratory and reliability study including ten young healthy female subjects to characterize PFM pre-activity and reflex activity during running at 7, 9 and 11 km/h. Six variables for each running speed, averaged over ten steps per subject, were presented descriptively, tested regarding their reliability (Friedman, ICC, SEM, MD) and speed difference (Friedman). RESULTS PFM EMG variables varied between 67.6 and 106.1 %EMG, showed no systematic error and were low for SEM and MD using the single value model. Applying the average model over ten steps, ICC (3,k) were >0.75 and SEM and MD about 50 % lower than for the single value model. Activity was found to be highest in 11 km/h. CONCLUSION EMG variables showed excellent ICC and very low SEM and MD. Further studies should investigate inter-session reliability and PFM reactivity patterns of SUI patients using the average over ten steps for each variable as it showed very high ICC and very low SEM and MD. Subsequently, longer running distances and other high-impact sports disciplines could be studied.
Resumo:
Background Many medical exams use 5 options for multiple choice questions (MCQs), although the literature suggests that 3 options are optimal. Previous studies on this topic have often been based on non-medical examinations, so we sought to analyse rarely selected, 'non-functional' distractors (NF-D) in high stakes medical examinations, and their detection by item authors as well as psychometric changes resulting from a reduction in the number of options. Methods Based on Swiss Federal MCQ examinations from 2005-2007, the frequency of NF-D (selected by <1% or <5% of the candidates) was calculated. Distractors that were chosen the least or second least were identified and candidates who chose them were allocated to the remaining options using two extreme assumptions about their hypothetical behaviour: In case rarely selected distractors were eliminated, candidates could randomly choose another option - or purposively choose the correct answer, from which they had originally been distracted. In a second step, 37 experts were asked to mark the least plausible options. The consequences of a reduction from 4 to 3 or 2 distractors - based on item statistics or on the experts' ratings - with respect to difficulty, discrimination and reliability were modelled. Results About 70% of the 5-option-items had at least 1 NF-D selected by <1% of the candidates (97% for NF-Ds selected by <5%). Only a reduction to 2 distractors and assuming that candidates would switch to the correct answer in the absence of a 'non-functional' distractor led to relevant differences in reliability and difficulty (and to a lesser degree discrimination). The experts' ratings resulted in slightly greater changes compared to the statistical approach. Conclusions Based on item statistics and/or an expert panel's recommendation, the choice of a varying number of 3-4 (or partly 2) plausible distractors could be performed without marked deteriorations in psychometric characteristics.
Resumo:
The widespread use of artificial nestboxes has led to significant advances in our knowledge of the ecology, behaviour and physiology of cavity nesting birds, especially small passerines Nestboxes have made it easier to perform routine monitoring and experimental manipulation of eggs or nestlings, and also repeatedly to capture, identify and manipulate the parents However, when comparing results across study sites the use of nestboxes may also Introduce a potentially significant confounding variable in the form of differences in nestbox design amongst studies, such as their physical dimensions, placement height, and the way in which they are constructed and maintained However, the use of nestboxes may also introduce an unconsidered and potentially significant confounding variable clue to differences in nestbox design amongst studies, such as their physical dimensions, placement height, and the way in which they are constructed and maintained Here we review to what extent the characteristics of artificial nestboxes (e g size, shape, construction material, colour) are documented in the 'methods' sections of publications involving hole-nesting passerine birds using natural or excavated cavities or artificial nestboxes for reproduction and roosting Despite explicit previous recommendations that authors describe in detail the characteristics of the nestboxes used, we found that the description of nestbox characteristics in most recent publications remains poor and insufficient We therefore list the types of descriptive data that should be included in the methods sections of relevant manuscripts and justify this by discussing how variation in nestbox characteristics can affect or confound conclusions from nestbox studies We also propose several recommendations to improve the reliability and usefulness of research based on long-term studies of any secondary hole-nesting species using artificial nestboxes for breeding or roosting.
Resumo:
This article reviews the psychophysiological and brain imaging literature on emotional brain function from a methodological point of view. The difficulties in defining, operationalising and measuring emotional activation and, in particular, aversive learning will be considered. Emotion is a response of the organism during an episode of major significance and involves physiological activation, motivational, perceptual, evaluative and learning processes, motor expression, action tendencies and monitoring/subjective feelings. Despite the advances in assessing the physiological correlates of emotional perception and learning processes, a critical appraisal shows that functional neuroimaging approaches encounter methodological difficulties regarding measurement precision (e.g., response scaling and reproducibility) and validity (e.g., response specificity, generalisation to other paradigms, subjects or settings). Since emotional processes are not only the result of localised but also of widely distributed activation, a more representative model of assessment is needed that systematically relates the hierarchy of high- and low-level emotion constructs with the corresponding patterns of activity and functional connectivity of the brain.
Resumo:
PURPOSE: Family needs and expectations are often unmet in the intensive care unit (ICU), leading to dissatisfaction. This study assesses cross-cultural adaptability of an instrument evaluating family satisfaction in the ICU. MATERIALS AND METHODS: A Canadian instrument on family satisfaction was adapted for German language and central European culture and then validated for feasibility, validity, internal consistency, reliability, and sensitivity. RESULTS: Content validity of a preliminary translated version was assessed by staff, patients, and next of kin. After adaptation, content and comprehensibility were considered good. The adapted translation was then distributed to 160 family members. The return rate was 71.8%, and 94.4% of questions in returned forms were clearly answered. In comparison with a Visual Analogue Scale, construct validity was good for overall satisfaction with care (Spearman rho = 0.60) and overall satisfaction with decision making (rho = 0.65). Cronbach alpha was .95 for satisfaction with care and .87 for decision-making. Only minor differences on repeated measurements were found for interrater and intrarater reliability. There was no floor or ceiling effect. CONCLUSIONS: A cross-cultural adaptation of a questionnaire on family satisfaction in the ICU can be feasible, valid, internally consistent, reliable, and sensitive.
Resumo:
BACKGROUND: Genetically transmitted traits such as cytokine gene polymorphisms may accentuate the host inflammatory response to the bacterial challenge and influence susceptibility to periodontitis. OBJECTIVE: To systematically review the evidence of an association between the interleukin-1 (IL-1) composite genotype, i.e. presence of the allele 2 in the gene clusters IL-1A-889 and in IL-1B +3953, and periodontitis progression and/or treatment outcomes. Material and Methods: Based on the focused question, a search was conducted for longitudinal clinical trials comparing progression of periodontitis and/or treatment outcomes in IL-1 genotype-positive (carrying allele 2) and IL-1 genotype-negative (not carrying allele 2) subjects. A search in the National Library of Medicine computerized bibliographic database MEDLINE and a manual search were performed. Selection of publications, extraction of data and validity assessment were made independently by two reviewers. RESULTS: The search provided 122 titles of which 11 longitudinal publications were included. The heterogeneity of the data prevented the performance of a meta-analysis. While findings from some publications rejected a possible role of IL-1 composite genotype on progression of periodontitis after various therapies, other reported a prognostic value for disease progression of the positive IL-1 genotype status. When assessed on a multivariate risk assessment model, several publications concluded that the assessment of the IL-1 composite genotype in conjunction with other covariates (e.g. smoking and presence of specific bacteria) may provide additional information on disease progression. The small sample size of the available publications, however, requires caution in the interpretation of the results. CONCLUSION: Based on these findings, (i) there is insufficient evidence to establish if a positive IL-1 genotype status contributes to progression of periodontitis and/or treatment outcomes. Therefore, (ii) results obtained with commercially available tests should be interpreted with caution.
Resumo:
The purpose of this study was to validate the accuracy, consistency, and reproducibility/reliability of a new method for correction of pelvic tilt and rotation of radiographic hip parameters for pincer type of femoroacetabular impingement on an anteroposterior pelvic radiograph. Thirty cadaver hips and 100 randomized, blinded AP pelvic radiographs were used for investigation. To detect the software accuracy, the calculated femoral head coverage and classic hip parameters determined with our software were compared to reference measurements based on CT scans or conventional radiographs in a neutral orientation as gold standard. To investigate software consistency, differences among the different parameters for each cadaver pelvis were calculated when reckoned back from a random to the neutral orientation. Intra- and interobserver comparisons were used to analyze the reliability and reproducibility of all parameters. All but two parameters showed a good-to-very good accuracy with the reference measurements. No relevant systematic errors were detected in the Bland-Altman analysis. Software consistency was good-to-very good for all parameters. A good-to-very good reliability and reproducibility was found for a substantial number of the evaluated radiographic acetabular parameters. The software appears to be an accurate, consistent, reliable, and reproducible method for analysis of acetabular pathomorphologies.
Resumo:
Acromegaly is a chronic disease with an important impact on quality of life. An acromegaly disease-generated quality of life questionnaire (AcroQoL) has recently been developed. We aimed to confirm reliability, construct validity and disease-specificity of the AcroQoL questionnaire. Second, we investigated the effect of remission status on health-related quality of life (HRQoL) in patients with acromegaly.