984 resultados para Test reproducibility
Resumo:
Background—Palpation is an important clinical test for jumper's knee. Objectives—To (a) test the reproducibility of palpation tenderness, (b) evaluate the sensitivity and specificity of palpation in subjects with clinical symptoms of jumper's knee, and (c) determine whether tenderness to palpation may serve as a useful screening test for patellar tendinopathy. The yardstick for diagnosis of patellar tendinopathy was ultrasonographic abnormality. Methods—In 326 junior symptomatic and asymptomatic athletes' tendons, palpation was performed by a single examiner before ultrasonographic examination by a certified ultrasound radiologist. In 58 tendons, palpation was performed twice to test reliability. Tenderness to palpation was scored on a scale from 0 to 3 where 0 represented no pain, and 1, 2, and 3 represented mild, moderate, and severe tenderness respectively. Results—Patellar tendon palpation was a reliable examination for a single examiner (Pearson r = 0.82). In symptomatic tendons, the positive predictive value of palpation was 68%. As a screening examination in asymptomatic subjects, the positive predictive value of tendon palpation was 36–38%. Moderate and severe palpation tenderness were better predictors of ultrasonographic tendon pathology than absent or mild tenderness (p<0.001). Tender and symptomatic tendons were more likely to have ultrasound abnormality than tenderness alone (p<0.01). Conclusions—In this age group, palpation is a reliable test but it is not cost effective in detecting patellar tendinopathy in a preparticipation examination. In symptomatic tendons, palpation is a moderately sensitive but not specific test. Mild tenderness in the patellar tendons in asymptomatic jumping athletes should be considered normal.
Resumo:
The human connectome has recently become a popular research topic in neuroscience, and many new algorithms have been applied to analyze brain networks. In particular, network topology measures from graph theory have been adapted to analyze network efficiency and 'small-world' properties. While there has been a surge in the number of papers examining connectivity through graph theory, questions remain about its test-retest reliability (TRT). In particular, the reproducibility of structural connectivity measures has not been assessed. We examined the TRT of global connectivity measures generated from graph theory analyses of 17 young adults who underwent two high-angular resolution diffusion (HARDI) scans approximately 3 months apart. Of the measures assessed, modularity had the highest TRT, and it was stable across a range of sparsities (a thresholding parameter used to define which network edges are retained). These reliability measures underline the need to develop network descriptors that are robust to acquisition parameters.
Resumo:
Aerobic exercise training performed at the intensity eliciting maximal fat oxidation (Fatmax) has been shown to improve the metabolic profile of obese patients. However, limited information is available on the reproducibility of Fatmax and related physiological measures. The aim of this study was to assess the intra-individual variability of: a) Fatmax measurements determined using three different data analysis approaches and b) fat and carbohydrate oxidation rates at rest and at each stage of an individualized graded test. Fifteen healthy males [body mass index 23.1±0.6 kg/m2, maximal oxygen consumption () 52.0±2.0 ml/kg/min] completed a maximal test and two identical submaximal incremental tests on ergocycle (30-min rest followed by 5-min stages with increments of 7.5% of the maximal power output). Fat and carbohydrate oxidation rates were determined using indirect calorimetry. Fatmax was determined with three approaches: the sine model (SIN), measured values (MV) and 3rd polynomial curve (P3). Intra-individual coefficients of variation (CVs) and limits of agreement were calculated. CV for Fatmax determined with SIN was 16.4% and tended to be lower than with P3 and MV (18.6% and 20.8%, respectively). Limits of agreement for Fatmax were −2±27% of with SIN, −4±32 with P3 and −4±28 with MV. CVs of oxygen uptake, carbon dioxide production and respiratory exchange rate were <10% at rest and <5% during exercise. Conversely, CVs of fat oxidation rates (20% at rest and 24–49% during exercise) and carbohydrate oxidation rates (33.5% at rest, 8.5–12.9% during exercise) were higher. The intra-individual variability of Fatmax and fat oxidation rates was high (CV>15%), regardless of the data analysis approach employed. Further research on the determinants of the variability of Fatmax and fat oxidation rates is required.
Resumo:
In the 'free-ball' version of the micro-scale abrasion or ball-cratering test the rotating ball rests against a tilted sample and a grooved drive shaft. Tests under nominally identical conditions with different apparatus commonly show small but significant differences in measured wear rate. An indirect method has been developed and demonstrated for continuous on-line measurement of the coefficient of friction in the free-ball test. Experimental investigation of the effects of sample tilt angle and drive shaft groove width shows that both these factors influence the stability of the rotation of the ball, and the shape of the abrasive slurry pool, which in turn affect the coefficient of friction in the wear scar area and the measured wear rate. It is suggested that in order to improve the reproducibility of this method the geometry of the apparatus should be specified. For the apparatus used in this work with a steel ball of 25 mm diameter, a sample tilt angle of 60-75° and a shaft groove width of about 10mm provided the most stable ball motion and a wear rate which showed least variability. © 2004 Elsevier B.V. All rights reserved.
Resumo:
BACKGROUND: Measurement of CD4+ T-lymphocytes (CD4) is a crucial parameter in the management of HIV patients, particularly in determining eligibility to initiate antiretroviral treatment (ART). A number of technologies exist for CD4 enumeration, with considerable variation in cost, complexity, and operational requirements. We conducted a systematic review of the performance of technologies for CD4 enumeration. METHODS AND FINDINGS: Studies were identified by searching electronic databases MEDLINE and EMBASE using a pre-defined search strategy. Data on test accuracy and precision included bias and limits of agreement with a reference standard, and misclassification probabilities around CD4 thresholds of 200 and 350 cells/μl over a clinically relevant range. The secondary outcome measure was test imprecision, expressed as % coefficient of variation. Thirty-two studies evaluating 15 CD4 technologies were included, of which less than half presented data on bias and misclassification compared to the same reference technology. At CD4 counts <350 cells/μl, bias ranged from -35.2 to +13.1 cells/μl while at counts >350 cells/μl, bias ranged from -70.7 to +47 cells/μl, compared to the BD FACSCount as a reference technology. Misclassification around the threshold of 350 cells/μl ranged from 1-29% for upward classification, resulting in under-treatment, and 7-68% for downward classification resulting in overtreatment. Less than half of these studies reported within laboratory precision or reproducibility of the CD4 values obtained. CONCLUSIONS: A wide range of bias and percent misclassification around treatment thresholds were reported on the CD4 enumeration technologies included in this review, with few studies reporting assay precision. The lack of standardised methodology on test evaluation, including the use of different reference standards, is a barrier to assessing relative assay performance and could hinder the introduction of new point-of-care assays in countries where they are most needed.
Resumo:
Data obtained with any research tool must be reproducible, a concept referred to as reliability. Three techniques are often used to evaluate reliability of tools using continuous data in aging research: intraclass correlation coefficients (ICC), Pearson correlations, and paired t tests. These are often construed as equivalent when applied to reliability. This is not correct, and may lead researchers to select instruments based on statistics that may not reflect actual reliability. The purpose of this paper is to compare the reliability estimates produced by these three techniques and determine the preferable technique. A hypothetical dataset was produced to evaluate the reliability estimates obtained with ICC, Pearson correlations, and paired t tests in three different situations. For each situation two sets of 20 observations were created to simulate an intrarater or inter-rater paradigm, based on 20 participants with two observations per participant. Situations were designed to demonstrate good agreement, systematic bias, or substantial random measurement error. In the situation demonstrating good agreement, all three techniques supported the conclusion that the data were reliable. In the situation demonstrating systematic bias, the ICC and t test suggested the data were not reliable, whereas the Pearson correlation suggested high reliability despite the systematic discrepancy. In the situation representing substantial random measurement error where low reliability was expected, the ICC and Pearson coefficient accurately illustrated this. The t test suggested the data were reliable. The ICC is the preferred technique to measure reliability. Although there are some limitations associated with the use of this technique, they can be overcome.
Resumo:
Introducción La mutación genética Val30Met de la proteína transtiretina (TTR) es causante de la polineuropatía amiloidótica familiar, comprometiendo en fases iniciales las fibras nerviosas pequeñas (mielinizadas Aδ y amielínicas tipo C), involucradas en funciones autonómicas, nocicepción, percepción térmica y sudoración. Los métodos neurofisiológicos convencionales, no logran detectar dichas anormalidades, retardando el inicio de tratamientos específicos para la enfermedad. Metodología El objetivo principal fue evaluar el test de cuantificación sensitiva (QST) como método de detección temprana de anormalidades de fibra pequeña, en individuos Val30Met, seguidos en el Hospital Universitario Santa María, Lisboa. Se clasificaron los pacientes en 3 grupos, según sintomatología y examen neurológico. Se analizaron los umbrales para percepción de frío, dolor con el calor y vibración en los grupos, en correlación con controles sanos. Resultados 18 registros de controles sanos y 33 de individuos con la mutación, divididos en asintomáticos (24,2%), sintomáticos con examen neurológico normal (42,4%) y sintomáticos con examen neurológico anormal (33,3%). No se encontraron diferencias entre los pacientes asintomáticos y los controles. Los umbrales para frío (p=0,042) y en el dolor intermedio con el calor (HP 5) (p=0,007) se encuentran elevados en individuos Val30Met sintomáticos con examen normal. En los pacientes sintomáticos con alteraciones al examen, también se presentaron alteraciones en el intervalo entre el inicio y el dolor intermedio con el calor (HP 5-0,5) (p=0,009). Discusión Los umbrales de frío y de percepción de dolor con el calor, permiten detectar anormalidades en personas con la mutación TTR Val30Met, sintomáticos, incluyendo aquellos sin cambios objetivos al examen neurológico.
Resumo:
El objetivo de este estudio fue realizar una prueba de validez diagnostica del test neural 1 para el diagnóstico del Síndrome de Túnel del Carpo (STC) utilizando como prueba de referencia o de oro el test de conducción nerviosa. En este estudio participaron 115 sujetos, 230 manos con sospecha clínica de STC quienes fueron evaluados con el test de conducción nerviosa y el test neural 1. Se encontró una sensibilidad del 93.0% (IC 95%:88,21-96,79) y una especificidad del 6,67% (IC 95%:0,0-33,59), razón de verosimilitud positiva fue de 1,00 y razón de verosimilitud negativa de 1,05. Valor predictivo positivo de 86,9% y un valor predictivo negativo de 12,5%. Se concluye que el test neural 1 es una prueba clínica de alta sensibilidad y baja especificidad de gran utilidad para el monitoreo e identificación del STC. Es un procedimiento para el diagnóstico clínico de bajo costo que puede incluirse en los exámenes de rutina de los trabajadores como complemento a las pruebas clínicas sugeridas por las Gatiso para dar mayor precisión a la identificación temprana del STC. Se sugiere combinarla con otros test de mayor especificidad para ser aplicada en trabajadores en condiciones de riesgo o que presenten síntomas en miembros superiores y realizar otros estudios en donde participen sujetos sin diagnóstico clínico del STC.
Resumo:
Existen importantes pruebas de valoración que miden habilidades o competencias motoras en el niño; a pesar de ello Colombia carece de estudios que demuestren la validez y la confiabilidad de un test de medición que permita emitir un juicio valorativo relacionado con las competencias motoras infantiles, teniendo presente que la intervención debe basarse en la rigurosidad que exigen los procesos de valoración y evaluación del movimiento corporal. Objetivo. El presente estudio se centró en determinar las propiedades psicométricas del test de competencias motoras Bruininiks Oseretsky –BOT 2- segunda edición. Materiales y métodos. Se realizó una evaluación de pruebas diagnósticas con 24 niños aparentemente sanos de ambos géneros, entre 4 y 7 años, residentes en las ciudades de Chía y Bogotá. La evaluación fue realizada por 3 evaluadores expertos; el análisis para consistencia interna se realizó utilizando el Coeficiente Alfa de Cronbach, el análisis de reproducibilidad se estableció a través del Coeficiente de Correlación Intraclase –CCI- y para el análisis de la validez concurrente se utilizó el Coeficiente de Correlación de Pearson, considerando un alfa=0.05. Resultados. Para la totalidad de las pruebas, se encontraron altos índices de confiabilidad y validez. Conclusiones. El BOT 2 es un instrumento válido y confiable, que puede ser utilizado para la evaluación e identificación del nivel de desarrollo en que se encuentran las competencias motoras en el niño.
Resumo:
Background: Advances in nutritional assessment are continuing to embrace developments in computer technology. The online Food4Me food frequency questionnaire (FFQ) was created as an electronic system for the collection of nutrient intake data. To ensure its accuracy in assessing both nutrient and food group intake, further validation against data obtained using a reliable, but independent, instrument and assessment of its reproducibility are required. Objective: The aim was to assess the reproducibility and validity of the Food4Me FFQ against a 4-day weighed food record (WFR). Methods: Reproducibility of the Food4Me FFQ was assessed using test-retest methodology by asking participants to complete the FFQ on 2 occasions 4 weeks apart. To assess the validity of the Food4Me FFQ against the 4-day WFR, half the participants were also asked to complete a 4-day WFR 1 week after the first administration of the Food4Me FFQ. Level of agreement between nutrient and food group intakes estimated by the repeated Food4Me FFQ and the Food4Me FFQ and 4-day WFR were evaluated using Bland-Altman methodology and classification into quartiles of daily intake. Crude unadjusted correlation coefficients were also calculated for nutrient and food group intakes. Results: In total, 100 people participated in the assessment of reproducibility (mean age 32, SD 12 years), and 49 of these (mean age 27, SD 8 years) also took part in the assessment of validity. Crude unadjusted correlations for repeated Food4Me FFQ ranged from .65 (vitamin D) to .90 (alcohol). The mean cross-classification into “exact agreement plus adjacent” was 92% for both nutrient and food group intakes, and Bland-Altman plots showed good agreement for energy-adjusted macronutrient intakes. Agreement between the Food4Me FFQ and 4-day WFR varied, with crude unadjusted correlations ranging from .23 (vitamin D) to .65 (protein, % total energy) for nutrient intakes and .11 (soups, sauces and miscellaneous foods) to .73 (yogurts) for food group intake. The mean cross-classification into “exact agreement plus adjacent” was 80% and 78% for nutrient and food group intake, respectively. There were no significant differences between energy intakes estimated using the Food4Me FFQ and 4-day WFR, and Bland-Altman plots showed good agreement for both energy and energy-controlled nutrient intakes. Conclusions: The results demonstrate that the online Food4Me FFQ is reproducible for assessing nutrient and food group intake and has moderate agreement with the 4-day WFR for assessing energy and energy-adjusted nutrient intakes. The Food4Me FFQ is a suitable online tool for assessing dietary intake in healthy adults.
Resumo:
The aim of this in vitro study was to assess the influence of varying examiner's clinical experience on the reproducibility and accuracy of radiographic examination for occlusal caries detection. Standardized bitewing radiographs were obtained from 166 permanent molars. Radiographic examination was performed by final-year dental students from two universities (A, n=5; B, n=5) and by dentists with 5 to 7 years of experience who work in two different countries (C, n=5; D, n=5). All examinations were repeated after 1-week interval. The teeth were histologically prepared and assessed for caries extension. For intraexaminer reproducibility, the unweighted kappa values were: A (0.11-0.40), B (0.12-0.33), C (0.47-0.58), and D (0.42-0.71). Interexaminer reproducibility statistics were computed based on means +/- SD of unweighted kappa values: A (0.07 +/- 0.05), B (0.12 +/- 0.09), C (0.24 +/- 0.08), and D (0.33 +/- 0.10). Sensitivity, specificity, and accuracy were calculated at D(1) and D(3) thresholds and compared by performing McNemar test (p=0.05). D(1) sensitivity ranged between 0.29 and 0.75 and specificity between 0.24 and 0.85. D(3) specificity was moderate to high (between 0.62 and 0.95) for all groups, with statistically significant difference between the dentists groups (C and D). Sensitivity was low to moderate (between 0.21 and 0.57) with statistically significant difference for groups B and D. Accuracy was similar for all groups (0.55). Spearman's correlations were: A (0.12), B (0.24), C (0.30), and D (0.38). In conclusion, the reproducibility of radiographic examination was influenced by the examiner's clinical experience, training, and dental education as well as the accuracy in detecting occlusal caries.
Resumo:
The aim of this study was to evaluate the efficacy of a dip slide test for mutans streptococci in caries risk assessment, when the microbiological results were compared to well-defined clinical criteria (DCC) for caries risk, clinically measured through high and low caries activity. Eighty-one volunteers from the 6(th) to 8(th) grades from public schools of Piracicaba, São Paulo, Brazil, were evaluated for dental caries. All free smooth surfaces were evaluated to check whether or not there were white spots. Based on the subjects' caries experience, a calibrated clinician divided them into groups of high and low caries activity. The subjects were submitted to a salivary test (CARITEST SM (R)) from the same batch number. Kappa statistics (kappa) were applied to verify the reproducibility of the simplified test, checked through interexaminer agreement when the results were classified by independent and blind means. The microbiological results were validated according to expressions of sensitivity and specificity. A moderate agreement was verified as the results were classified according to 6 scores (kappa =0.55), and the agreement was substantial when the results were classified according to high and low microbiological count (kappa =0.78). The sensitivity and specificity values were 0.59 and 0.85, respectively, showing that the test was more specific than sensitive, and could thus better identify the low caries risk subjects.
Resumo:
Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
Resumo:
The reproducibility of measurements of alveolar bone loss on radiographs may be a problem on epidemiologic studies, as they are based on comparisons of the diagnosis of various examiners. The aim of the present research paper was to assess the inter- and intra-examiner reproducibility of measurements of the interproximal alveolar bone loss on non-manipulated digital radiographs and after the application of image filters. Five Oral Radiologists measured the distance between the cementoenamel junction (CEJ) to the alveolar crest or to the deepest point of the bony defect on 12 interproximal digital radiographs of molars and bicuspids of a dry human skull. The digital manipulation and the linear measurements were obtained with the Trophy Windows software (Throphy®). For each image, six different versions were created: 1) non-manipulated; 2) bright-contrast adjustment; 3) negative; 4) negative with brightness-contrast adjustment; 5) pseudo-colored; 6) pseudo-colored with brightness-contrast adjustment. In order to prevent interpretation bias because of the repetition of measurements, the examiners measured the radiographs in a random sequence. The two-way ANOVA test at 5% level of significance to compare the means of readings of the same operator with each filter indicated p<0.05 for the majority of operators, while the comparison between the mean values of operators using the same filter indicated p>0.05 for all filters. Based on the results, we concluded that linear measurements of interproximal alveolar bone loss on digital radiographs are highly reproducible among examiners. Nevertheless, the application of image filters significantly influenced the degree of intra-examiner reproducibility. Some filters even reduced the reproducibility of intra-examiner readings.