874 results for VALIDITY OF TESTS
Abstract:
We consider a robust version of the classical Wald test statistics for testing simple and composite null hypotheses in general parametric models. These test statistics are based on minimum density power divergence estimators instead of maximum likelihood estimators. An extensive study of their robustness properties is given through the influence functions as well as the chi-square inflation factors. It is theoretically established that the level and power of these robust tests are stable against outliers, whereas the classical Wald test breaks down. Some numerical examples confirm the validity of the theoretical results.
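To make the estimator behind these tests concrete, here is a minimal sketch for the simplest case: a normal mean with known standard deviation. It illustrates only the minimum density power divergence estimate, not the Wald-type test itself. The tuning parameter `beta`, the function name, and the data are assumptions chosen for illustration; for this model the estimating equation reduces to a weighted mean with weights exp(-beta (x - mu)^2 / (2 sigma^2)), which the code solves by fixed-point iteration.

```python
import math
from statistics import mean, median


def mdpde_normal_mean(xs, beta=0.5, sigma=1.0, tol=1e-10, max_iter=200):
    """Minimum density power divergence estimate of a normal mean (sigma known).

    Solves sum_i w_i * (x_i - mu) = 0, where
    w_i = exp(-beta * (x_i - mu)^2 / (2 * sigma^2)),
    by fixed-point iteration started at the sample median.
    """
    mu = median(xs)
    for _ in range(max_iter):
        weights = [math.exp(-beta * (x - mu) ** 2 / (2 * sigma ** 2)) for x in xs]
        new_mu = sum(w * x for w, x in zip(weights, xs)) / sum(weights)
        if abs(new_mu - mu) < tol:
            return new_mu
        mu = new_mu
    return mu


# Hypothetical sample: five observations near zero and one gross outlier.
sample = [-0.5, 0.1, 0.3, -0.2, 0.4, 50.0]
robust = mdpde_normal_mean(sample)   # stays close to the bulk of the data
classical = mean(sample)             # maximum likelihood estimate, pulled by the outlier
```

The outlier receives a weight that is numerically negligible, so the robust estimate stays near the bulk of the data while the sample mean does not. A test statistic built on the robust estimate inherits that stability, which is the behaviour the abstract describes.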
Abstract:
Background/Aims: To develop and assess the psychometric validity of a Chinese-language vision-related quality-of-life (VRQoL) measurement instrument for Chinese people with visual impairment. Methods: The Low Vision Quality of Life Questionnaire (LVQOL) was translated and adapted into the Chinese-version Low Vision Quality of Life Questionnaire (CLVQOL). The CLVQOL was completed by 100 randomly selected people with low vision (primary group) and 100 people with normal vision (control group). Ninety-four participants from the primary group completed the CLVQOL a second time 2 weeks later (test-retest group). The internal consistency reliability, test-retest reliability, item-internal consistency, item-discrimination validity, construct validity and discriminatory power of the CLVQOL were calculated. Results: The review committee agreed that the CLVQOL replicated the meaning of the LVQOL and was sensitive to cultural differences. The Cronbach's α coefficient and the split-half coefficient for the four scales and the total CLVQOL scale were 0.75-0.97. The test-retest reliability, as estimated by the intraclass correlation coefficient, was 0.69-0.95. Item-internal consistency was >0.4 and item-discrimination validity was generally <0.40. Varimax-rotated factor analysis of the CLVQOL identified four principal factors. The quality-of-life ratings on the four subscales and the total CLVQOL score were lower in the primary group than in the control group, in both hospital-based and community-based subjects. Conclusion: The CLVQOL is a culturally specific vision-related quality-of-life measurement instrument. It satisfies conventional psychometric criteria, discriminates visually healthy populations from low vision patients and may be valuable in screening the local community as well as for use in clinical practice or research. © Springer 2005.
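The internal consistency figures quoted above are Cronbach's α values. As a rough illustration of how such a coefficient is obtained, the sketch below applies the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total score). The function name and the response matrix are hypothetical and have nothing to do with the CLVQOL data.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [pvariance([row[j] for row in scores]) for j in range(k)]
    # Variance of the total (summed) score across respondents.
    total_var = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 5 respondents, 3 items on a 1-5 scale.
responses = [
    [3, 4, 3],
    [5, 5, 4],
    [1, 2, 2],
    [4, 4, 5],
    [2, 3, 2],
]
alpha = cronbach_alpha(responses)
```

Population variances are used throughout; sample variances would give the same α because the normalising factor cancels in the ratio.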
Abstract:
Paper published in the proceedings of the National Conference "Education in the Information Society", Plovdiv, May 2011.
Abstract:
The importance of checking the normality assumption in most statistical procedures, especially parametric tests, cannot be overemphasized, as the validity of the inferences drawn from such procedures usually depends on the validity of this assumption. Numerous methods have been proposed by different authors over the years, some popular and frequently used, others less so. This study assesses the performance of eighteen of the available tests for different sample sizes, significance levels, and a number of symmetric and asymmetric distributions by means of a Monte Carlo simulation. The results showed that considerable power is not achieved for symmetric distributions when the sample size is less than one hundred and that, for such distributions, the kurtosis test is most powerful provided the distribution is leptokurtic or platykurtic. The Shapiro-Wilk test remains the most powerful test for asymmetric distributions. We conclude that different tests are suitable under different characteristics of alternative distributions.
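The Monte Carlo procedure described above can be sketched in a few lines: draw many samples from a chosen distribution, apply a normality test to each, and report the rejection rate. The example below uses a simple skewness test based on the large-sample approximation sqrt(n/6) · g1 ~ N(0, 1). That test, the sample size, the number of replications, and the seed are all assumptions made for illustration and are not claimed to match the study's design.

```python
import math
import random


def sample_skewness(xs):
    """Moment-based sample skewness g1 = m3 / m2^(3/2)."""
    n = len(xs)
    m = sum(xs) / n
    m2 = sum((x - m) ** 2 for x in xs) / n
    m3 = sum((x - m) ** 3 for x in xs) / n
    return m3 / m2 ** 1.5


def rejects_normality(xs, z_crit=1.96):
    """Two-sided skewness test at roughly the 5% level (asymptotic approximation)."""
    z = math.sqrt(len(xs) / 6.0) * sample_skewness(xs)
    return abs(z) > z_crit


def rejection_rate(draw, n, reps, rng):
    """Fraction of `reps` simulated samples of size `n` for which normality is rejected."""
    hits = sum(rejects_normality([draw(rng) for _ in range(n)]) for _ in range(reps))
    return hits / reps


rng = random.Random(12345)
# Empirical size: data really are normal, so rejections are false alarms.
size = rejection_rate(lambda r: r.gauss(0.0, 1.0), n=100, reps=1000, rng=rng)
# Empirical power against an asymmetric alternative (exponential distribution).
power = rejection_rate(lambda r: r.expovariate(1.0), n=100, reps=1000, rng=rng)
```

A skewness-based test has little power against symmetric alternatives by construction, which is one reason studies of this kind compare many tests across different families of distributions.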
Abstract:
Background: Internationally, tests of general mental ability are used in the selection of medical students. Examples include the Medical College Admission Test, the Undergraduate Medicine and Health Sciences Admission Test and the UK Clinical Aptitude Test. The most widely used measure of their efficacy is predictive validity. A new tool, the Health Professions Admission Test-Ireland (HPAT-Ireland), was introduced in 2009. Traditionally, selection to Irish undergraduate medical schools relied on academic achievement. Since 2009, Irish and EU applicants have been selected on a combination of their secondary school academic record (measured predominantly by the Leaving Certificate Examination) and HPAT-Ireland score. This is the first study to report on the predictive validity of the HPAT-Ireland for early undergraduate assessments of communication and clinical skills. Method: Students enrolled at two Irish medical schools in 2009 were followed up for two years. Data collected were gender; HPAT-Ireland total and subsection scores; Leaving Certificate Examination plus HPAT-Ireland combined score; Year 1 Objective Structured Clinical Examination (OSCE) scores (total score, communication and clinical subtest scores); Year 1 Multiple Choice Questions (MCQ) marks; and Year 2 OSCE and subset scores. We report descriptive statistics, Pearson correlation coefficients and multiple linear regression models. Results: Data were available for 312 students. In Year 1 none of the selection criteria were significantly related to student OSCE performance. The Leaving Certificate Examination and Leaving Certificate plus HPAT-Ireland combined scores correlated with MCQ marks. In Year 2 a series of significant correlations emerged between the HPAT-Ireland, and subsections thereof, and OSCE Communication Z-scores, OSCE Clinical Z-scores, and Total OSCE Z-scores. However, on multiple regression only the relationship between Total OSCE Score and the Total HPAT-Ireland score remained significant, although the predictive power was modest. Conclusion: We found that none of our selection criteria strongly predict clinical and communication skills. The HPAT-Ireland appears to measure ability in domains different from those assessed by the Leaving Certificate Examination. While some significant associations did emerge in Year 2 between HPAT-Ireland and total OSCE scores, further evaluation is required to establish whether this pattern continues during the senior years of the medical course.
Abstract:
PURPOSE To identify the factors responsible for the poor validity of the most common aniseikonia tests, which involve size comparisons of red-green stimuli presented haploscopically. METHODS Aniseikonia was induced by afocal size lenses placed before one eye. Observers compared the sizes of semicircles presented haploscopically via color filters. The main factor under study was viewing mode (free viewing versus short presentations under central fixation). To eliminate response bias, a three-response format allowed observers to respond if the left, the right, or neither semicircle appeared larger than the other. To control decisional (criterion) bias, measurements were taken with the lens-magnified stimulus placed on the left and on the right. To control for size-color illusions, measurements were made with color filters in both arrangements before the eyes and under binocular vision (without color filters). RESULTS Free viewing resulted in a systematic underestimation of lens-induced aniseikonia that was absent with short presentations. Significant size-color illusions and decisional biases were found that would be mistaken for aniseikonia unless appropriate action is taken. CONCLUSIONS To improve their validity, aniseikonia tests should use short presentations and include control conditions to prevent contamination from decisional/response biases. If anaglyphs are used, presence of size-color illusions must be checked for. TRANSLATIONAL RELEVANCE We identified optimal conditions for administration of aniseikonia tests and appropriate action for differential diagnosis of aniseikonia in the presence of response biases or size-color illusions. Our study has clinical implications for aniseikonia management.
Abstract:
The International Fitness Scale (IFIS) is a self-reported measure of physical fitness. This scale has been validated in children, adolescents, and young adults; however, it is unknown whether the IFIS represents a valid and reliable estimate of physical fitness in Latin American youth populations. In the present study we aimed to examine the validity and reliability of the IFIS in a population-based sample of schoolchildren in Bogota, Colombia. Participants were 1,875 Colombian youth (56.2% girls) aged 9 to 17.9 years. We measured adiposity markers (body fat, waist-to-height ratio, skinfold thicknesses and BMI), blood pressure, lipid profile, fasting glucose, and physical fitness level (self-reported and measured). A validated cardiometabolic risk index was also used. An age- and sex-matched sample of 229 schoolchildren not originally included in the study sample completed the IFIS twice for reliability purposes. Our data suggest that both measured and self-reported overall fitness were inversely associated with adiposity indicators and a cardiometabolic risk score. Overall, schoolchildren who self-reported “good” and “very good” fitness had better measured fitness than those who reported “very poor” and “poor” fitness (all p<0.001). Test–retest reliability of IFIS items was also good, with an average weighted Kappa of 0.811. Therefore, our findings suggest that self-reported fitness, as assessed by the IFIS, is a valid, reliable, and health-related measure, and that it can be a good alternative for future use in large studies with Latin American schoolchildren from Colombia.
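The test-retest figure above is a weighted kappa, which credits near-misses between two ordinal ratings more than distant disagreements. The sketch below computes a linearly weighted Cohen's kappa for two administrations of one item. The linear weighting scheme, the coding of five ordered categories as 0 to 4, and the ratings themselves are assumptions made for illustration; the abstract does not state which weights were used.

```python
def weighted_kappa(first, second, n_categories):
    """Linearly weighted Cohen's kappa for paired ordinal ratings coded 0..n_categories-1."""
    n = len(first)
    k = n_categories
    # Joint proportions of (first rating, second rating).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(first, second):
        observed[a][b] += 1.0 / n
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]
    disagree_obs = 0.0
    disagree_exp = 0.0
    for i in range(k):
        for j in range(k):
            w = abs(i - j) / (k - 1)                 # linear disagreement weight
            disagree_obs += w * observed[i][j]       # observed weighted disagreement
            disagree_exp += w * row[i] * col[j]      # expected under independence
    return 1.0 - disagree_obs / disagree_exp


# Hypothetical test and retest answers from eight respondents (0 = very poor ... 4 = very good).
test_answers = [0, 1, 2, 3, 4, 2, 3, 1]
retest_answers = [0, 1, 2, 4, 4, 2, 2, 1]
kappa = weighted_kappa(test_answers, retest_answers, n_categories=5)
```

With these made-up ratings, two respondents shift by one category between administrations, so the coefficient falls below 1 but remains high.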
Abstract:
OBJECTIVE: To assess the validity of self-reported weight, height, and body mass index (BMI) and their reliability for diagnosing the nutritional status of adolescents in Piracicaba. METHODS: The study included 360 adolescents of both sexes, aged 10 to 15 years, from public schools in Piracicaba. The adolescents self-reported their weight and height, and these values were obtained by direct measurement by the interviewers immediately afterwards. The validity of self-reported BMI was calculated using sensitivity, specificity, and positive predictive value (PPV). Agreement between the BMI categories obtained from self-reported and measured values was assessed using the weighted kappa coefficient, Lin's correlation coefficient, and Bland-Altman and Lin plots. RESULTS: Both boys and girls underestimated their weight (-1.0 for girls and boys) and height (girls -1.2 and boys -0.8) (p < 0.001). Measured and self-reported BMI values showed moderate agreement. The sensitivity of self-reported BMI for classifying obese individuals was higher for boys (87.5%), whereas specificity was higher for girls (92.7%). The PPV was high only for the classification of normal weight. CONCLUSIONS: Self-reported weight and height of adolescents are not valid measures and therefore should not be used in place of measured values. In addition, 10% of obese boys and 40% of obese girls could remain unidentified if self-reported measures were used, confirming the low validity of self-reported measures.
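The three validity indices named in the methods come from a two-by-two table that cross-classifies the self-reported category against the measured one. A minimal sketch follows; the function name and the counts are hypothetical and are not taken from the study.

```python
def screening_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity and positive predictive value from a 2x2 table.

    tp: obese by measurement and by self-report
    fp: not obese by measurement, obese by self-report
    fn: obese by measurement, not obese by self-report
    tn: not obese by either
    """
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
    }


# Hypothetical counts for illustration only.
metrics = screening_metrics(tp=14, fp=10, fn=6, tn=130)
```

In this made-up table the false-negative fraction, 1 minus sensitivity, is the share of truly obese adolescents that self-report would miss, which is the quantity the conclusions emphasise.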
Abstract:
Phaethornis longuemareus aethopyga was described by John T. Zimmer in 1950 and treated as a valid subspecies until it was proposed that the three known specimens were hybrids between P. ruber and P. rupurumii amazonicus. On the basis of some recently collected specimens, we reevaluated the validity of P. l. aethopyga. Despite showing some differences related to age and sex, all specimens agree in the general plumage pattern and are fully diagnosable when compared with any other taxon of the genus. The hypothesis of a hybrid origin becomes unsustainable when one notes that (1) P. l. aethopyga has characters that are unique and absent in the purported parental species, such as the white outer margins at the base of the rectrices; and (2) P. l. aethopyga occurs far from the distribution of one of the alleged parental species. Furthermore, field data show that P. l. aethopyga has attributes typical of a valid and independent taxon, such as lekking behavior. Therefore, given its overall diagnosis, P. aethopyga could at least be treated as a phylogenetic species. Yet its morphological and vocal distinctiveness with respect to other Phaethornis spp. in the "Pygmornis group" is greater than that observed between some species pairs traditionally regarded as separate biological species within the group, which supports its recognition as a species under the biological species concept. Received 13 July 2008, accepted 9 March 2009.
Abstract:
We evaluated the reliability and validity of a Brazilian-Portuguese version of the Epilepsy Medication Treatment Complexity Index (EMTCI). Interrater reliability was evaluated with the intraclass correlation coefficient (ICC), and validity was evaluated by correlating mean EMTCI scores with the following variables: number of antiepileptic drugs (AEDs), seizure control, patients' perception of seizure control, and adherence to the therapeutic regimen as measured with the Morisky scale. We studied patients with epilepsy followed in a tertiary university-based hospital outpatient clinic, aged 18 years or older, independent in daily living activities, and without cognitive impairment or active psychiatric disease. ICCs ranged from 0.721 to 0.999. Mean EMTCI scores were significantly correlated with the variables assessed. Higher EMTCI scores were associated with an increasing number of AEDs, uncontrolled seizures, patients' perception of lack of seizure control, and poorer adherence to the therapeutic regimen. The results indicate that the Brazilian-Portuguese EMTCI is reliable and valid for clinical application in the country. The Brazilian-Portuguese EMTCI version may be a useful tool in developing strategies to minimize treatment complexity, possibly improving seizure control and quality of life in people with epilepsy in our milieu. (C) 2011 Elsevier Inc. All rights reserved.
Abstract:
The objective of this study was to validate the Piper Fatigue Scale-Revised (PFS-R) for use in Brazilian culture. Translation of the PFS-R into Portuguese and validity and reliability tests were performed. Convenience samples in Brazil were as follows: 584 cancer patients (mean age 57 ± 13 years; 51.3% female); 184 caregivers (mean age 50 ± 12.7 years; 65.8% female); and 189 undergraduate nursing students (mean age 21.6 ± 2.8 years; 96.2% female). Instruments used were as follows: Brazilian PFS, Beck Depression Inventory (BDI), and Karnofsky Performance Scale (KPS). The 22 items of the Brazilian PFS loaded well (factor loading > 0.35) on three dimensions identified by factor analysis (behavioral, affective, and sensorial-psychological). These dimensions explained 65% of the variance. Internal consistency reliability was very good (Cronbach's alpha ranged from 0.841 to 0.943 for the total scale and its dimensions). Cancer patients and their caregivers completed the Brazilian PFS twice for test-retest reliability, and results showed good stability (Pearson's r ≥ 0.60, p < 0.001). Correlations between the Brazilian PFS and other scales were significant, in the hypothesized directions, and mostly moderate, contributing to divergent (Brazilian PFS x KPS) and convergent validity (Brazilian PFS x BDI). Mild, moderate, and severe fatigue were reported by 73 (12.5%), 167 (28.6%), and 83 (14.2%) patients, respectively. Surprisingly, students had the highest mean total fatigue scores; no significant differences were observed between patients and caregivers, showing poor discriminant validity. While the Brazilian PFS is a reliable and valid instrument to measure fatigue in Brazilian cancer patients, further work is needed to evaluate the discriminant validity of the scale in Brazil.
Abstract:
This paper reports the results of tests on 32 concrete-filled steel tubular columns under axial load. The test parameters were the concrete compressive strength, the column slenderness (L/D) and the wall thickness (t). The test results were compared with predictions from the codes NBR 8800:2008 and EN 1994-1-1:2004 (EC4). The columns had length-to-diameter ratios (L/D) of 3, 5, 7 and 10 and were tested with concrete compressive strengths of 30 MPa, 60 MPa, 80 MPa and 100 MPa. The ultimate strengths predicted by the codes showed good agreement with the experimental results. The NBR 8800 predictions were the most conservative, and EC4 gave the best results on average, but it was not conservative for usual concrete-filled short columns.
Abstract:
This paper presents new experimental results on Vortex-Induced Vibration (VIV) of inclined cylinders. Models are mounted on a low-damping air-bearing elastic base with one degree of freedom, constrained to oscillate only in the direction transverse to a free stream. The Reynolds number varied in the range 2000 ≲ Re ≲ 8000. New measurements of the dynamic response of inclined cylinders due to VIV are compared with previous experiments on a vertical cylinder. Models with circular and elliptical cross sections have been tested. The purpose of this work is to check the validity of the normal velocity correction in VIV studies of inclined structures. The results show that the reduced velocity range in which the upper and lower branches of VIV occur is similar to the vertical cylinder case if the proper projected velocity is considered. Tests have been conducted to support this observation with inclinations up to 45 degrees. We have also observed that the amplitudes of oscillation of the inclined circular cylinder are comparable to, but slightly lower than, the amplitudes observed in the vertical cylinder experiments. Measured forces and added mass also show similar behaviour. However, for cases with an elliptical cylinder, the amplitudes of oscillation are considerably lower than those observed for a circular cylinder. This difference is explained by the higher added mass of the elliptical cylinder. (C) 2009 Elsevier Ltd. All rights reserved.
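The normal velocity correction mentioned above can be illustrated with a small calculation: only the flow component perpendicular to the cylinder axis is used when forming the reduced velocity. The sketch assumes the common definition U / (f_n · D), with the inclination measured from the vertical so that 0 degrees recovers the vertical cylinder case. The symbol names, that angle convention, and the numerical values are assumptions for illustration and do not come from the paper.

```python
import math


def reduced_velocity(u, f_n, d, inclination_deg=0.0):
    """Reduced velocity based on the velocity component normal to the cylinder axis.

    u: free-stream velocity (m/s)
    f_n: natural frequency of the elastic base (Hz)
    d: cylinder diameter (m)
    inclination_deg: angle between the cylinder axis and the vertical (assumed convention)
    """
    u_normal = u * math.cos(math.radians(inclination_deg))
    return u_normal / (f_n * d)


# Hypothetical values: the same free stream seen by a vertical and a 45-degree cylinder.
vr_vertical = reduced_velocity(u=0.2, f_n=1.0, d=0.04)
vr_inclined = reduced_velocity(u=0.2, f_n=1.0, d=0.04, inclination_deg=45.0)
```

Under this correction an inclined cylinder reaches a given reduced velocity at a higher free-stream speed than a vertical one, which is why response curves from the two configurations are compared on the projected velocity rather than on the raw free-stream value.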
Abstract:
Background: The Rivermead Behavioural Memory Test (RBMT) assesses everyday memory by means of tasks which mimic daily challenges. The objective was to examine the validity of the Brazilian version of the RBMT to detect cognitive decline. Methods: 195 older adults were diagnosed as normal controls (NC) or with mild cognitive impairment (MCI) or Alzheimer's disease (AD) by a multidisciplinary team, after participants completed clinical and neuropsychological protocols. Results: Cronbach's alpha was high for the total sample for the RBMT profile (PS) and screening scores (SS) (PS=0.91, SS=0.87) and for the AD group (PS=0.84, SS=0.85), and moderate for the MCI (PS=0.62, SS=0.55) and NC (PS=0.62, SS=0.60) groups. RBMT total scores, Appointment, Pictures, Immediate and Delayed Story, Immediate and Delayed Route, Delayed Message and Date contributed to differentiating NC from MCI. ROC curve analyses indicated high accuracy in differentiating NC from AD patients and moderate accuracy in differentiating NC from MCI. Conclusions: The Brazilian version of the RBMT seems to be an appropriate instrument to identify memory decline in Brazilian older adults.
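The accuracy reported from the ROC analyses is usually summarised by the area under the curve, which equals the probability that a randomly chosen control scores higher than a randomly chosen patient, with ties counted as one half. The sketch below computes it by direct pairwise comparison; the function name and the scores are hypothetical and are not RBMT data.

```python
def auc(control_scores, patient_scores):
    """Area under the ROC curve when higher scores indicate better memory.

    Computed as the fraction of (control, patient) pairs in which the control
    scores higher, counting ties as half a pair.
    """
    wins = 0.0
    for c in control_scores:
        for p in patient_scores:
            if c > p:
                wins += 1.0
            elif c == p:
                wins += 0.5
    return wins / (len(control_scores) * len(patient_scores))


# Hypothetical memory scores for illustration only.
controls = [22, 20, 19, 23, 18]
patients = [12, 17, 19, 10]
area = auc(controls, patients)
```

A value of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups.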
Abstract:
Objective: Clinical evaluation of the stomatognathic system is indispensable for the diagnosis of orofacial myofunctional disorders (OMD). In order to obtain a more precise diagnosis, the protocol of orofacial myofunctional evaluation with scores (OMES protocol) (Int. J. Pediatr. Otorhinolaryngol. 72 (2008) 367-375) was expanded in terms of number of items and scale amplitude. The purpose of this study is to describe the expanded OMES protocol (OMES-E) for the evaluation of children. Validity of the protocol, reliability of the examiners and agreement between them were analyzed, as were the sensitivity, specificity and predictive values of the instrument. Methods: The sample consisted of video-recorded images of 50 children, 25 boys (mean age = 8.4 years, SD = 1.8) and 25 girls (mean age = 8.2 years, SD = 1.7), selected at random from 200 samples. Three speech therapists trained in orofacial myofunctional evaluation participated as examiners (E). The OMES and OMES-E protocols were used for evaluation on different days. E1 evaluated all images, E2 analyzed children with recordings 1 to 25 and E3 analyzed children with recordings 26 to 50. The validity of OMES-E was analyzed by comparing the instrument with the OMES protocol using the Pearson correlation test complemented with the split-half reliability test (p < 0.05). The linear weighted Kappa coefficient of agreement (Kw'), the sensitivity, specificity and predictive values and the prevalence of OMD were calculated. Results: There was a statistically significant correlation between the OMES and OMES-E protocols (0.79 < r < 0.94, p < 0.01) and a significant test-retest correlation with the OMES-E (0.75 < r < 0.86, p < 0.01), with a reliability range of 0.86-0.93. The correlation and reliability coefficients between examiners were: E1 x E2 (r = 0.74, 0.84), E1 x E3 (r = 0.70, 0.83) (p < 0.01). Kw' coefficients of moderate and good strength predominated. The OMES-E protocol presented mean sensitivity = 0.91, specificity = 0.77, positive predictive value = 0.87 and negative predictive value = 0.85. The mean prevalence of OMD was 0.58. Conclusion: The OMES-E protocol is valid and reliable for orofacial myofunctional evaluation. (C) 2010 Elsevier Ireland Ltd. All rights reserved.
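The reported indices are linked by Bayes' rule: for a single two-by-two table, the predictive values follow from sensitivity, specificity and prevalence. The sketch below applies that identity to the mean values quoted in the abstract. The function name is hypothetical, and treating the reported means as if they came from one table is an assumption; averages taken across examiners or items need not satisfy the identity exactly.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Positive and negative predictive values implied by Bayes' rule."""
    tp = sensitivity * prevalence                # true-positive fraction of the sample
    fn = (1 - sensitivity) * prevalence          # false-negative fraction
    tn = specificity * (1 - prevalence)          # true-negative fraction
    fp = (1 - specificity) * (1 - prevalence)    # false-positive fraction
    return tp / (tp + fp), tn / (tn + fn)


# Mean values quoted in the abstract, treated here as a single table (assumption).
ppv, npv = predictive_values(sensitivity=0.91, specificity=0.77, prevalence=0.58)
```

Under that assumption the implied values come out at roughly 0.85 and 0.86, close to the reported 0.87 and 0.85 but not identical, which is plausible if the published figures are means over several analyses.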