874 resultados para VALIDITY OF TESTS
Resumo:
The purpose of the study was to determine the degree of relationships among GRE scores, undergraduate GPA (UGPA), and success in graduate school, as measured by first year graduate GPA (FGPA), cumulative graduate GPA, and degree attainment status. A second aim of the study was to determine whether the relationships between the composite predictor (GRE scores and UGPA) and the three success measures differed by race/ethnicity and sex. A total of 7,367 graduate student records (masters, 5,990; doctoral: 1,377) from 2000 to 2010 were used to evaluate the relationships among GRE scores, UGPA and the three success measures. Pearson’s correlation, multiple linear and logistic regression, and hierarchical multiple linear and logistic regression analyses were performed to answer the research questions. The results of the correlational analyses differed by degree level. For master’s students, the ETS proposed prediction that GRE scores are valid predictors of first year graduate GPA was supported by the findings from the present study; however, for doctoral students, the proposed prediction was only partially supported. Regression and correlational analyses indicated that UGPA was the variable that consistently predicted all three success measures for both degree levels. The hierarchical multiple linear and logistic regression analyses indicated that at master’s degree level, White students with higher GRE Quantitative Reasoning Test scores were more likely to attain a degree than Asian Americans, while International students with higher UGPA were more likely to attain a degree than White students. The relationships between the three predictors and the three success measures were not significantly different between men and women for either degree level. Findings have implications both for practice and research. They will provide graduate school administrators with institution-specific validity data for UGPA and the GRE scores, which can be referenced in making admission decisions, while they will provide empirical and professionally defensible evidence to support the current practice of using UGPA and GRE scores for admission considerations. In addition, new evidence relating to differential predictions will be useful as a resource reference for future GRE validation researchers.
Resumo:
Power distance can produce contextual effects that surpass the cultural level of analysis, allowing predicting how the assimilation of these cultural values impacts individuals motivations to attain power positions and behaviors towards authorities. Power distance value can be conceived both at a micro and macro level of analysis. However existing measures used at a cultural level have been the object of several critics, and others applied at the individual level need further study in terms of their psychometric properties. This article presents the main psychometric properties of the Earley and Erez (1997) Power Differential Scale. This scale measures the acceptability of power and status differences both at micro and macro level. Two studies analyse the scale’s construct validity and its factorial invariance across groups of participants (Study 1); and its predictive validity at an individual level (Study 2). The results obtained support the proposed unidimensionality of the scale. Furthermore, it demonstrated predictive power by showing the role of power distance in the prediction of individual motivations to attain power and to respond to power situations using withdrawal or confrontational strategies. Future research is discussed, specifically the impact of power differential construct in individual attitudes and behavior.
Resumo:
BACKGROUND: The Life-Space Assessment (LSA), developed in the USA, is an instrument focusing on mobility with respect to reaching different areas defined as life-spaces, extending from the room where the person sleeps to mobility outside one's hometown. A newly translated Swedish version of the LSA (LSA-S) has been tested for test-retest reliability, but the validity remains to be tested. The purpose of the present study was to examine the concurrent validity of the LSA-S, by comparing and correlating the LSA scores to other measures of mobility. METHOD: The LSA was included in a population-based study of health, functioning and mobility among older persons in Sweden, and the present analysis comprised 312 community-dwelling participants. To test the concurrent validity, the LSA scores were compared to a number of other mobility-related variables, including the Short Physical Performance Battery (SPPB) as well as "stair climbing", "transfers", "transportation", "food shopping", "travel for pleasure" and "community activities". The LSA total mean scores for different levels of the other mobility-related variables, and measures of correlation were calculated. RESULTS: Higher LSA total mean scores were observed with higher levels of all the other mobility related variables. Most of the correlations between the LSA and the other mobility variables were large (r = 0.5-1.0) and significant at the 0.01 level. The LSA total score, as well as independent life-space and assistive life-space correlated with transportation (0.63, 0.66, 0.64) and food shopping (0.55, 0.58, 0.55). Assistive life-space also correlated with SPPB (0.47). With respect to maximal life-space, the correlations with the mobility-related variables were generally lower (below 0.5), probably since this aspect of life-space mobility is highly influenced by social support and is not so dependent on the individual's own physical function. CONCLUSION: LSA was shown to be a valid measure of mobility when using the LSA total, independent LS or assistive LSA.
Resumo:
Regular physical activity (PA) decreases mortality risk in survivors of breast and colorectal cancer. Such impacts of exercise have prompted initiatives designed both to promote and adequately monitor PA in cancer survivors. This study examines the validity of 2 widely used self-report methods for PA determination, the International Physical Activity Questionnaire short version (IPAQ-SF) and Global Physical Activity Questionnaire (GPAQ). Both instruments were compared with the triaxial accelerometry (Actigraph) method as an objective reference standard. Study participants were 204 cancer survivors (both sexes, aged 18-79 years). Compared with accelerometry, both questionnaires significantly overestimated PA levels (across all intensities) and underestimated physical inactivity levels. No differences were detected between the 2 questionnaires except for a shorter inactivity time estimated by GPAQ (p=0.001). The Bland and Altman method confirmed that both questionnaires overestimated all PA levels. Receiver operating characteristic (ROC) analysis classified IPAQ and GPAQ as fair and poor predictors, respectively, of the proportions of survivors fulfilling international PA recommendations (≥150 min·week-1 of moderate-vigorous PA). IPAQ-SF showed a higher sensitivity but lower specificity than GPAQ. Our data do not support the use of IPAQ-SF or GPAQ to determine PA or inactivity levels in cancer survivors.
Resumo:
Background: To implement appropriate programs for promoting physical activity (PA) in people who are Deaf, it is important to have valid instruments for assessing PA in this population. Objective: The main purpose of this study was to examine the criterion validity of the short form of the International Physical Activity Questionnaire (IPAQ-S) in Deaf adults. Method: This study included 44 adults (18e65 years) of both genders (63.6% were females) who met the inclusion criteria. Objective measures of PAwere collected using accelerometers, which were worn by each participant during one week. After using the accelerometer, the IPAQ-S was applied to assess participants’ physical activity during the last 7 days. Results: There was no significant correlation between the average time spent in moderate to vigorous physical activity (MVPA) as measured by the accelerometer (40.1 6 24.5 min/day) and by the IPAQ-S (41.3 6 57.5 min/day). The IPAQ-S significantly underestimated the time spent in sedentary behavior (7.6 6 2.7 h/day vs. 10.1 6 1.6 h/day). Sedentary behavior and MVPA as measured by the accelerometer and the IPAQ-S showed limited agreement. Conclusions: Our results show some limitations on the use of IPAQ-S for quantifying PA among adults who are Deaf. The IPAQ-S tends to overestimate the MVPA and to underestimate sedentary behavior in adults who are Deaf.
Resumo:
Este estudo teve como objetivo realizar a adaptação cultural do The Environmental Stressor Questionnaire - (ESQ) para a língua portuguesa do Brasil e verificar sua confiabilidade e validade. Foram empregadas as etapas metodológicas recomendadas pela literatura para adaptação cultural. A versão brasileira do ESQ foi aplicada a 106 pacientes de Unidade de Terapia Intensiva (UTI) de dois hospitais, público e privado, do interior do Estado de São Paulo. A confiabilidade foi avaliada quanto à consistência interna e estabilidade (teste e reteste); a validade convergente foi verificada por meio da correlação entre o ESQ e questão genérica sobre estresse em UTI. A confiabilidade foi satisfatória com Alfa de Crombach=0,94 e Coeficiente de Correlação Intraclasse=0,861 (IC95% 0,723; 0,933). Constatou-se correlação entre o escore total do ESQ e a questão genérica sobre estresse (r=0,70), confirmando a validade convergente. A versão brasileira do ESQ mostrou-se uma ferramenta confiável e válida para avaliação de estressores em UTI.
Resumo:
OBJETIVO: Comparar duas abordagens baseadas em critérios do Quality Assessment of Diagnostic Accuracy Studies (QUADAS) e do Standards for Reporting Studies of Diagnostic Accuracy (STARD) na avaliação de qualidade de estudos de validação do teste rápido OptiMal®, para diagnóstico de malária. MÉTODOS: Foi realizada busca de artigos de validação do teste rápido na base bibliográfica Medline acessada pelo PubMed, no ano de 2007. Treze artigos foram recuperados na busca. Foram combinados 12 critérios do QUADAS e três do STARD para comparação com os critérios do QUADAS isoladamente. Foi considerado que artigos de regular a boa qualidade atenderiam pelo menos 50% dos critérios do QUADAS. RESULTADOS: Dos 13 artigos recuperados, 12 cumpriram pelo menos 50% dos critérios do QUADAS, e apenas dois atenderam à combinação dos critérios. Considerando-se a combinação dos dois critérios (> 6 QUADAS e > 3STARD), dois estudos (15,4%) apresentaram boa qualidade metodológica. A seleção de artigos usando a combinação proposta variou de dois a oito artigos, dependendo do número de itens considerados como ponto de corte. CONCLUSÕES: A combinação do QUADAS com o STARD tem o potencial de conferir maior rigor nas avaliações da qualidade de artigos publicados sobre validação de testes diagnósticos em malária, por incorporar a checagem de informações relevantes não alcançáveis pelo uso do QUADAS isoladamente.
Resumo:
OBJETIVO: Avaliar a validade e a confiabilidade da versão brasileira de índice de capacidade para o trabalho. MÉTODOS:Estudo transversal com amostra de 475 trabalhadores de empresa do setor elétrico no estado de São Paulo (dez municípios em Campinas e região), realizado em 2005. Foram avaliados os seguintes aspectos da versão brasileira do Índice de Capacidade para o Trabalho: validade de construto, por meio de análise fatorial confirmatória e da capacidade discriminante; validade de critério, correlacionado o escore do índice com medidas de saúde auto-referidas; e confiabilidade, por meio da análise da consistência interna utilizando o coeficiente alfa de Cronbach. RESULTADOS: A análise fatorial indicou três fatores do construto capacidade para o trabalho: questões relativas aos "recursos mentais" (20,6% da variância), à autopercepção da capacidade para o trabalho (18,9% da variância) e à presença de doenças e limitações decorrentes do estado de saúde (18,4% da variância). O índice discriminou os trabalhadores segundo nível de absenteísmo, identificando média estatisticamente significativa (p<0,001) entre aqueles com absenteísmo elevado (37,2 pontos) e baixo (42,3 pontos). A análise de critério mostrou correlação do índice com todas as dimensões do estado de saúde analisadas (p<0,0001). O índice apresentou boa confiabilidade com coeficiente alfa de Cronbach (0,72). CONCLUSÕES: A versão brasileira do Índice de Capacidade para o Trabalho mostrou propriedades psicométricas satisfatórias quanto à validade de construto, de critério e de confiabilidade, representando uma opção adequada para avaliação da capacidade para o trabalho em abordagens individuais e inquéritos populacionais.
Resumo:
OBJETIVO:Validar escala de insatisfação corporal para adolescentes. MÉTODOS: Participaram do estudo 386 adolescentes, de ambos os sexos, entre dez e 17 anos de idade, de uma escola particular de ensino fundamental e médio, de São Bernardo do Campo, SP, em 2006. Foram realizadas tradução e adaptação cultural da "Escala de Evaluación de Insatisfación Corporal para Adolescentes" para o português. Foram avaliadas consistência interna por meio do coeficiente alfa de Cronbach, análise fatorial pelo método Varimax e validade discriminante pelas diferenças entre médias de estado nutricional, utilizando-se o teste de Kruskal-Wallis. Na validação concorrente, calculou-se o coeficiente de correlação de Spearman entre a escala e o índice de massa corporal, a razão circunferência quadril e a circunferência da cintura. Para reprodutibilidade, foram utilizados o teste de Wilcoxon, o coeficiente de correlação intra-classe. RESULTADOS: A escala traduzida não apresentou discordâncias significativas com a original. A escala apresentou consistência interna satisfatória para todos os subgrupos estudados (fases inicial e intermediária de adolescência, ambos os sexos) e foi capaz de discriminar os adolescentes segundo o estado nutricional. Na análise concorrente, as três medidas corporais foram correlacionadas, exceto adolescentes do sexo masculino em fase inicial, e sua reprodutibilidade foi confirmada. CONCLUSÕES: A Escala de Avaliação da Insatisfação Corporal para Adolescentes está traduzida e adaptada para o português e apresentou resultados satisfatórios, sendo recomendada para avaliação do aspecto atitudinal da imagem corporal de adolescentes.
Resumo:
OBJETIVO: Avaliar a reprodutibilidade e a validade de indicadores de atividade física e sedentarismo, obtidos por sistema de vigilância baseado em inquéritos telefônicos. MÉTODOS: Foram realizadas análises de reprodutibilidade e validade em duas subamostras aleatórias (n=110 e n=111, respectivamente) da amostra total (N=2.024) de adultos (>18 anos), estudada pelo sistema, no município de São Paulo, em 2005. Os indicadores avaliados incluíram a freqüência de "suficientemente ativos no lazer", "inativos em quatro domínios da atividade física (lazer, trabalho, transporte e atividades domésticas)" e "ver televisão por longos períodos". A reprodutibilidade foi estudada comparando-se resultados obtidos a partir da entrevista telefônica original do sistema e de outra entrevista idêntica repetida após sete a 15 dias e feita por entrevistador diferente do que fez a entrevista original. A validade foi estudada comparando-se resultados obtidos a partir da entrevista telefônica original e de três recordatórios de 24 horas (método de referência) realizados na semana seguinte à entrevista original. RESULTADOS: A freqüência dos três indicadores avaliados foi idêntica ou muito próxima entre a primeira e a segunda entrevistas telefônicas, e os coeficientes kappa se situaram entre 0,53 e 0,80, indicando boa reprodutibilidade de todos os indicadores. Relativamente ao método de referência, evidenciou-se especificidade de 80% ou mais para os três indicadores e sensibilidade de 69,7% para "ver televisão por longos períodos", 59,1% para "inativos em quatro domínios" e 50% para "suficientemente ativos no lazer". CONCLUSÕES: Os indicadores de atividade física e sedentarismo empregados pelo sistema aparentam ser reprodutíveis e suficientemente acurados. Se mantido em operação nos próximos anos, o sistema poderá oferecer ao Brasil um instrumento útil para avaliação de políticas públicas de promoção da atividade física e controle das doenças crôni
Resumo:
Existen importantes pruebas de valoración que miden habilidades o competencias motoras en el niño; a pesar de ello Colombia carece de estudios que demuestren la validez y la confiabilidad de un test de medición que permita emitir un juicio valorativo relacionado con las competencias motoras infantiles, teniendo presente que la intervención debe basarse en la rigurosidad que exigen los procesos de valoración y evaluación del movimiento corporal. Objetivo. El presente estudio se centró en determinar las propiedades psicométricas del test de competencias motoras Bruininiks Oseretsky –BOT 2- segunda edición. Materiales y métodos. Se realizó una evaluación de pruebas diagnósticas con 24 niños aparentemente sanos de ambos géneros, entre 4 y 7 años, residentes en las ciudades de Chía y Bogotá. La evaluación fue realizada por 3 evaluadores expertos; el análisis para consistencia interna se realizó utilizando el Coeficiente Alfa de Cronbach, el análisis de reproducibilidad se estableció a través del Coeficiente de Correlación Intraclase –CCI- y para el análisis de la validez concurrente se utilizó el Coeficiente de Correlación de Pearson, considerando un alfa=0.05. Resultados. Para la totalidad de las pruebas, se encontraron altos índices de confiabilidad y validez. Conclusiones. El BOT 2 es un instrumento válido y confiable, que puede ser utilizado para la evaluación e identificación del nivel de desarrollo en que se encuentran las competencias motoras en el niño.
Resumo:
Background: This study was aimed at assessing the psychometric qualities of the abbreviated versions of the Alcohol Use Disorders Identification Test (AUDIT-3, AUDIT-4, AUDIT-C, AUDIT-PC, AUDIT-QF, FAST, and Five-Shot) and at comparing them to the 10-item AUDIT and the CAGE in 2 samples of Brazilian adults. Methods: The validity and internal consistency of the scales were assessed in a sample of 530 subjects attended at an emergency department and at a Psychosocial Care Center for Alcohol and Drugs. The Structured Clinical Interview for DSM-IV was used as the diagnostic comparative measure for the predictive validity assessment. The concurrent validity between the scales was analyzed by means of Pearson`s correlation coefficient. Results: The assessment of the predictive validity of the abbreviated versions showed high sensitivity (of 0.78 to 0.96) and specificity (of 0.74 to 0.94) indices, with areas under the curve as elevated as those of the AUDIT (0.89 and 0.92 to screen for abuse and 0.93 and 0.95 in the screening of dependence). The CAGE presented lower indices: 0.81 for abuse and 0.87 for dependence. The analysis of the internal consistency of the AUDIT and its versions exhibited Cronbach`s alpha coefficients between 0.83 and 0.94, while the coefficient for the CAGE was 0.78. Significant correlations were found between the 10-item AUDIT and its versions, ranging from 0.91 to 0.99. Again, the results for the CAGE were satisfactory (0.77), although inferior to the other instruments. Conclusions: The results obtained in this study confirm the validity of the abbreviated versions of the AUDIT for the screening of alcohol use disorders and show that their psychometric properties are as satisfactory as those of the 10-item AUDIT and the CAGE.
Resumo:
We explore in depth the validity of a recently proposed scaling law for earthquake inter-event time distributions in the case of the Southern California, using the waveform cross-correlation catalog of Shearer et al. Two statistical tests are used: on the one hand, the standard two-sample Kolmogorov-Smirnov test is in agreement with the scaling of the distributions. On the other hand, the one-sample Kolmogorov-Smirnov statistic complemented with Monte Carlo simulation of the inter-event times, as done by Clauset et al., supports the validity of the gamma distribution as a simple model of the scaling function appearing on the scaling law, for rescaled inter-event times above 0.01, except for the largest data set (magnitude greater than 2). A discussion of these results is provided.
Resumo:
INTRODUCTION: Two important risk factors for abnormal neurodevelopment are preterm birth and neonatal hypoxic ischemic encephalopathy. The new revisions of Griffiths Mental Development Scale (Griffiths-II, [1996]) and the Bayley Scales of Infant Development (BSID-II, [1993]) are two of the most frequently used developmental diagnostics tests. The Griffiths-II is divided into five subscales and a global development quotient (QD), and the BSID-II is divided into two scales, the Mental scale (MDI) and the Psychomotor scale (PDI). The main objective of this research was to establish the extent to which developmental diagnoses obtained using the new revisions of these two tests are comparable for a given child. MATERIAL AND METHODS: Retrospective study of 18-months-old high-risk children examined with both tests in the follow-up Unit of the Clinic of Neonatology of our tertiary care university Hospital between 2011 and 2012. To determine the concurrent validity of the two tests paired t-tests and Pearson product-moment correlation coefficients were computed. Using the BSID-II as a gold standard, the performance of the Griffiths-II was analyzed with receiver operating curves. RESULTS: 61 patients (80.3% preterm, 14.7% neonatal asphyxia) were examined. For the BSID-II the MDI mean was 96.21 (range 67-133) and the PDI mean was 87.72 (range 49-114). For the Griffiths-II, the QD mean was 96.95 (range 60-124), the locomotors subscale mean was 92.57 (range 49-119). The score of the Griffiths locomotors subscale was significantly higher than the PDI (p<0.001). Between the Griffiths-II QD and the BSID-II MDI no significant difference was found, and the area under the curve was 0.93, showing good validity. All correlations were high and significant with a Pearson product-moment correlation coefficient >0.8. CONCLUSIONS: The meaning of the results for a given child was the same for the two tests. Two scores were interchangeable, the Griffiths-II QD and the BSID-II MDI.
Resumo:
In the analysis of instrumented indentation data, it is common practice to incorporate the combined moduli of the indenter (E-i) and the specimen (E) in the so-called reduced modulus (E-r) to account for indenter deformation. Although indenter systems with rigid or elastic tips are considered as equivalent if E-r is the same, the validity of this practice has been questioned over the years. The present work uses systematic finite element simulations to examine the role of the elastic deformation of the indenter tip in instrumented indentation measurements and the validity of the concept of the reduced modulus in conical and pyramidal (Berkovich) indentations. It is found that the apical angle increases as a result of the indenter deformation, which influences in the analysis of the results. Based upon the inaccuracies introduced by the reduced modulus approximation in the analysis of the unloading segment of instrumented indentation applied load (P)-penetration depth (delta) curves, a detailed examination is then conducted on the role of indenter deformation upon the dimensionless functions describing the loading stages of such curves. Consequences of the present results in the extraction of the uniaxial stress-strain characteristics of the indented material through such dimensional analyses are finally illustrated. It is found that large overestimations in the assessment of the strain hardening behavior result by neglecting tip compliance. Guidelines are given in the paper to reduce such overestimations.