988 results for test score
Abstract:
We re-examine the theoretical concept of a production function for cognitive achievement, and argue that an indirect production function that depends upon the variables that constrain parents' choices is both more tractable from an econometric point of view and more interesting from an economic point of view than a direct production function that depends upon a detailed list of direct inputs, such as the number of books in the household. We estimate flexible econometric models of indirect production functions for two achievement measures from the Woodcock-Johnson Revised battery, using data from two waves of the Child Development Supplement to the PSID. Elasticities of achievement measures with respect to family income and parents' educational levels are positive and significant. Gaps between scores of black and white children narrow or remain constant as children grow older, a result that differs from previous findings in the literature. The elasticities of achievement scores with respect to family income are substantially higher for children of black families, and there are some notable differences in elasticities with respect to parents' educational levels across blacks and whites.
Abstract:
We propose a novel method for scoring the accuracy of protein binding site predictions – the Binding-site Distance Test (BDT) score. Recently, the Matthews Correlation Coefficient (MCC) has been used to evaluate binding site predictions, both by developers of new methods and by the assessors for the community-wide prediction experiment – CASP8. Whilst being a rigorous scoring method, the MCC does not take into account the actual 3D location of the predicted residues relative to the observed binding site. Thus, an incorrectly predicted site that is nevertheless close to the observed binding site will obtain an identical score to the same number of nonbinding residues predicted at random. The MCC is somewhat affected by the subjectivity of determining observed binding residues and the ambiguity of choosing distance cutoffs. By contrast, the BDT method produces continuous scores ranging between 0 and 1, relating to the distance between the predicted and observed residues. Residues predicted close to the binding site will score higher than those more distant, providing a better reflection of the true accuracy of predictions. The CASP8 function predictions were evaluated using both the MCC and BDT methods and the scores were compared. The BDT was found to strongly correlate with the MCC scores whilst also being less susceptible to the subjectivity of defining binding residues. We therefore suggest that this new simple score is a potentially more robust method for future evaluations of protein-ligand binding site predictions.
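The point that the MCC ignores 3D location can be illustrated with a minimal sketch. It uses the standard MCC formula on confusion-matrix counts; the residue numbers, the protein length, and the idea that one prediction lies "near" the site are hypothetical and chosen only for illustration. The BDT formula itself is not given in the abstract, so it is not reproduced here.

```python
import math


def confusion(predicted: set, observed: set, n_residues: int):
    """Count TP, TN, FP, FN for a set of predicted binding residues."""
    tp = len(predicted & observed)
    fp = len(predicted - observed)
    fn = len(observed - predicted)
    tn = n_residues - tp - fp - fn
    return tp, tn, fp, fn


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews Correlation Coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # common convention when a row or column sum is zero
    return (tp * tn - fp * fn) / denom


# Hypothetical 100-residue protein with four observed binding residues.
observed = {10, 11, 12, 13}
near_miss = {14, 15, 16, 17}  # assumed to sit right next to the site
far_miss = {80, 81, 82, 83}   # assumed to sit far from the site

near_score = mcc(*confusion(near_miss, observed, 100))
far_score = mcc(*confusion(far_miss, observed, 100))
print(near_score, far_score)  # the two values are identical
```

Because both predictions share the same counts (no true positives, four false positives, four false negatives), the MCC cannot tell them apart, which is the limitation a distance-aware score is meant to address.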
Abstract:
This paper presents an efficient construction algorithm for obtaining sparse kernel density estimates based on a regression approach that directly optimizes model generalization capability. Computational efficiency of the density construction is ensured using an orthogonal forward regression, and the algorithm incrementally minimizes the leave-one-out test score. A local regularization method is incorporated naturally into the density construction process to further enforce sparsity. An additional advantage of the proposed algorithm is that it is fully automatic and the user is not required to specify any criterion to terminate the density construction procedure. This is in contrast to an existing state-of-the-art kernel density estimation method using the support vector machine (SVM), where the user is required to specify some critical algorithm parameter. Several examples are included to demonstrate the ability of the proposed algorithm to effectively construct a very sparse kernel density estimate with comparable accuracy to that of the full sample optimized Parzen window density estimate. Our experimental results also demonstrate that the proposed algorithm compares favorably with the SVM method, in terms of both test accuracy and sparsity, for constructing kernel density estimates.
Abstract:
In this paper we test whether the disclosure of test scores has direct impacts on student performance, school composition and school inputs. We take advantage of the discontinuity in the disclosure rules of the National Secondary Education Examination (ENEM), run in Brazil by the Ministry of Education: in 2006 it was established that the 2005 mean score results would be disclosed for schools with ten or more students who took the exam in the previous year. We use a regression discontinuity design to estimate the effects of test disclosure. Our results indicate that private schools that had their average scores released in 2005 outperformed those that did not by 0.2-0.6 in 2007. We did not find the same results for public schools. Moreover, we did not find evidence that treated schools adjusted their inputs or that there were major changes in the student composition of treated schools. These findings allow us to interpret that the main mechanism driving the differences in performance was the increased levels of students', teachers' and principals' effort exerted by those in schools that had scores publicized.
Abstract:
BACKGROUND A single non-invasive gene expression profiling (GEP) test (AlloMap®) is often used to discriminate if a heart transplant recipient is at a low risk of acute cellular rejection at time of testing. In a randomized trial, use of the test (a GEP score from 0-40) has been shown to be non-inferior to a routine endomyocardial biopsy for surveillance after heart transplantation in selected low-risk patients with respect to clinical outcomes. Recently, it was suggested that the within-patient variability of consecutive GEP scores may be used to independently predict future clinical events; however, future studies were recommended. Here we performed an analysis of an independent patient population to determine the prognostic utility of within-patient variability of GEP scores in predicting future clinical events. METHODS We defined the GEP score variability as the standard deviation of four GEP scores collected ≥315 days post-transplantation. Of the 737 patients from the Cardiac Allograft Rejection Gene Expression Observational (CARGO) II trial, 36 were assigned to the composite event group (death, re-transplantation or graft failure ≥315 days post-transplantation and within 3 years of the final GEP test) and 55 were assigned to the control group (non-event patients). In this case-control study, the performance of GEP score variability to predict future events was evaluated by the area under the receiver operating characteristic curve (AUC ROC). The negative predictive values (NPV) and positive predictive values (PPV) including 95 % confidence intervals (CI) of GEP score variability were calculated. RESULTS The estimated prevalence of events was 17 %. Events occurred at a median of 391 (inter-quartile range 376) days after the final GEP test. The GEP variability AUC ROC for the prediction of a composite event was 0.72 (95 % CI 0.6-0.8).
The NPV for GEP score variability of 0.6 was 97 % (95 % CI 91.4-100.0); the PPV for GEP score variability of 1.5 was 35.4 % (95 % CI 13.5-75.8). CONCLUSION In heart transplant recipients, a GEP score variability may be used to predict the probability that a composite event will occur within 3 years after the last GEP score. TRIAL REGISTRATION Clinicaltrials.gov identifier NCT00761787.
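The variability measure defined in the METHODS (the standard deviation of four GEP scores) can be sketched as follows. The abstract does not say whether the sample or population form was used; the sample standard deviation (n - 1 denominator) is an assumption here, and the function name and the scores are hypothetical.

```python
import statistics


def gep_score_variability(scores):
    """Standard deviation of exactly four GEP scores for one patient.

    Assumption: sample standard deviation (n - 1 denominator); the
    abstract does not specify which form the authors used.
    """
    if len(scores) != 4:
        raise ValueError("expected exactly four GEP scores")
    return statistics.stdev(scores)


# Hypothetical patients, with scores on the 0-40 scale.
stable_patient = [30, 31, 30, 31]
variable_patient = [28, 30, 32, 34]

print(gep_score_variability(stable_patient))
print(gep_score_variability(variable_patient))
```

Under this assumption the first patient falls below the 0.6 variability level quoted for the NPV and the second falls above the 1.5 level quoted for the PPV.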
Abstract:
Previous research found personality test scores to be inflated on average among individuals who were motivated to present themselves in a desirable fashion in high-stakes situations, such as during the employee selection process. One apparently effective way to reduce the undesirable test score inflation in such situations was to warn participants against faking. This research set out to investigate whether warning against faking would indeed affect personality test scores in the theoretically expected fashion. Contrary to expectations, the results did not support the hypothesized causal chain. Results across three studies show that while a warning may lower test scores in participants motivated to respond desirably (i.e., to fake), the effect of warning on test scores was not fully mediated by a reduction in motivation to do well and self-reports of exaggerated responses in the personality test. Theoretical and practical implications are discussed.
Abstract:
This paper argues that low-stakes test scores, available in surveys, may be partially determined by test-taking motivation, which is associated with personality traits but not with cognitive ability. Therefore, such test score distributions may not be informative regarding cognitive ability distributions. Moreover, correlations, found in survey data, between high test scores and economic success may be partially caused by favorable personality traits. To demonstrate these points, I use the coding speed test that was administered without incentives to National Longitudinal Survey of Youth 1979 (NLSY) participants. I suggest that due to its simplicity its scores may especially depend on individuals' test-taking motivation. I show that, controlling for conventional measures of cognitive skills, the coding speed scores are correlated with future earnings of male NLSY participants. Moreover, the coding speed scores of a highly motivated, though less educated, population (potential enlistees in the armed forces) are higher than NLSY participants' scores. I then use controlled experiments to show that when no performance-based incentives are provided, participants' characteristics, but not their cognitive skills, affect effort invested in the coding speed test. Thus, participants with the same ability (measured by their scores on an incentivized test) have significantly different scores on tests without performance-based incentives.
Abstract:
This paper argues that low-stakes test scores, available in surveys, may be partially determined by test-taking motivation, which is associated with personality traits but not with cognitive ability. Therefore, such test score distributions may not be informative regarding cognitive ability distributions. Moreover, correlations, found in survey data, between high test scores and economic success may be partially caused by favorable personality traits. To demonstrate these points, I use the coding speed test that was administered without incentives to National Longitudinal Survey of Youth 1979 (NLSY) participants. I suggest that due to its simplicity its scores may especially depend on individuals' test-taking motivation. I show that, controlling for conventional measures of cognitive skills, the coding speed scores are correlated with future earnings of male NLSY participants. Moreover, the coding speed scores of a highly motivated, though less educated, population (potential enlistees in the armed forces) are higher than NLSY participants' scores. I then use controlled experiments to show that when no performance-based incentives are provided, participants' characteristics, but not their cognitive skills, affect effort invested in the coding speed test. Thus, participants with the same ability (measured by their scores on an incentivized test) have significantly different scores on tests without performance-based incentives.
Abstract:
Background: A test battery consisting of self-assessments and motor tests (tapping and spiral drawing) was developed for a hand computer with touch screen in a telemedicine setting. Objectives: To develop and evaluate a web-based system that delivers decision support information to the treating clinical staff for assessing PD symptoms in their patients based on the test battery data. Methods: The test battery is currently being used in a clinical trial (DAPHNE, EudraCT No. 2005-002654-21) by sixty-five patients with advanced Parkinson’s disease (PD) on 9991 test occasions (four tests per day during a total of 362 week-long test periods) at nine clinics around Sweden. Test results are sent continuously from the hand unit over a mobile network to a central computer and processed with statistical methods. They are summarized into scores for different dimensions of the symptom state and an ‘overall test score’ reflecting the overall condition of the patient during a test period. The information in the web application is organized and presented graphically so that the general overview of patient performance per test period is emphasized. Focus is on the overall test score, symptom dimensions and daily summaries. In a recent preliminary user evaluation, the web application was demonstrated to the fifteen study nurses who had used the test battery in the clinical trial. At least one patient per clinic was shown. Results: In general, the responses from nurses were positive. They claimed that the test results shown in the system were consistent with their own clinical observations. They could follow complications, changes and trends within their patients. Discussion: In conclusion, the system is able to summarise the various time series of motor test results and self-assessments during test periods and present them in a useful manner. Its main contribution is a novel and reliable way to capture and easily access symptom information from patients’ home environment.
The convenient access to the current symptom profile as well as the symptom history provides a basis for individualized evaluation and adjustment of treatments.
Abstract:
A novel test battery consisting of self-assessments and motor tests (tapping and spiral drawing) for patients with Parkinson’s disease (PD) was developed for a hand computer with touch screen in a telemedicine setting. Tests are performed four times per day in the home environment during week-long test periods. Results are processed into scores for different dimensions of the symptom state and an ‘overall score’ reflecting the global condition of a patient during a test period. The test battery was validated in a separate study recently submitted to Mov Disord. This test battery is currently being used in an open longitudinal trial (DAPHNE, EudraCT No. 2005-002654-21) by sixty-five patients with advanced PD at nine clinics around Sweden. On inclusion, the patients were either receiving treatment with duodenal levodopa/carbidopa infusion (Duodopa®) (n=36), or they were candidates for receiving this treatment (n=29). We now present interim results for the first twelve months. Test periods were performed at three-month intervals. During most of the periods, UPDRS ratings were performed in afternoons at the start of the week. In twenty of the patients, scores were available during individually optimized oral polypharmacy, before receiving infusion, and for at least one test period after having started infusion treatment. Usability and compliance with performing tests have so far been good, both with patients and clinical staff. Correlations between test periods 2 and 3 during infusion treatment (three months apart) are stronger for overall test score than for total UPDRS, indicating good reliability. The correlation between overall test score and UPDRS for all test periods is adequate (r=-0.6). In an exact Wilcoxon signed rank test, where the endpoint is the change from the first to the twelve-month test period (n=25), there was no change in test results in any of the test battery dimensions for the patients already receiving infusion when included.
However, in the patients entering the study before receiving infusion, there was a significant change (improvement) from the baseline to the twelve-month test period in the dimensions ‘off’, ‘dyskinesia’ and ‘satisfied’ and in the ‘overall score’ (n=15). The mean improvement in overall score after infusion was 29% (p=0.015). We conclude that the test battery is able to measure a functional improvement with infusion that is sustained over at least twelve months.
Abstract:
This study aims to compare a psychological evaluation test to classical psychoanalysis in infertile women. Two hundred women were submitted to the Psychological Evaluation Test (PET). The sum of the scores for the responses ranged from 15 to 60 points, with scores ≥30 points being defined as 'psycho-emotional maladjustment' (cut-off point: median + 25%). For comparison, the patients were simultaneously submitted to a psychological examination by a psychologist, who was unaware of the PET results. Of the 200 patients, 66 (33%) presented a test with ≥30 points ('psycho-emotional maladjustment') and 134 (67%) a test with <30 points (normal). Upon psychological examination, 105 (52.5%) presented an abnormal evaluation and 95 (47.5%) a normal evaluation. For the PET, statistical analysis showed 82% efficiency, 62% sensitivity, 98% positive predictive value, 99% specificity, 70% negative predictive value, likelihood ratio for a positive test result 62, and likelihood ratio for a negative test result 0.38. The PET proved to be a useful clinical instrument, being of help in the selection of patients with psychological needs induced by infertility.
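The two likelihood ratios reported for the PET follow from its sensitivity and specificity through the standard definitions LR+ = sensitivity / (1 - specificity) and LR- = (1 - sensitivity) / specificity. The sketch below applies them to the rounded percentages quoted in the abstract; the function name is hypothetical, and the authors' own calculation may have used unrounded counts.

```python
def likelihood_ratios(sensitivity: float, specificity: float):
    """Return (LR+, LR-) for a diagnostic test.

    Both inputs are proportions in (0, 1); a specificity of exactly 1
    would make LR+ undefined (division by zero).
    """
    lr_positive = sensitivity / (1.0 - specificity)
    lr_negative = (1.0 - sensitivity) / specificity
    return lr_positive, lr_negative


# Values quoted in the abstract: 62% sensitivity, 99% specificity.
lr_pos, lr_neg = likelihood_ratios(0.62, 0.99)
print(round(lr_pos), round(lr_neg, 2))  # 62 0.38
```

With these inputs the computed ratios round to the reported 62 and 0.38.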
Abstract:
BACKGROUND Driving a car is a complex instrumental activity of daily living and driving performance is very sensitive to cognitive impairment. The assessment of driving-relevant cognition in older drivers is challenging and requires reliable and valid tests with good sensitivity and specificity to predict safe driving. Driving simulators can be used to test fitness to drive. Several studies have found a strong correlation between driving simulator performance and on-the-road driving. However, access to driving simulators is restricted to specialists and simulators are too expensive, large, and complex to allow easy access to older drivers or physicians advising them. An easily accessible, Web-based, cognitive screening test could offer a solution to this problem. The World Wide Web allows easy dissemination of the test software and implementation of the scoring algorithm on a central server, which allows generation of a dynamically growing database with normative values and ensures that all users have access to the same up-to-date normative values. OBJECTIVE In this pilot study, we present the novel Web-based Bern Cognitive Screening Test (wBCST) and investigate whether it can predict poor simulated driving performance in healthy and cognitively impaired participants. METHODS The wBCST performance and simulated driving performance have been analyzed in 26 healthy younger and 44 healthy older participants as well as in 10 older participants with cognitive impairment. Correlations between the two tests were calculated. Also, simulated driving performance was used to group the participants into good performers (n=70) and poor performers (n=10). A receiver-operating characteristic analysis was calculated to determine sensitivity and specificity of the wBCST in predicting simulated driving performance.
RESULTS The mean wBCST score of the participants with poor simulated driving performance was reduced by 52%, compared to participants with good simulated driving performance (P<.001). The area under the receiver-operating characteristic curve was 0.80 with a 95% confidence interval 0.68-0.92. CONCLUSIONS When selecting a 75% test score as the cutoff, the novel test has 83% sensitivity, 70% specificity, and 81% efficiency, which are good values for a screening test. Overall, in this pilot study, the novel Web-based computer test appears to be a promising tool for supporting clinicians in fitness-to-drive assessments of older drivers. The Web-based distribution and scoring on a central computer will facilitate further evaluation of the novel test setup. We expect that in the near future, Web-based computer tests will become a valid and reliable tool for clinicians, for example, when assessing fitness to drive in older drivers.
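The area under the receiver-operating characteristic curve reported above can be read as the probability that a randomly chosen poor performer is ranked as higher risk than a randomly chosen good performer. The sketch below computes it by pairwise comparison (the Mann-Whitney formulation); all scores are hypothetical, and using the negated test score as the risk value is an assumption made because lower wBCST scores go with poor driving.

```python
def roc_auc(positive_risks, negative_risks):
    """AUC as the share of (positive, negative) pairs ranked correctly.

    Ties count as half a correct ranking.
    """
    wins = 0.0
    for p in positive_risks:
        for n in negative_risks:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_risks) * len(negative_risks))


# Hypothetical test scores in percent (not data from the study).
poor_drivers = [40, 55, 60]
good_drivers = [50, 70, 80, 90]

# Lower score means higher risk, so negate the scores before ranking.
auc = roc_auc([-s for s in poor_drivers], [-s for s in good_drivers])
print(auc)
```

The pairwise loop is quadratic in the group sizes, which is fine for a pilot-sized sample; a rank-based implementation would be preferable for large data sets.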
Abstract:
The present research represents a coherent approach to understanding the root causes of ethnic group differences in ability test performance. Two studies were conducted, each of which was designed to address a key knowledge gap in the ethnic bias literature. In Study 1, both the LR Method of Differential Item Functioning (DIF) detection and Mixture Latent Variable Modelling were used to investigate the degree to which Differential Test Functioning (DTF) could explain ethnic group test performance differences in a large, previously unpublished dataset. Though mean test score differences were observed between a number of ethnic groups, neither technique was able to identify ethnic DTF. This calls into question the practical application of DTF to understanding these group differences. Study 2 investigated whether a number of non-cognitive factors might explain ethnic group test performance differences on a variety of ability tests. Two factors – test familiarity and trait optimism – were able to explain a large proportion of ethnic group test score differences. Furthermore, test familiarity was found to mediate the relationship between socio-economic factors – particularly participant educational level and familial social status – and test performance, suggesting that test familiarity develops over time through the mechanism of exposure to ability testing in other contexts. These findings represent a substantial contribution to the field’s understanding of two key issues surrounding ethnic test performance differences. The author calls for a new line of research into these performance facilitating and debilitating factors, before recommendations are offered for practitioners to ensure fairer deployment of ability testing in high-stakes selection processes.
Abstract:
PURPOSE: To evaluate the ocular surface toxicity of two nitric oxide donors in ex vivo and in vivo animal models: S-nitrosoglutathione (GSNO) and S-nitroso-N-acetylcysteine (SNAC) in a hydroxypropyl methylcellulose (HPMC) matrix at final concentrations of 1.0 and 10.0 mM. METHODS: Ex vivo GSNO and SNAC toxicities were clinically and histologically analyzed using freshly excised pig eyeballs. In vivo experiments were performed with 20 albino rabbits which were randomized into 4 groups (5 animals each): Groups 1 and 2 received instillations of 150 µL of aqueous HPMC solution containing GSNO 1.0 and 10.0 mM, respectively, in one of the eyes; Groups 3 and 4 received instillations of 150 µL of aqueous HPMC solution containing SNAC 1.0 and 10.0 mM, respectively, in one of the eyes. The contralateral eyes in each group received aqueous HPMC as a control. All animals underwent clinical evaluation on a slit lamp and the eyes were scored according to a modified Draize eye test and were histologically analyzed. RESULTS: Pig eyeballs showed no signs of perforation, erosion, corneal opacity or other gross damage. These findings were confirmed by histological analysis. There was no difference between control and treated rabbit eyes according to the Draize eye test score in any group (p>0.05). All formulations showed a mean score under 1 and were classified as non-irritating. There was no evidence of tissue toxicity in the histological analysis in any animal. CONCLUSION: Aqueous HPMC solutions containing GSNO and SNAC at concentrations up to 10.0 mM do not induce ocular irritation.