11 resultados para accumulative test item

em Deakin Research Online - Australia


Relevância:

100.00% 100.00%

Publicador:

Resumo:

A key test used in Australia to assess the mathematical knowledge of young children uses illustrations of objects such as coins and three-dimensional shapes. This study explored the effects of giving 104 kindergarten children, aged 4-5 years, the questions with either moveable objects or illustrations. It
was found that children who were categorized by their teachers as having “higher levels of numeracy” scored well on test questions using either illustrations or objects, while children who were categorized as having “lower levels of numeracy” scored higher with objects than with illustrations. This result could have implications for consideration of test item readability in relation to graphicacy.

Relevância:

80.00% 80.00%

Publicador:

Resumo:

The overarching goal of this dissertation was to evaluate the contextual components of instructional strategies for the acquisition of complex programming concepts. A meta-knowledge processing model is proposed, on the basis of the research findings, thereby facilitating the selection of media treatment for electronic courseware. When implemented, this model extends the work of Smith (1998), as a front-end methodology, for his glass-box interpreter called Bradman, for teaching novice programmers. Technology now provides the means to produce individualized instructional packages with relative ease. Multimedia and Web courseware development accentuate a highly graphical (or visual) approach to instructional formats. Typically, little consideration is given to the effectiveness of screen-based visual stimuli, and curiously, students are expected to be visually literate, despite the complexity of human-computer interaction. Visual literacy is much harder for some people to acquire than for others! (see Chapter Four: Conditions-of-the-Learner) An innovative research programme was devised to investigate the interactive effect of instructional strategies, enhanced with text-plus-textual metaphors or text-plus-graphical metaphors, and cognitive style, on the acquisition of a special category of abstract (process) programming concept. This type of concept was chosen to focus on the role of analogic knowledge involved in computer programming. The results are discussed within the context of the internal/external exchange process, drawing on Ritchey's (1980) concepts of within-item and between-item encoding elaborations. The methodology developed for the doctoral project integrates earlier research knowledge in a novel, interdisciplinary, conceptual framework, including: from instructional science in the USA, for the concept learning models; British cognitive psychology and human memory research, for defining the cognitive style construct; and Australian educational research, to provide the measurement tools for instructional outcomes. The experimental design consisted of a screening test to determine cognitive style, a pretest to determine prior domain knowledge in abstract programming knowledge elements, the instruction period, and a post-test to measure improved performance. This research design provides a three-level discovery process to articulate: 1) the fusion of strategic knowledge required by the novice learner for dealing with contexts within instructional strategies 2) acquisition of knowledge using measurable instructional outcome and learner characteristics 3) knowledge of the innate environmental factors which influence the instructional outcomes This research has successfully identified the interactive effect of instructional strategy, within an individual's cognitive style construct, in their acquisition of complex programming concepts. However, the significance of the three-level discovery process lies in the scope of the methodology to inform the design of a meta-knowledge processing model for instructional science. Firstly, the British cognitive style testing procedure, is a low cost, user friendly, computer application that effectively measures an individual's position on the two cognitive style continua (Riding & Cheema,1991). Secondly, the QUEST Interactive Test Analysis System (Izard,1995), allows for a probabilistic determination of an individual's knowledge level, relative to other participants, and relative to test-item difficulties. Test-items can be related to skill levels, and consequently, can be used by instructional scientists to measure knowledge acquisition. Finally, an Effect Size Analysis (Cohen,1977) allows for a direct comparison between treatment groups, giving a statistical measurement of how large an effect the independent variables have on the dependent outcomes. Combined with QUEST's hierarchical positioning of participants, this tool can assist in identifying preferred learning conditions for the evaluation of treatment groups. By combining these three assessment analysis tools into instructional research, a computerized learning shell, customised for individuals' cognitive constructs can be created (McKay & Garner,1999). While this approach has widespread application, individual researchers/trainers would nonetheless, need to validate with an extensive pilot study programme (McKay,1999a; McKay,1999b), the interactive effects within their specific learning domain. Furthermore, the instructional material does not need to be limited to a textual/graphical comparison, but could be applied to any two or more instructional treatments of any kind. For instance: a structured versus exploratory strategy. The possibilities and combinations are believed to be endless, provided the focus is maintained on linking of the front-end identification of cognitive style with an improved performance outcome. My in-depth analysis provides a better understanding of the interactive effects of the cognitive style construct and instructional format on the acquisition of abstract concepts, involving spatial relations and logical reasoning. In providing the basis for a meta-knowledge processing model, this research is expected to be of interest to educators, cognitive psychologists, communications engineers and computer scientists specialising in computer-human interactions.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

This study empirically investigated consumer involvement with a product class. Data was collected from 178 vehicle buyers. Reliability and factor analyses investigated the structure of the Bloch (1981) instrument and the dimensions underlying involvement. In terms of replication, the results suggest the reduced-item version of the instrument previously proposed by Shimp and Sharma (1983) is reliable and is a less excessive measurement instrument. Similar dimensions underlying involvement with the product class are reported here. The study extends previous work by obtaining similar results in a different cultural setting, producing findings from a more relevant sample, applying an additional method of data collection, and suggesting that the underlying dimensions may be temporally stable.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Purpose : Which functional tests on mobility and balance can better screen older people at risk of falls is unclear. This study aims to compare the Berg Balance Scale (BBS), Tinetti Mobility Score (TMS), Elderly Mobility Scale (EMS) and Timed Up and Go test (TUG) in discriminating fallers from non-fallers in older people.
Method : This was a case-control study involving one rater who conducted a mobility and balance assessment on subjects using the four functional tests in random sequence. Subjects recruited included 17 and 22 older people with a history of single and multiple falls respectively from a public Falls Clinic, and 39 community-dwellers without fall history and whose age, sex and BMI matched those of the fallers. All subjects underwent the mobility and balance assessment within one day.
Results : Single fallers performed better than multiple fallers in all four functional tests but were worse than non-fallers in the BBS, TMS and TUG. The BBS demonstrated the best discriminating ability, with high sensitivity and specificity. The BBS item 'pick up an object from the floor' was the best at screening fallers.
Conclusion : BBS was the most powerful functional test of the four in discriminating fallers from non-faller.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

There are limited practical tools to help clinicians or public health workers manage obesity in their patients. We have previously developed a scanning technique for diagnosing environments leading to obesity (Analysis Grid for Environments/Elements Leading to Obesity). Here we describe the development of a tool for identifying behaviours in an individual most likely to lead to obesity. A questionnaire battery of five tests called the DAB-Q (Diet, Activity and Behaviour Questionnaire) was developed, piloted and internally validated with overweight women from a commercial weight loss programme. Outcome from the tests, which are available free on the Internet, provides clinicians with a simple, effective and time-saving tool for ranking foods, drinks and activities likely to be most effectively targeted for weight loss in an individual. This is based on total scores derived from measures of frequency, potential for change and potency of each item as a potential contributor to overweight.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objective: This study aimed to test the validity of the 21-item Depression Anxiety Stress Scales (DASS-21) as a routine clinical outcome measure in the private in-patient setting. We hypothesized that it would be a suitable routine outcome instrument in this setting.

Method: All in-patients treated at a private psychiatric hospital over a period of 24 months were included in the study. Data were collected on demographics, service utilization, diagnosis and a set of four routine measures both at admission and discharge. These measures consisted of the Clinical Global Impressions (CGI) scales, Health of the Nation Outcome Scales (HoNOS), the Mental Health Questionnaire (MHQ-14) and DASS-21. The results of these measures were compared.

Results: Of 786 admissions in total, the number of fully completed (ie paired admission and discharge) data sets for the DASS-21 depression, anxiety and stress subscales were 337, 328 and 347, respectively. All subscales showed statistically significant reductions in mean scores from admission to discharge (P < 0.001) and were significantly correlated with all MHQ-14 subscales and significantly related to CGI scale categories. The total DASS-21 and total HoNOS scores were also significantly correlated.

Conclusions: The findings from the present study support the validity of DASS-21 as a routine clinical outcome measure in the private in-patient setting.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Sixty-six English-speaking postgraduate distance-education medical students completed the Learning Styles Questionnaire (LSQ: 40-item version). This was completed while attending a residential workshop at the beginning of the semester, and 44 of these students completed the same LSQ questionnaire 5 months later at the completion of the semester. The psychometric properties of the LSQ were assessed using Cronbach’s alpha (internal consistency), test-retest, correlational analyses and factor analysis. The results indicated that the LSQ (40-item version) has poor reliability and validity, and therefore requires further development and psychometric evaluation.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objectives: The aims of this study were to develop Taiwan's Child Health Literacy Test and to undertake a nation-wide survey in order to determine the current status of Taiwanese sixth graders' health literacy, and to understand the association between health literacy, healthy behavior, and health status. absp Methods: Taiwan's Child Health Literacy Test was developed through the process of concept clarification, a qualitative pilot, a development pilot, and a field test. In the field test, 162,609 sixth graders (56.9%) from 2,235 schools (83.3%) nationwide completed the questionnaire. We also collected the students' dates of birth, BMIs, self-reported health and healthy behaviors. absp Results: The final test consisted of 32 questions with item discrimination of 0.55-1.89 and item difficulty of-1.7-0.41 according to IRT; Cronbach's a was 0.87. Based on this information, the test was deemed appropriate for basic health literacy screening among children. Nation-wide, the average score for sixth graders' health literacy was 23.97 points (total score 32 points), with a correct rate of 74.9%. Those who were "good" in self-reported health scored highest in health literacy (M = 24.29). Health literacy was significantly positively related to healthy behavior (r = .25, p< .05), and negatively to risky behavior (r =-.28, p< .05). absp Conclusions: This study was the first curriculum-based child health literacy test developed from the viewpoints of both teachers and pupils in Taiwan through a rigorous procedure. The nationwide survey results may serve as a reference for decision-makers at the national health education level.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Purpose: Assessing health-related quality of life (HRQoL) via Computerized Adaptive Tests (CAT) provides greater measurement precision coupled with a lower test burden compared to conventional tests. Currently, there are no European pediatric HRQoL CATs available. This manuscript aims at describing the development of a HRQoL CAT for children and adolescents: the Kids-CAT, which was developed based on the established KIDSCREEN-27 HRQoL domain structure. Methods: The Kids-CAT was developed combining classical test theory and item response theory methods and using large archival data of European KIDSCREEN norm studies (n = 10,577–19,580). Methods were applied in line with the US PROMIS project. Item bank development included the investigation of unidimensionality, local independence, exploration of Differential Item Functioning (DIF), evaluation of Item Response Curves (IRCs), estimation and norming of item parameters as well as first CAT simulations. Results: The Kids-CAT was successfully built covering five item banks (with 26–46 items each) to measure physical well-being, psychological well-being, parent relations, social support and peers, and school well-being. The Kids-CAT item banks proved excellent psychometric properties: high content validity, unidimensionality, local independence, low DIF, and model conform IRCs. In CAT simulations, seven items were needed to achieve a measurement precision between.8 and.9 (reliability). It has a child-friendly design, is easy accessible online and gives immediate feedback reports of scores. Conclusions: The Kids-CAT has the potential to advance pediatric HRQoL measurement by making it less burdensome and enhancing the patient–doctor communication.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Abstract
Purpose
Assessing health-related quality of life (HRQoL) via Computerized Adaptive Tests (CAT) provides greater measurement precision coupled with a lower test burden compared to conventional tests. Currently, there are no European pediatric HRQoL CATs available. This manuscript aims at describing the development of a HRQoL CAT for children and adolescents: the Kids-CAT, which was developed based on the established KIDSCREEN-27 HRQoL domain structure.
Methods
The Kids-CAT was developed combining classical test theory and item response theory methods and using large archival data of European KIDSCREEN norm studies (n=10,577–19,580). Methods were applied in line with the US PROMIS project. Item bank development included the investigation of unidimensionality, local independence, exploration of Differential Item Functioning (DIF), evaluation of Item Response Curves (IRCs), estimation and norming of item parameters as well as first CAT simulations.
Results
The Kids-CAT was successfully built covering five item banks (with 26–46 items each) to measure physical well-being, psychological well-being, parent relations, social support and peers, and school well-being. The Kids-CAT item banks proved excellent psychometric properties: high content validity, unidimensionality, local independence, low DIF, and model conform IRCs. In CAT simulations, seven items were needed to achieve a measurement precision between .8 and .9 (reliability). It has a child-friendly design, is easy accessible online and gives immediate feedback reports of scores.
Conclusions
The Kids-CAT has the potential to advance pediatric HRQoL measurement by making it less burdensome and enhancing the patient–doctor communication.

Relevância:

30.00% 30.00%

Publicador:

Resumo:

Objective: Significant life events such as severe health status changes or intensive medical treatment often trigger response shifts in individuals that may hamper the comparison of measurements over time. Drawing from the Oort model, this study aims at detecting response shift at the item level in psychosomatic inpatients and evaluating its impact on the validity of comparing repeated measurements. Study design and setting: Complete pretest and posttest data were available from 1188 patients who had filled out the ICD-10 Symptom Rating (ISR) scale at admission and discharge, on average 24 days after intake. Reconceptualization, reprioritization, and recalibration response shifts were explored applying tests of measurement invariance. In the item-level approach, all model parameters were constrained to be equal between pretest and posttest. If non-invariance was detected, these were linked to the different types of response shift. Results: When constraining across-occasion model parameters, model fit worsened as indicated by a significant Satorra–Bentler Chi-square difference test suggesting potential presence of response shifts. A close examination revealed presence of two types of response shift, i.e., (non)uniform recalibration and both higher- and lower-level reconceptualization response shifts leading to four model adjustments. Conclusions: Our analyses suggest that psychosomatic inpatients experienced some response shifts during their hospital stay. According to the hierarchy of measurement invariance, however, only one of the detected non-invariances is critical for unbiased mean comparisons over time, which did not have a substantial impact on estimating change. Hence, the use of the ISR can be recommended for outcomes assessment in clinical routine, as change score estimates do not seem hampered by response shift effects.