3 resultados para high-stakes testing
em QSpace: Queen's University - Canada
Resumo:
Most essay rating research in language assessment has examined human raters’ essay rating as a cognitive process, thus overlooking or oversimplifying the interaction between raters and sociocultural contexts. Given that raters are social beings, their practices have social meanings and consequences. Hence it is important to situate essay rating within its sociocultural context for a more meaningful understanding. Drawing on Engeström’s (1987, 2001) cultural-historical activity theory (CHAT) framework with a sociocultural perspective, this study reconceptualized essay rating as a socially mediated activity with both cognitive (individual raters’ goal-directed decision-making actions) and social layers (raters’ collective object-oriented essay rating activity at related settings). In particular, this study explored raters’ essay rating at one provincial rating centre in China within the context of a high-stakes university entrance examination, the National Matriculation English Test (NMET). This study adopted a multiple-method multiple-perspective qualitative case study design. Think-aloud protocols, stimulated recalls, interviews, and documents served as the data sources. This investigation involved 25 participants at two settings (rating centre and high schools), including rating centre directors, team leaders, NMET essay raters who were high school teachers, and school principals and teaching colleagues of these essay raters. Data were analyzed using Strauss and Corbin’s (1990) open and axial coding techniques, and CHAT for data integration. The findings revealed the interaction between raters and the NMET sociocultural context. Such interaction can be understood through a surface structure (cognitive layer) and a deep structure (social layer) concerning how raters assessed NMET essays, where the surface structure reflected the “what” and the deep structure explained the “how” and “why” in raters’ decision-making. This study highlighted the roles of goals and rules in rater decision-making, rating tensions and raters’ solutions, and the relationship between essay rating and teaching. This study highlights the value of a sociocultural view to essay rating research, demonstrates CHAT as a sociocultural approach to investigate essay rating, and proposes a direction for future washback research on the effect of essay rating. This study also provides support for NMET rating practices that can potentially bring positive washback to English teaching in Chinese high schools.
Resumo:
Abstract Professional language assessment is a new concept that has great potential to benefit Internationally Educated Professionals and the communities they serve. This thesis reports on a qualitative study that examined the responses of 16 Canadian English Language Benchmark Assessment for Nurses (CELBAN) test-takers on the topic of their perceptions of the CELBAN test-taking experience in Ontario in the winter of 2015. An Ontario organization involved in registering participants distributed an e-mail through their listserv. Thematic analyses of focus group and interview transcripts identified 7 themes from the data. These themes were used to inform conclusions to the following questions: (1) How do IENs characterize their assessment experience? (2) How do IENs describe the testing constructs measured by the CELBAN? (3) What, if any, potential sources of construct irrelevant variance (CIV) do the test-takers describe based on their assessment experience? (4) Do IENs feel that the CELBAN tasks provide a good reflection of the types of communicative tasks required of a nurse? Overall, participants reported positive experiences with the CELBAN as an assessment of their language skills, and noted some instances in which they felt some factors external to the assessment impacted their demonstration of their knowledge and skill. Lastly, some test-takers noted the challenge of completing the CELBAN where the types of communicative nursing tasks included in the assessment differed from nursing tasks typical of an IENs country or origin. The findings are discussed in relation to literature on high-stakes large-scale assessment and IEPs, and a set of recommendations are offered to future CELBAN administration. These recommendations include (1) the provision of a webpage listing all licensure requirements (2) monitoring of CELBAN location and dates in relation to the wider certification timeline for applicants (3) The provision of additional CELBAN preparatory materials (4) Minor changes to the CELBAN administrative protocols. Given that the CELBAN is a relatively new assessment format and its widespread use for high-stakes decisions (a component of nursing certification and licensure), research validating IEN-test-taker responses to construct representation and construct irrelevant variance is critical to our understanding of the role of competency testing for IENs.
Resumo:
Thermal and fatigue cracking are the two of the major pavement distress phenomena that contribute significantly towards increased premature pavement failures in Ontario. This in turn puts a massive burden on the provincial budgets as the government spends huge sums of money on the repair and rehabilitation of roads every year. Governments therefore need to rethink and re-evaluate their current measures in order to prevent it in future. The main objectives of this study include: the investigation of fatigue distress of 11 contract samples at 10oC, 15oC, 20oC and 25oC and the use of crack-tip-opening-displacement (CTOD) requirements at temperatures other than 15oC; investigation of thermal and fatigue distress of the comparative analysis of 8 Ministry of Transportation (MTO) recovered and straight asphalt samples through double-edge-notched-tension test (DENT) and extended bending beam rheometry (EBBR); chemical testing of all samples though X-ray Fluorescence (XRF) and Fourier transform infrared analysis (FTIR); Dynamic Shear Rheometer (DSR) higher and intermediate temperature grading; and the case study of a local Kingston road. Majority of 11 contract samples showed satisfactory performance at all temperatures except one sample. Study of CTOD at various temperatures found a strong correlation between the two variables. All recovered samples showed poor performance in terms of their ability to resist thermal and fatigue distress relative to their corresponding straight asphalt as evident in DENT test and EBBR results. XRF and FTIR testing of all samples showed the addition of waste engine oil (WEO) to be the root cause of pavement failures. DSR high temperature grading showed superior performance of recovered binders relative to straight asphalt. The local Kingston road showed extensive signs of damage due to thermal and fatigue distress as evident from DENT test, EBBR results and pictures taken in the field. In the light of these facts, the use of waste engine oil and recycled asphalt in pavements should be avoided as these have been shown to cause premature failure in pavements. The DENT test existing CTOD requirements should be implemented at other temperatures in order to prevent the occurrences of premature pavement failures in future.