872 results for High Stakes Testing
Abstract:
The purpose of this study was to analyze the evolution of Florida state level policy efforts and to assess the responding educational policy development and implementation at the local school district level. The focus of this study was the secondary language arts curriculum in Miami-Dade County Public Schools. Data was collected using document analysis as a source of meaning making out of the language sets proffered by agencies at each level. A matrix was created based on Klein's levels of curriculum decision-making and Functional Process Theory categories of policy formation. The matrix allowed the researcher to code and classify specific information in terms of accountability/high-stakes testing; authority; outside influences; and operational/structural organization. Federal policy documents provided a background and impetus for much of what originated at the State level. The State then produced policy directives which were accepted by the District and turned into specific policy directives and guidelines for practice. No evidence was found indicating the involvement of any other agencies in the development, transmission or implementation of the State level initiated policies. After analyzing the evolutionary process, it became clear that state policy directives were never challenged or discussed. Rather, they were accepted as standards to be met and, as such, school districts complied. Policy implementation is shown to be a top-down phenomenon. No evidence was found indicating a dialogue between state and local systems; rather, the state, as the source of authority, issued specifically worded policy directives and the district complied. Finally, this study recognizes that outside influences play an important role in shaping the education reform policy in the state of Florida. The federal government, through NCLB and other initiatives, created a climate which led almost naturally to the creation of the Florida A+ Plan.
Similarly, the concern of the business community, always interested in the production of competent workers, continued to support efforts at raising the minimum skill level of Florida high school graduates. Suggestions are made for future research including the examination of local school sites in order to assess the overall nature of the school experience rather than rely upon performance indicators mandated by state policy.
Abstract:
High-stakes testing and accountability have infiltrated the education system in the United States; the top priority for all teachers must be student progress on standardized tests. This has resulted in the predominance of reading for test-taking (efferent reading) in English, language arts, and reading classrooms. Authentic uses of print activities, like aesthetic reading, that encourage students to engage individually with a text have been pushed aside. During a 3-week time period, regular-level English 3/American literature students in a Title I magnet high school participated in this quasi-experimental study (N = 62). It measured the effects of an intervention of reading American literature texts aesthetically and writing aesthetically-evoked reader responses on students’ self-efficacy beliefs regarding their comprehension of American literature. One trained teacher and the researcher participated in the study; student participants were pre- and post-tested using the Confidence in Reading American Literature Survey, which examined their self-efficacy beliefs regarding their comprehension of American literature. Several statistical analyses were performed. The results of the linear regression analyses partially supported a positive relationship between aesthetically-evoked reader responses and students’ self-efficacy beliefs regarding their comprehension of American literature. Additionally, the results of the 2 (sex) x 2 (treatment) ANCOVAs conducted to test group differences in self-efficacy beliefs regarding the comprehension of American literature between treatment and control groups indicated a main effect for treatment (but not sex; nor was there a significant sex x treatment interaction), suggesting the treatment was partially effective in increasing students’ self-efficacy beliefs.
Seven of the twelve ANCOVAs indicated a statistically significant increase in the treatment group’s adjusted group mean self-efficacy belief scores as a result of being exposed to the intervention. In six of these seven analyses, increases in self-efficacy beliefs occurred in tasks that required three or more higher-order levels of thinking/learning. The results are discussed in terms of theoretical, empirical and practical significance. Future research is recommended to extend the intervention beyond the narrow confines of a Title I magnet school to settings where the intervention could be tested longitudinally, e.g., honors and gifted students, elementary and middle schools.
Abstract:
The principalship has changed significantly over the past 20 years. Today’s principals must be effective instructional leaders, managers of large facilities, and experts at analyzing data to successfully meet the accountability demands of high-stakes testing, along with state and federal mandates. The primary purpose of this quantitative study was to examine how 43 first- and second-year sitting school principals perceived their mentoring experiences and the degree to which a principal mentoring program, offered by their large urban school district, was effective in building their leadership capacity. A second purpose of this inquiry was to understand these principals’ perceptions of the most beneficial aspects of the mentoring program. The study used quantitative data gathered via an online questionnaire distributed during Fall 2015. The results indicated that respondents perceived that the components of the large urban school district's mentoring program were generally effective in training principal mentees to become highly effective school leaders. This study enriches the literature on mentoring by providing the voices of first- and second-year school leaders to add depth to the characteristics of successful mentoring programs.
Abstract:
The primary purpose of this study was to examine the influences of literacy variables on high-stakes test performance including: (a) student achievement on the Metropolitan Achievement Test, Seventh Edition (MAT-7) as correlated to a high-stakes test such as the FCAT examination and (b) the English language proficiency attained by English Language Learner (ELL) students when participating in, or exiting from, an English Speakers of Other Languages (ESOL) program as determined by the Limited English Proficient (LEP) committee. Two one-sample Chi-square tests were conducted to investigate the relationship between passing the MAT-7 Reading and Language examinations and the FCAT-SSS Reading Comprehension and FCAT-NRT examinations. In addition, 2x2 Analyses of Variance (ANOVAs) were conducted to address the relationship between the time ELL students spent in the ESOL program and the level of achievement on MAT-7 Reading and Language examinations and the FCAT-SSS Reading Comprehension and FCAT-NRT. Findings of this study indicated that more ELL students exit the program based on the LEP committee decisions than by passing the MAT-7. The majority of ELL students failed the 10th grade FCAT, the passing of which is needed for graduation. A significant number of ELL students failed, even when passing the MAT-7 or being duly exited through the decision of the LEP committee. The data also indicated that ELL students who exited the ESOL program in six semesters or fewer had higher FCAT scores than those who exited the program in seven semesters or more. The MAT-7 and the decision of the LEP committee were shown to be ineffective as predictors of success on the FCAT. Further research to determine the length of time a student in the ESOL program uses English to read, write, and speak should be conducted. Additionally, the development of a new assessment instrument to better predict student success should be considered.
However, it should be noted that the results of this study are limited to the context in which it was conducted and do not warrant generalizations beyond that context.
Abstract:
The present research represents a coherent approach to understanding the root causes of ethnic group differences in ability test performance. Two studies were conducted, each of which was designed to address a key knowledge gap in the ethnic bias literature. In Study 1, both the LR Method of Differential Item Functioning (DIF) detection and Mixture Latent Variable Modelling were used to investigate the degree to which Differential Test Functioning (DTF) could explain ethnic group test performance differences in a large, previously unpublished dataset. Though mean test score differences were observed between a number of ethnic groups, neither technique was able to identify ethnic DTF. This calls into question the practical application of DTF to understanding these group differences. Study 2 investigated whether a number of non-cognitive factors might explain ethnic group test performance differences on a variety of ability tests. Two factors – test familiarity and trait optimism – were able to explain a large proportion of ethnic group test score differences. Furthermore, test familiarity was found to mediate the relationship between socio-economic factors – particularly participant educational level and familial social status – and test performance, suggesting that test familiarity develops over time through the mechanism of exposure to ability testing in other contexts. These findings represent a substantial contribution to the field’s understanding of two key issues surrounding ethnic test performance differences. The author calls for a new line of research into these performance facilitating and debilitating factors, before recommendations are offered for practitioners to ensure fairer deployment of ability testing in high-stakes selection processes.
Abstract:
Educational assessment was a commonplace practice worldwide in the last century. With the theoretical underpinnings of education shifting from behaviourism and social efficiency to constructivism and cognitive theories in the past two decades, assessment theories and practices have been changing widely. The emergent assessment paradigm, with a futurist perspective, indicates a deviation away from the prevailing large-scale, high-stakes standardised testing and an inclination towards classroom-based formative assessment. Innovations and reforms initiated in attempts to achieve better education outcomes for a sustainable future via more developed learning and assessment theories have included the 2007 College English Reform Program (CERP) in the Chinese higher education context. This paper focuses on the College English Test (CET), the national English as a Foreign Language (EFL) testing system for non-English majors at tertiary level in China. It seeks to explore the roles that the CET played in the past two College English curriculum reforms, and the new role that testing and assessment assumed in the newly launched reform. The paper holds that the CET was operationalised to uplift standards. However, the extended use of this standardised testing system brings constraints as well as negative washback effects on tertiary EFL education. Therefore, in the newly launched reform (CERP), a new assessment model which combines summative and formative assessment approaches is proposed. Testing and assessment assumed a new role: to engender desirable education outcomes. The question asked is: will the mixed approach to formative and summative assessment provide the intended cure to the agony that tertiary EFL education in China has long been suffering, namely spending much time yet achieving little effect?
The paper reports the progress and challenges as informed by the available research literature, yet asserts that much remains to be explored regarding the potential of the assessment mix in this examination-obsessed society with its deep-rooted examination tradition.
Abstract:
Paired speaking tests are increasingly used in both low- and high-stakes second language assessment contexts. Until recently, very little was known about the way in which raters interpret and apply descriptors relating to interactional competence to a performance that is co-constructed. This book presents a study which explores the interactional features of a paired speaking test that were salient to raters and the extent to which raters viewed the performance as separable. The study shows that raters use their own frames of reference to interpret descriptors and that they viewed certain features of the performance as mutual accomplishments. The book takes us 'beyond scores', and in doing so, contributes to the growing body of research on paired speaking tests.
Abstract:
Throughout the world, state and national standardised testing of children has become a "huge industry" (English, 2002). Although English is referring to the American system, which has been involved in standardised testing for over half a century, the same could be said of many other countries, including Australia. It has been only in recent years that Australia has embraced national testing as part of a wider reform effort to bring about increased accountability in schooling. The results of high-stakes tests in Australia are now published in newspapers and electronically on the Australian federal government's MySchool website (www.myschoold.edu.au). MySchool provides results on the National Assessment Program - Literacy and Numeracy (NAPLAN) for students in Years 3, 5, 7 and 9. Data are available that compare schools to statistically similar schools. This more recent publication of national testing results in Australia is a visible example of "contractual accountability", described by Mulford, Edmunds, Kendall, Kendall and Bishop (2008) as "the degree to which [actors] are fulfilling the expectations of particular audiences in terms of standards, outcomes and results" (p.20).
Abstract:
Current English-as-a-second and foreign-language (ESL/EFL) research has encouraged treating each communicative macroskill separately due to space constraints, but the interrelationship among these skills (listening, speaking, reading, and writing) has not received due attention. This study attempts to examine first the existing relationship among the four dominant skills, second the potential impact of reading background on overall language proficiency, and finally the relationship between listening and overall language proficiency, as listening is considered an overlooked/passive skill in the pedagogy of the second/foreign language classroom. However, the literature in language learning has revealed that listening skill has salient importance in both first and second language learning. The purpose of this study is to investigate the role of each of the four skills in EFL learning and their existing interrelationships in an EFL setting. The outcomes of 701 Iranian applicants taking the International English Language Testing System (IELTS) test in Tehran demonstrate that all communicative macroskills have varied correlations from moderate (reading and writing) to high (listening and reading). The findings also show that the applicants’ reading history helped them perform better on high-stakes tests and, what is more, that listening skill was strongly correlated with overall language proficiency.
Abstract:
Pedagogical styles, methods, models, practices or strategies are valued for what they claim they can achieve. In recent times curriculum documents and governments have called for a range of teaching approaches to meet the variety of learner differences and allow students to make more independent decisions in physical education (Hardy and Mawer, 1999). One well-known system of categorizing teaching styles is Mosston and Ashworth’s Spectrum of Teaching Styles (2002). In Queensland, prior to 2005, no research had been conducted on the teaching styles used by teachers of Physical Education. However, many teachers self-reported that they employed a variety of teaching styles depending on the aims and content of the material to be taught (Cothran, et al., 2005). This research, for the first time, collected teachers’ self-reported use of teaching styles and, through observations, verified the styles that were being used to teach Senior Physical Education in Queensland. More specifically the aims of the research were to determine: a) What teaching styles do teachers of Senior Physical Education in Queensland believe they use? i) Were they using a range of teaching styles? ii) Were teachers of Senior Physical Education in Queensland using teaching styles that the Queensland Senior Physical Education Syllabus (2004) required? b) If Mosston and Ashworth’s (2002) Spectrum of Teaching Styles were used to categorise styles observed during the teaching of Senior Physical Education, did the styles being used provide opportunities for evaluating as described by the Queensland Senior Physical Education Syllabus (2004)? The research was conducted in two phases. Part A involved use of a questionnaire to determine the teaching styles Queensland teachers of Senior Physical Education reported using and how often they reported using them. The questionnaire was administered to 110 teachers throughout Queensland.
The sample was determined from 346 schools teaching Senior Physical Education (in 2006) across the state of Queensland, Australia. 286 questionnaires were sent to 77 non-randomised schools. There were 66 male and 44 female respondents in the sample. A wide range of teaching styles were reportedly used by teachers of Senior Physical Education, with Practice Style-Style B, Command Style-Style A and Divergent Discovery Style-Style H the most reportedly used. The Self-Teaching Style-Style K was reportedly used the least by teachers involved in this study. From the respondents a group of teachers were identified to form the participants for Part B. Part B of the study involved observation of a group of volunteer participants (from those who had completed the questionnaire) who displayed many of the ‘typical’ characteristics, and a cross-section of backgrounds, of teachers of Senior Physical Education in Queensland. In the case of this study, the criteria used to select the group of teachers to be observed teaching were teaching experience (number of years: 0-4, 5-10 and 11 years and over), gender, geographical location of schools (focused on Brisbane and nearby areas for travel/access purposes), profile of the students at schools (girls, boys or co-educational), nature of school (Government or Private) and the physical activities being taught in a school (activities to reflect all the areas of physical activity outlined within the syllabus). A total of 27 questionnaire respondents from Part A indicated that they were willing to be observed teaching practical lessons. The respondents who volunteered to be involved in Part B of the study came from different regions across the state of Queensland and were not confined to the Brisbane metropolitan area or large cities. From the group of people who volunteered for Part B, four came from outside Brisbane and 23 from the Brisbane area.
The final observation group of nine participants included eight teachers from the Brisbane area and one from a rural area. The characteristics of the final group included three females and six males from private and public schools with a range of teaching experience in years and a range of physical activities. Four Year 12 and five Year 11 teachers and their classes were videoed on three occasions as they progressed through an eight- to nine-week unit of work. This resulted in 24 hours 48 minutes and 20 seconds (or 4465 observations) of video teaching data, which was subsequently coded by several researchers (99% interobserver reliability) to determine the teaching styles employed by the participants. This research indicated that, based on Mosston and Ashworth’s (2002) Spectrum of Teaching Styles, teachers of Senior Physical Education in Queensland used predominantly one style to teach 27 observed lessons. This is in sharp contrast to the variety of styles 110 teachers self-reportedly used and in spite of the Queensland Senior Physical Education Syllabus (2004) suggesting a range of specific styles be used. These results are discussed in the context of the Queensland Senior Physical Education Syllabus (2004), teacher knowledge of teaching styles and high-stakes curriculum and external pressures such as national testing and the publication of data from schools in tabloid newspapers. The data and findings in this research provide a rationale for improving teacher knowledge regarding teaching styles and the need for a clear definition of terminology in syllabus documents. Careful examination of the effects that the publishing of school data may have on teaching styles is advised. This research not only collected teachers’ perceptions of the teaching styles they believed they used; it also verified these claims through direct observations of the teachers while teaching.
These findings are relevant to syllabus writers, teacher educators, policy makers within education and teachers.
Abstract:
Drawing on the largest Australian collection and analysis of empirical data on multiple facets of Aboriginal and Torres Strait Islander education in state schools to date, this article critically analyses the systemic push for standardized testing and improved scores, and argues for a greater balance of assessment types by providing alternative, inclusive, participatory approaches to student assessment. The evidence for this article derives from a major evaluation of the Stronger Smarter Learning Communities. The first large-scale picture of what is occurring in classroom assessment and pedagogy for Indigenous students is reported in this evaluation yet the focus in this article remains on the issue of fairness in student assessment. The argument presented calls for “a good balance between formative and summative assessment” (OECD, Synergies for Better Learning An International Perspective on Evaluation and Assessment, Pointers for Policy Development, 2013) at a time of unrelenting high-stakes, standardized testing in Australia with a dominance of secondary as opposed to primary uses of NAPLAN data by systems, schools and principals. A case for more “intelligent accountability in education” (O’Neill, Oxford Review of Education 39(1):4–16, 2013) together with a framework for analyzing efforts toward social justice in education (Cazden, International Journal of Educational Psychology 1(3):178–198, 2012) and fairer assessment make the case for more alternative assessment practices in recognition of the need for teachers’ pedagogic practice to cater for increased diversity.
Abstract:
Objective: To develop a child victimization survey among a diverse group of child protection experts and examine the performance of the instrument through a set of international pilot studies. Methods: The initial draft of the instrument was developed after input from scientists and practitioners representing 40 countries. Volunteers from the larger group of scientists participating in the Delphi review of the ICAST P and R reviewed the ICAST C by email in 2 rounds resulting in a final instrument. The ICAST C was then translated and back translated into six languages and field tested in four countries using a convenience sample of 571 children 12–17 years of age selected from schools and classrooms to which the investigators had easy access. Results: The final ICAST C Home has 38 items and the ICAST C Institution has 44 items. These items serve as screeners and positive endorsements are followed by queries for frequency and perpetrator. Half of respondents were boys (49%). Endorsement for various forms of victimization ranged from 0 to 51%. Many children report violence exposure (51%), physical victimization (55%), psychological victimization (66%), sexual victimization (18%), and neglect in their homes (37%) in the last year. High rates of physical victimization (57%), psychological victimization (59%), and sexual victimization (22%) were also reported in schools in the last year. Internal consistency was moderate to high (alpha between .685 and .855) and missing data low (less than 1.5% for all but one item). Conclusions: In pilot testing, the ICAST C identifies high rates of child victimization in all domains. Rates of missing data are low, and internal consistency is moderate to high. Pilot testing demonstrated the feasibility of using child self-report as one strategy to assess child victimization. Practice implications: The ICAST C is a multi-national, multi-lingual, consensus-based survey instrument.
It is available in six languages for international research to estimate child victimization. Assessing the prevalence of child victimization is critical in understanding the scope of the problem, setting national and local priorities, and garnering support for program and policy development aimed at child protection.
Abstract:
Solenopsis invicta Buren (red imported fire ant) are invasive pests that have the capability of major destructive impacts on lifestyle, ecology and economy. Control of this species is dependent, in part, upon ability to estimate the potential spread from newly discovered nests. The potential for spread and the spread characteristics differ between monogyne and polygyne social forms. Prior to this study, differentiation of the two social forms in laboratory test samples commonly used a method involving restriction endonuclease digestion of an amplified Gp-9 fragment. Success of this assay is limited by the quality of DNA, which in the field-collected insects may be affected by temporary storage in unfavourable conditions. Here, we describe an alternative and highly objective assay based upon a high resolution melt technique following preamplification of a significantly shorter Gp-9 fragment than that required for restriction endonuclease digestion. We demonstrate the application of this assay to a S. invicta incursion in Queensland, Australia, using field samples from which DNA may be partially degraded. The reductions in hands-on requirements and overall duration of the assay underpin its suitability for high-throughput testing.
Abstract:
This study builds on and contributes to work on assessment of children in primary school, particularly in science. Previous research has examined primary science assessment from different standpoints, but no studies have specifically addressed children’s perspectives. This article provides additional insight into issues surrounding children’s assessment in primary school and how the assessment of science might develop in England after the science SATs (Standard Assessment Tests) were abolished in 2009. Some research suggests that primary science assessment via SATs is a major reason for the observed decline in children’s engagement with science in upper primary and lower secondary school. The analytic focus on engaging children as coresearchers to assist in the process of gathering informed views and interpreting findings from a large sample of children’s views enables another contribution. The study, based on a survey of 1000 children in primary and secondary schools in England and Wales, reveals that despite being assessed under two different regimes (high-stakes national tests in England and moderated teacher assessment in Wales), children’s views of science assessment are remarkably consistent. Most appreciate the usefulness of science assessment and value frequent, non-SATs testing for monitoring/improving science progress. There was a largely negative impact, however, of science assessment on children’s well-being, particularly due to stress. The paper demonstrates that children provide an important perspective on assessment and that including their views can improve policy-making in relation to primary science assessment.
Abstract:
This paper has two aims. The first is to present cases in which scientists developed a defensive system for their homeland: Blackett and the air defense of Britain in WWII, Forrester and the SAGE system for North America in the Cold War, and Archimedes’ work defending Syracuse during the Second Punic War. In each case the historical context and the individual’s other achievements are outlined, and a description of the contribution’s relationship to OR/MS is given. The second aim is to consider some of the features the cases share and examine them in terms of contemporary OR/MS methodology. Particular reference is made to a recent analysis of the field’s strengths and weaknesses. This allows both a critical appraisal of the field and a set of potential responses for strengthening it. Although a mixed set of lessons arises, the overall conclusion is that the cases are examples to build on and that OR/MS retains the ability to do high-stakes work.