974 resultados para Item analysis
Resumo:
The new requirement placed on students in tertiary settings in Spain to demonstrate a B1 or a B2 proficiency level of English, in accordance with the Common European Framework of Reference for Languages (CEFRL), has led most Spanish universities to develop a program of certification or accreditation of the required level. The first part of this paper aims to provide a rationale for the type of test that has been developed at the Universidad Politécnica de Madrid for the accreditation of a B2 level, a multiple choice version, and to describe how it was constructed and validated. Then, in the second part of the paper, the results from its application to 924 students enrolled in different degree courses at a variety of schools and faculties at the university are analyzed based on a final test version item analysis. To conclude, some theoretical as well as practical conclusions about testing grammar that affect the teaching and learning process are drawn. RESUMEN. Las nuevas exigencias sobre niveles de competencia B1 y B2 en inglés según el Marco Común Europeo de Referencia para las Lenguas (MCERL) que se imponen sobre los estudiantes de grado y posgrado han llevado a la mayoría de las universidades españolas a desarrollar programas de acreditación o de certificación de estos niveles. La primera parte de este trabajo trata sobre las razones que fundamentan la elección de un tipo concreto de examen para la acreditación del nivel B2 de lengua inglesa en la Universidad Politécnica de Madrid. Se trata de un test de opción múltiple y en esta parte del trabajo se describe cómo fue diseñado y validado. En la segunda parte, se analizan los resultados de la aplicación del test a gran escala a un total de 924 estudiantes matriculados en varias escuelas y Facultades de la Universidad. Para terminar, se apuntan una serie de conclusiones teóricas y prácticas sobre la evaluación de la gramática y de qué modo influye en los procesos de enseñanza y aprendizaje.
Resumo:
Área de endemismo ou elemento biótico é uma região geográfica que apresenta congruência distribucional entre táxons. Não há um padrão aceito universalmente para delimitação de áreas de endemismo e, portanto, várias metodologias são usadas para sua identificação. Nesta dissertação, propomos uma comparação integrada de alguns métodos de análises de endemismo, com base em dados de distribuição hipotéticos e reais. Desta forma, este estudo tem como objetivos: (1) comparar a Análise de Parcimônia de endemicidade (PAE), a Análise de endemicidade (EA) e um novo método de codificação que propomos a Análise de Distribuições de Três-Itens (3ID), avaliando sua performance com base na capacidade de identificar padrões hipotéticos predefinidos de áreas de endemismo, representando áreas não conflitantes, aninhadas e sobrepostas; (2) analisar os padrões de distribuição de 214 espécies de hidrozoários bentônicos, pelágicos e benthopelágicos não-sifonóforos do Oceano Atlântico Sul Ocidental (OASO), usando três métodos biogeográficos para testar hipóteses anteriores de regionalização biogeográfica e avaliar o performance da PAE, a EA e a 3ID com conjuntos de dados reais. No capítulo 2, intitulado “Comparison of analysis of endemism procedures based on hypothetical distributions”, nós comparamos a PAE, EA e 3ID e encontramos que a 3ID tem o maior percentual de sucesso na recuperação de áreas de endemismo predefinidas. Adicionalmente, a EA é o único método capaz de recuperar padrões sobrepostos, porém também encontra padrões espúrios. Nós sugerimos, portanto, que a melhor opção para identificação de áreas de endemismo é o uso de 3ID e EA em conjunto. No capítulo 3, intitulado “Biogeographic patterns of benthic and planktonic hydrozoans from the southwestern Atlantic Ocean”, nós utilizamos dados distribucionais de 214 espécies de hidrozoários bentônicos, pelágicos e bentopelágicos não-sifonóforos do OASO (20°-60°S, 33°-75°W), os quais foram organizados em diferentes matrizes (concatenada, bentônica, pelágica, e bentopelágica) de acordo com as diferentes estratégias de ciclo de vida em Hydrozoa. Todas as matrizes foram analisadas por meio da PAE, EA e 3ID. Os resultados mostram três padrões biogeográficos gerais: (1) Tropical (2) Temperado-Quente, e (3) Temperado-Frio. Os padrões obtidos variam de acordo com o tipo de ciclo de vida em Hydrozoa, demonstrando a importância de analisar-se separadamente conjuntos de dados de espécies com diferentes estratégias de reprodução. Cada método teve um desempenho diferente e, portanto, concluímos que o uso de 3ID e EA em conjunto é a melhor opção para inferir padrões biogeográficos marinhos
Resumo:
The heritability and stability over a 19 year period of long (23-item) and short (12-item) versions of Eysenck's Neuroticism scale were compared in a large Australian twin-family sample. Stability over 19 years of the 23-item Neuroticism scale was 0.62 and for the 12-item scale 0.59. Correlations between scores obtained by mailed questionnaire and telephone interview a few weeks apart were 0.87 for the long scale and 0.85 for the short scale; scores obtained by mail were slightly higher, particularly for females. The 12-item scale had slightly reduced power to discriminate both high and low scoring individuals on the full 23-item scale. Mean Neuroticism score for the 12-item scale was atypically low when compared to the distribution of the complete set of scores for all possible combinations (> 1 million) of 12-items drawn from the full 23-item EPQ-R. Mean heritabilities for the lowest and highest 300,000 of these combinations were 43.2% and 42.7%, respectively, somewhat higher than the 41.0% for the actual EPQ-R-S 12-item scale. Heritability for the 23-item scale was 46.5%. We conclude that there is little loss of either stability or heritability in using the short EPQ-R scale, but the choice of which 12-items could have been better. (c) 2005 Elsevier Ltd. All rights reserved.
Resumo:
Purpose - This paper provides a deeper examination of the fundamentals of commonly-used techniques - such as coefficient alpha and factor analysis - in order to more strongly link the techniques used by marketing and social researchers to their underlying psychometric and statistical rationale. Design/methodology approach - A wide-ranging review and synthesis of psychometric and other measurement literature both within and outside the marketing field is used to illuminate and reconsider a number of misconceptions which seem to have evolved in marketing research. Findings - The research finds that marketing scholars have generally concentrated on reporting what are essentially arbitrary figures such as coefficient alpha, without fully understanding what these figures imply. It is argued that, if the link between theory and technique is not clearly understood, use of psychometric measure development tools actually runs the risk of detracting from the validity of the measures rather than enhancing it. Research limitations/implications - The focus on one stage of a particular form of measure development could be seen as rather specialised. The paper also runs the risk of increasing the amount of dogma surrounding measurement, which runs contrary to the spirit of this paper. Practical implications - This paper shows that researchers may need to spend more time interpreting measurement results. Rather than simply referring to precedence, one needs to understand the link between measurement theory and actual technique. Originality/value - This paper presents psychometric measurement and item analysis theory in easily understandable format, and offers an important set of conceptual tools for researchers in many fields. © Emerald Group Publishing Limited.
Resumo:
Background: Qualitative research has suggested that spousal carers of someone with dementia differ in terms of whether they perceive their relationship with that person as continuous with the premorbid relationship or as radically different, and that a perception of continuity may be associated with more person-centered care and the experience of fewer of the negative emotions associated with caring. The aim of the study was to develop and evaluate a quantitative measure of the extent to which spousal carers perceive the relationship to be continuous. Methods: An initial pool of 42 questionnaire items was generated on the basis of the qualitative research about relationship continuity. These were completed by 51 spousal carers and item analysis was used to reduce the pool to 23 items. The retained items, comprising five subscales, were then administered to a second sample of 84 spousal carers, and the questionnaire's reliability, discriminative power, and validity were evaluated. Results: The questionnaire showed good reliability: Cronbach's α for the full scale was 0.947, and test-retest reliability was 0.932. Ferguson's δ was 0.987, indicating good discriminative power. Evidence of construct validity was provided by predicted patterns of subscale correlations with the Closeness and Conflict Scale and the Marwit-Meuser Caregiver Grief Inventory. Conclusion: Initial psychometric evaluation of the measure was encouraging. The measure provides a quantitative means of investigating ideas from qualitative research about the role of relationship continuity in influencing how spousal carers provide care and how they react emotionally to their caring role. © 2012 International Psychogeriatric Association.
Resumo:
It has been reported that the cultural-historical experiences of ethnic group members can play a role in the literacy beliefs of those members. Socioeconomic conditions can also influence the belief system of the groups' constituents. This study investigated parents' and children's beliefs pertaining to early literacy acquisition as related to the ethnicity and socioeconomic status (SES) of the participants. The objectives were to determine (a) the differential patterns regarding emergent literacy and traditional skills approaches as they interact with ethnicity and SES and (b) the correspondence between parents and children's beliefs about literacy acquisition. ^ The study was conducted with 152 parents (38 low-income Hispanic, 38 middle-income Hispanic, 38 low-income African-American, and 38 middle-income African-American) and 36 of their 3-, 4-, or 5-year-old children (18 male and 18 female). ^ The parents were asked to check those items with which they agreed on a survey that consisted of an equal number of items from the traditional skills-based and emergent literacy orientations. These responses were used to determine the differences and interaction by ethnicity and SES. The children responded to open-ended questions related to the instruction of reading and writing skills. The parents' responses and children's answers were compared to ascertain the matching parent-child dyads by ethnicity and SES. ^ An item analysis was conducted to strengthen the internal reliability consistency coefficient of the traditional skills-based and emergent literacy scales as measured by the Cronbach Alpha. ^ A two-way multivariate analysis of variance (MANOVA) revealed a significant difference in traditional skill-based beliefs for the low-income African-American and Hispanic parents. There were no significant findings for the parents' traditional skill based or emergent literacy beliefs based on ethnicity, for the interaction between ethnicity and SES, or for the relationship between parents' and children's literacy beliefs by ethnicity and SES. ^ It can be concluded that low-income African-American and Hispanic parents believe in the traditional skills approach, indicating that these parents find it necessary for children to have sufficient school readiness skills prior to learning to read or write. In addition, the parent and child dyads had a strong tendency toward emergent literacy beliefs. ^
Resumo:
This thesis extends previous research on critical decision making and problem-solving by refining and validating a self-report measure designed to assess the use of critical decision making and problem solving in making life choices. The analysis was conducted by performing two studies, and therefore collecting two sets of data on the psychometric properties of the measure. Psychometric analyses included: item analysis, internal consistency reliability, interrater reliability, and an exploratory factor analysis. This study also included regression analysis with the Wonderlic, an established measure of general intelligence, to provide preliminary evidence for the construct validity of the measure.
Resumo:
Background and problem – As a result of financial crises and the realization of a broader stakeholder network, recent decades have seen an increase in stakeholder demand for non- financial information in corporate reporting. This has led to a situation of information overload where separate financial and sustainability reports have developed in length and complexity interdependent of each other. Integrated reporting has been presented as a solution to this problematic situation. The question is whether the corporate world believe this to be the solution and if the development of corporate reporting is heading in this direction. Purpose - This thesis aims to examine and assess to what extent companies listed on the OMX Stockholm 30 (OMXS30), as per 2016-02-28, comply with the Strategic content element of the <IR> Framework and how this disclosure has developed since the framework’s pilot project and official release by using a self-constructed disclosure index based on its specific items. Methodology – The purpose was fulfilled through an analysis of 104 annual reports comprising 26 companies during the period of 2011-2014. The annual reports were assessed using a self-constructed disclosure index based on the <IR> Framework content element Strategy and Resource Allocation, where one point was given for each disclosed item. Analysis and conclusions – The study found that the OMXS30-listed companies to a large extent complies with the strategic content element of the <IR> Framework and that this compliance has seen a steady growth throughout the researched time span. There is still room for improvement however with a total average framework compliance of 84% for 2014. Although many items are being reported on, there are indications that companies generally miss out on the core values of Integrated reporting.
Resumo:
The shift from decentralized to centralized A-level examinations (Abitur) was implemented in the German school system as a measure of Educational Governance in the last decade. This reform was mainly introduced with the intention of providing higher comparability of school examinations and student achievement as well as increasing fairness in school examinations. It is not known yet if these ambitious aims and functions of the new centralized examination format have been achieved and if fairer assessment can be guaranteed in terms of providing all students with the same opportunities to pass the examinations by allocating fair tests to different student subpopulations e.g., students of different background or gender. The research presented in this article deals with these questions and focuses on gender differences. It investigates gender-specific fairness of the test items in centralized Abitur examinations as high school exit examinations in Germany. The data are drawn from Abitur examinations in English (as a foreign language). Differential item functioning (DIF) analysis reveals that at least some parts of the examinations indicate gender inequality. (DIPF/Orig.)
Resumo:
Magdeburg, Univ., Fak. für Wirtschaftswiss., Diss., 2013
Resumo:
Although the Unified Huntington's Disease Rating Scale (UHDRS) is widely used in the assessment of Huntington disease (HD), the ability of individual items to discriminate individual differences in motor or behavioral manifestations has not been extensively studied in HD gene expansion carriers without a motor-defined clinical diagnosis (ie, prodromal-HD or prHD). To elucidate the relationship between scores on individual motor and behavioral UHDRS items and total score for each subscale, a nonparametric item response analysis was performed on retrospective data from 2 multicenter longitudinal studies. Motor and behavioral assessments were supplied for 737 prHD individuals with data from 2114 visits (PREDICT-HD) and 686 HD individuals with data from 1482 visits (REGISTRY). Option characteristic curves were generated for UHDRS subscale items in relation to their subscale score. In prHD, overall severity of motor signs was low, and participants had scores of 2 or above on very few items. In HD, motor items that assessed ocular pursuit, saccade initiation, finger tapping, tandem walking, and to a lesser extent, saccade velocity, dysarthria, tongue protrusion, pronation/supination, Luria, bradykinesia, choreas, gait, and balance on the retropulsion test were found to discriminate individual differences across a broad range of motor severity. In prHD, depressed mood, anxiety, and irritable behavior demonstrated good discriminative properties. In HD, depressed mood demonstrated a good relationship with the overall behavioral score. These data suggest that at least some UHDRS items appear to have utility across a broad range of severity, although many items demonstrate problematic features.
Resumo:
"4 September 1981."
Resumo:
Universidade Estadual de Campinas . Faculdade de Educação Física
Resumo:
Polytomous Item Response Theory Models provides a unified, comprehensive introduction to the range of polytomous models available within item response theory (IRT). It begins by outlining the primary structural distinction between the two major types of polytomous IRT models. This focuses on the two types of response probability that are unique to polytomous models and their associated response functions, which are modeled differently by the different types of IRT model. It describes, both conceptually and mathematically, the major specific polytomous models, including the Nominal Response Model, the Partial Credit Model, the Rating Scale model, and the Graded Response Model. Important variations, such as the Generalized Partial Credit Model are also described as are less common variations, such as the Rating Scale version of the Graded Response Model. Relationships among the models are also investigated and the operation of measurement information is described for each major model. Practical examples of major models using real data are provided, as is a chapter on choosing an appropriate model. Figures are used throughout to illustrate important elements as they are described.