Biblioteca Digital

734 results for Reliability measure

Evaluation of a photographic method to measure dental angulation

Relevance:

30.00%

Publisher:

Abstract:

Objective: to analyze the reliability and reproducibility of a simplified method for assessing dental angulation that uses digitized photographs of plaster models. Methods: standardized digital photographs of plaster models were taken and then imported into an angle-measurement graphics program to obtain the measurements. These procedures were repeated to assess the random error of the method and to analyze reproducibility by means of the intraclass correlation. The sample consisted of 12 individuals with complete permanent dentition and no orthodontic treatment, six male and six female. The analyses were performed bilaterally, yielding 24 measurements. Results: the random error ranged from 0.77 to 2.55° for tooth angulation. Statistical analysis showed that the method has excellent reproducibility (r = 0.65 - 0.91; p < 0.0001) for all teeth except the maxillary premolars, for which reproducibility was nonetheless statistically significant (p < 0.001). Conclusion: the proposed method is reliable enough to justify its use in scientific research as well as in clinical practice.

Speech Graphs Provide a Quantitative Measure of Thought Disorder in Psychosis

Relevance:

30.00%

Publisher:

Abstract:

Background: Psychosis has various causes, including mania and schizophrenia. Since the differential diagnosis of psychosis is exclusively based on subjective assessments of oral interviews with patients, an objective quantification of the speech disturbances that characterize mania and schizophrenia is in order. In principle, such quantification could be achieved by the analysis of speech graphs. A graph represents a network with nodes connected by edges; in speech graphs, nodes correspond to words and edges correspond to semantic and grammatical relationships. Methodology/Principal Findings: To quantify speech differences related to psychosis, interviews with schizophrenics, manics and normal subjects were recorded and represented as graphs. Manics scored significantly higher than schizophrenics in ten graph measures. Psychopathological symptoms such as logorrhea, poor speech, and flight of thoughts were grasped by the analysis even when verbosity differences were discounted. Binary classifiers based on speech graph measures sorted schizophrenics from manics with up to 93.8% of sensitivity and 93.7% of specificity. In contrast, sorting based on the scores of two standard psychiatric scales (BPRS and PANSS) reached only 62.5% of sensitivity and specificity. Conclusions/Significance: The results demonstrate that alterations of the thought process manifested in the speech of psychotic patients can be objectively measured using graph-theoretical tools, developed to capture specific features of the normal and dysfunctional flow of thought, such as divergence and recurrence. The quantitative analysis of speech graphs is not redundant with standard psychometric scales but rather complementary, as it yields a very accurate sorting of schizophrenics and manics. Overall, the results point to automated psychiatric diagnosis based not on what is said, but on how it is said.
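The graph representation described in this abstract can be illustrated with a minimal sketch. The abstract does not say how edges were derived, so the sketch assumes the simplest possible rule (a directed edge between each pair of consecutive words) and naive whitespace tokenization. The function names and the three measures are hypothetical illustrations, not the study's actual ten graph measures.

```python
from collections import Counter


def speech_graph(text):
    """Build a word graph: nodes are distinct words, edges link consecutive words.

    Assumption: consecutive-word edges stand in for the "semantic and
    grammatical relationships" mentioned in the abstract.
    """
    words = text.lower().split()
    nodes = set(words)
    # Count how often each directed edge (word -> next word) occurs.
    edge_counts = Counter(zip(words, words[1:]))
    return nodes, edge_counts


def graph_measures(text):
    """Return a few simple, illustrative graph measures for a transcript."""
    nodes, edge_counts = speech_graph(text)
    return {
        "nodes": len(nodes),
        "edges": len(edge_counts),
        # Edges traversed more than once: a crude proxy for recurrence.
        "repeated_edges": sum(1 for c in edge_counts.values() if c > 1),
        # Immediate repetitions of the same word.
        "self_loops": sum(1 for (a, b) in edge_counts if a == b),
    }


measures = graph_measures("the dog saw the dog run")
print(measures)
```

Measures like these could then feed a binary classifier, which is the step the abstract reports as reaching up to 93.8% sensitivity.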

Personal and Social Performance (PSP) scale for patients with schizophrenia: translation to Portuguese, cross-cultural adaptation and interrater reliability

Relevance:

30.00%

Publisher:

Abstract:

INTRODUCTION: Schizophrenia is a chronic mental disorder associated with impairment in social functioning. The most widely used scale to measure social functioning is the GAF (Global Assessment of Functioning), but it has the disadvantage of measuring symptoms and functioning at the same time, as described in its anchors. OBJECTIVES: Translation and cultural adaptation of the PSP, proposing a final version in Portuguese for use in Brazil. METHODS: We performed five steps: 1) translation; 2) back translation; 3) formal assessment of semantic equivalence; 4) debriefing; 5) analysis by experts. Interrater reliability (Intraclass correlation, ICC) between two raters was also measured. RESULTS: The final version was applied by two independent investigators in 18 adults with schizophrenia (DSM-IV-TR). The interrater reliability (ICC) was 0.812 (p < 0.001). CONCLUSION: The translation and adaptation of the PSP had an adequate level of semantic equivalence between the Portuguese version and the original English version. There were no difficulties related to understanding the content expressed in the translated texts and terms. Its application was easy and it showed good interrater reliability. The PSP is a valid instrument for the measurement of personal and social functioning in schizophrenia.
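As a rough illustration of how an interrater ICC of this kind can be computed, the sketch below implements one common form, ICC(2,1) in the Shrout and Fleiss notation (two-way random effects, absolute agreement, single rater). The abstract does not state which ICC form was used, so this choice is an assumption, and the ratings are invented for the example.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is a list of rows, one per subject, each row holding one
    score per rater. Raises ZeroDivisionError if all ratings are identical.
    """
    n = len(ratings)       # number of subjects
    k = len(ratings[0])    # number of raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Sums of squares from the two-way ANOVA decomposition.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Invented ratings: rater 2 always scores one point higher than rater 1.
example = [[1, 2], [2, 3], [3, 4]]
print(icc_2_1(example))
```

Because this form measures absolute agreement, the constant one-point offset between the two raters lowers the ICC even though their rankings of the subjects are identical.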

Validity of an automatic measure protocol in distal femur for allograft selection from a three-dimensional virtual bone bank system

Relevance:

30.00%

Publisher:

Abstract:

Osteoarticular allograft is one possible treatment in wide surgical resections with large defects. Selecting the best osteoarticular allograft is of great relevance for optimal exploitation of the bone databank, good surgical outcome and the patient's recovery. Current approaches are, however, very time consuming, which hinders these goals in practice. We present a validation study of software that performs automatic bone measurements, used to automatically assess distal femur sizes across a databank. 170 distal femur surfaces were reconstructed from CT data and measured manually using a size measure protocol taking into account the transepicondylar distance (A), anterior-posterior distance in medial condyle (B) and anterior-posterior distance in lateral condyle (C). Intra- and inter-observer studies were conducted and regarded as ground truth measurements. Manual and automatic measures were compared. For the automatic measurements, the correlation coefficients between observer one and the automatic method were 0.99 for the A measure and 0.96 for the B and C measures. The average time needed to perform the measurements was 16 h for both manual measurements and 3 min for the automatic method. Results demonstrate the high reliability and, most importantly, high repeatability of the proposed approach, and a considerable speed-up in planning.

Multiplicity of data in trial reports and the reliability of meta-analyses: empirical study

Relevance:

30.00%

Publisher:

Abstract:

Objectives To examine the extent of multiplicity of data in trial reports and to assess the impact of multiplicity on meta-analysis results. Design Empirical study on a cohort of Cochrane systematic reviews. Data sources All Cochrane systematic reviews published from issue 3 in 2006 to issue 2 in 2007 that presented a result as a standardised mean difference (SMD). We retrieved trial reports contributing to the first SMD result in each review, and downloaded review protocols. We used these SMDs to identify a specific outcome for each meta-analysis from its protocol. Review methods Reviews were eligible if SMD results were based on two to ten randomised trials and if protocols described the outcome. We excluded reviews if they only presented results of subgroup analyses. Based on review protocols and index outcomes, two observers independently extracted the data necessary to calculate SMDs from the original trial reports for any intervention group, time point, or outcome measure compatible with the protocol. From the extracted data, we used Monte Carlo simulations to calculate all possible SMDs for every meta-analysis. Results We identified 19 eligible meta-analyses (including 83 trials). Published review protocols often lacked information about which data to choose. Twenty-four (29%) trials reported data for multiple intervention groups, 30 (36%) reported data for multiple time points, and 29 (35%) reported the index outcome measured on multiple scales. In 18 meta-analyses, we found multiplicity of data in at least one trial report; the median difference between the smallest and largest SMD results within a meta-analysis was 0.40 standard deviation units (range 0.04 to 0.91). Conclusions Multiplicity of data can affect the findings of systematic reviews and meta-analyses. To reduce the risk of bias, reviews and meta-analyses should comply with prespecified protocols that clearly identify time points, intervention groups, and scales of interest.
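To make the multiplicity problem concrete, the following sketch computes a standardised mean difference for several candidate data choices from a single hypothetical trial and reports the spread between the smallest and largest result. It uses Cohen's d with a pooled standard deviation, which is an assumption here; reviews often apply a small-sample correction (Hedges' g) that is omitted for brevity. All numbers and labels are invented.

```python
import math


def pooled_sd(sd1, n1, sd2, n2):
    """Pooled standard deviation of two independent groups."""
    return math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / (n1 + n2 - 2))


def smd(mean1, sd1, n1, mean2, sd2, n2):
    """Standardised mean difference (Cohen's d, no small-sample correction)."""
    return (mean1 - mean2) / pooled_sd(sd1, n1, sd2, n2)


# Hypothetical trial reporting the same outcome on two scales and at two
# time points: (mean, SD, n) for intervention, then for control.
candidates = {
    "scale A, week 6": (10.0, 2.0, 20, 8.0, 2.0, 20),
    "scale A, week 12": (10.0, 2.0, 20, 9.0, 2.0, 20),
    "scale B, week 6": (31.0, 5.0, 20, 28.0, 5.0, 20),
}

smds = {label: smd(*args) for label, args in candidates.items()}
spread = max(smds.values()) - min(smds.values())

for label, value in smds.items():
    print(f"{label}: SMD = {value:.2f}")
print(f"spread between smallest and largest SMD: {spread:.2f}")
```

A protocol that names the scale and time point in advance removes this freedom of choice, which is the abstract's recommendation.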

Geriatric Pain Measure short form: development and initial evaluation

Relevance:

30.00%

Publisher:

Abstract:

OBJECTIVES: To develop and evaluate a short form of the 24-item Geriatric Pain Measure (GPM) for use in community-dwelling older adults. DESIGN: Derivation and validation of a 12-item version of the GPM in a European and an independent U.S. sample of community-dwelling older adults. SETTING: Three community-dwelling sites in London, United Kingdom; Hamburg, Germany; Solothurn, Switzerland; and two ambulatory geriatrics clinics in Los Angeles, California. PARTICIPANTS: European sample: 1,059 community-dwelling older persons from three sites (London, UK; Hamburg, Germany; Solothurn, Switzerland); validation sample: 50 persons from Los Angeles, California, ambulatory geriatric clinics. MEASUREMENTS: Multidimensional questionnaire including self-reported demographic and clinical information. RESULTS: Based on item-to-total scale correlations in the European sample, 11 of 24 GPM items were selected for inclusion in the short form. One additional item (pain-related sleep problems) was included based on clinical relevance. In the validation sample, the Cronbach alpha of GPM-12 was 0.92 (individual subscale range 0.77-0.92), and the Pearson correlation coefficient (r) between GPM-12 and the original GPM was 0.98. The correlation between the GPM-12 and the McGill Pain Questionnaire was 0.63 (P<.001), similar to the correlation between the original GPM and the McGill Pain Questionnaire (Pearson r=0.63; P<.001). Exploratory factor analysis indicated that the GPM-12 covers three subfactors (pain intensity, pain with ambulation, disengagement because of pain). CONCLUSION: The GPM-12 demonstrated good validity and reliability in these European and U.S. populations of older adults. Despite its brevity, the GPM-12 captures the multidimensional nature of pain in three subscales. The self-administered GPM-12 may be useful in the clinical assessment process and management of pain and in pain-related research in older persons.
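Cronbach's alpha, the internal-consistency statistic reported here, can be computed from item-level scores as sketched below. The formula is the standard one, alpha = k/(k-1) × (1 − sum of item variances / variance of total score); the function name and the response data are invented for illustration.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Uses sample variances (n - 1 denominator) for items and for total scores.
    """
    k = len(rows[0])  # number of items
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Invented responses from three people on a two-item scale.
responses = [[1, 2], [2, 1], [3, 3]]
print(round(cronbach_alpha(responses), 3))
```

Alpha rises as items covary more strongly relative to their individual variances, so a 12-item form reaching 0.92 indicates that the retained items move together closely.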

The Getting-Out-of-Bed (GoB) scale: a measure of motivation and life outlook in older adults with cancer

Relevance:

30.00%

Publisher:

Abstract:

OBJECTIVE: To develop and evaluate the psychometric properties of a measure of motivation and life outlook (Getting-Out-of-Bed [GoB]). DESIGN: Secondary analysis of baseline and 6-month data from a longitudinal follow-up study of older breast cancer survivors. PARTICIPANTS: Women (N = 660) diagnosed with primary breast cancer stage I-IIIA disease, age ≥65 years, and permission to contact from an attending physician in four geographic regions in the United States (city-based Los Angeles, California; statewide in Minnesota, North Carolina, and Rhode Island). MEASUREMENT: Data were collected over 6 months of follow-up from consenting patients' medical records and telephone interviews with patients. Data collected included the 4-item GoB, health-related quality of life (HRQoL), breast cancer, sociodemographic, and health-related characteristics. RESULTS: Factor analysis produced, as hypothesized, one principal component with eigenvalues of 2.74 (baseline) and 2.91 (6 months), which explained 68.6% (baseline) and 72.7% (6 months) of total variance. In further psychometric analyses, GoB exhibited good construct validity (divergent: low nonstatistically significant correlations with unrelated constructs; convergent: moderate statistically significant correlations with related constructs; discriminant: distinguished high HRQoL groups with a high level of significance), excellent internal reliability (Cronbach's alpha 0.84 (baseline), 0.87 (6 months)), and produced stable measurements over 6 months. Women with GoB scores ≥50 at baseline were more likely at 6 months to have good HRQoL, good self-perceived health, and report regular exercise, indicating good predictive ability. CONCLUSION: GoB demonstrated overall good psychometric properties in this sample of older breast cancer survivors, suggestive of a promising tool for assessing motivation and life outlook in older adults. Nevertheless, because it was developed and initially evaluated in a select sample, using measures with similar but not exact content overlap, further evaluation is needed before it can be recommended for widespread use.

Psychoendocrine validation of a short measure for assessment of perceived stress management skills in different non-clinical populations

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND: We investigated the psychometric properties of a short questionnaire for combined assessment of different perceived stress management skills in the general population and tested whether scores relate to physiological stress reactivity. METHODS: For psychometric evaluation, we determined the factor structure of the questionnaire and investigated its measurement invariance in the participant groups and over time in three different independent samples representing the general population (total N=332). Reliability was tested by estimating test-retest reliability, internal consistency, and item reliabilities. We examined convergent and criterion validity using selected criterion variables. For endocrine validation, 35 healthy non-smoking and medication-free men in a laboratory study and 35 male and female employees in a workplace study underwent an acute standardized psychosocial stress task. We assessed stress management skills and measured salivary cortisol before and several times up to 60 min (workplace study) and 120 min (laboratory study) after stress. Potential confounders were controlled. RESULTS: The factor structure of the questionnaire consists of five scales reflecting acceptably distinct stress management skills such as cognitive strategies, use of social support, relaxation strategies, anger regulation, and perception of bodily tension. This factor structure was stable across participant groups and over time. Internal consistencies, item reliabilities, and test-retest reliabilities met established statistical requirements. Convergent and criterion validity were also established. In both endocrine validation studies, higher stress management skills were independently associated with lower cortisol stress reactivity (p's<.029). CONCLUSIONS: Our findings suggest that the questionnaire has good psychometric properties and that it relates to subjective psychological and objective physiological stress indicators. 
Therefore, the instrument seems a suitable measure for differential assessment of stress management skills in the general population.

A web-based non-intrusive ambient system to measure and classify activities of daily living.

Relevance:

30.00%

Publisher:

Abstract:

BACKGROUND The number of older adults in the global population is increasing. This demographic shift leads to an increasing prevalence of age-associated disorders, such as Alzheimer's disease and other types of dementia. With the progression of the disease, the risk for institutional care increases, which contrasts with the desire of most patients to stay in their home environment. Despite doctors' and caregivers' awareness of the patient's cognitive status, they are often uncertain about its consequences on activities of daily living (ADL). To provide effective care, they need to know how patients cope with ADL, in particular, the estimation of risks associated with the cognitive decline. The occurrence, performance, and duration of different ADL are important indicators of functional ability. The patient's ability to cope with these activities is traditionally assessed with questionnaires, which has disadvantages (e.g., lack of reliability and sensitivity). Several groups have proposed sensor-based systems to recognize and quantify these activities in the patient's home. Combined with Web technology, these systems can inform caregivers about their patients in real-time (e.g., via smartphone). OBJECTIVE We hypothesize that a non-intrusive system, which does not use body-mounted sensors, video-based imaging, or microphone recordings, would be better suited for use in dementia patients. Since it does not require the patient's attention and compliance, such a system might be well accepted by patients. We present a passive, Web-based, non-intrusive, assistive technology system that recognizes and classifies ADL. METHODS The components of this novel assistive technology system were wireless sensors distributed in every room of the participant's home and a central computer unit (CCU). The environmental data were acquired for 20 days (per participant) and then stored and processed on the CCU. In consultation with medical experts, eight ADL were classified. 
RESULTS In this study, 10 healthy participants (6 women, 4 men; mean age 48.8 years; SD 20.0 years; age range 28-79 years) were included. For explorative purposes, one female Alzheimer patient (Montreal Cognitive Assessment score=23, Timed Up and Go=19.8 seconds, Trail Making Test A=84.3 seconds, Trail Making Test B=146 seconds) was measured in parallel with the healthy subjects. In total, 1317 ADL were performed by the participants, 1211 ADL were classified correctly, and 106 ADL were missed. This led to an overall sensitivity of 91.27% and a specificity of 92.52%. Each subject performed an average of 134.8 ADL (SD 75). CONCLUSIONS The non-intrusive wireless sensor system can acquire environmental data essential for the classification of activities of daily living. By analyzing retrieved data, it is possible to distinguish and assign data patterns to subjects' specific activities and to identify eight different activities in daily living. The Web-based technology allows the system to improve care and provides valuable information about the patient in real-time.

The VEINES-QOL/Sym questionnaire is a reliable and valid disease-specific quality of life measure for deep vein thrombosis in elderly patients.

Relevance:

30.00%

Publisher:

Abstract:

PURPOSE To prospectively evaluate the psychometric properties of the Venous Insufficiency Epidemiological and Economic Study (VEINES-QOL/Sym) questionnaire, an instrument to measure disease-specific quality of life and symptoms in elderly patients with deep vein thrombosis (DVT), and to validate a German version of the questionnaire. METHODS In a prospective multicenter cohort study of patients aged ≥ 65 years with acute venous thromboembolism, we used standard psychometric tests and criteria to evaluate the reliability, validity, and responsiveness of the VEINES-QOL/Sym in patients with acute symptomatic DVT. We also performed an exploratory factor analysis. RESULTS Overall, 352 French- and German-speaking patients were enrolled (response rate of 87 %). Both language versions of the VEINES-QOL/Sym showed good acceptability (missing data, floor and ceiling effects), reliability (internal consistency, item-total and inter-item correlations), validity (convergent, discriminant, known-groups differences), and responsiveness to clinical change over time in elderly patients with DVT. The exploratory factor analysis of the VEINES-QOL/Sym suggested three underlying dimensions: limitations in daily activities, DVT-related symptoms, and psychological impact. CONCLUSIONS The VEINES-QOL/Sym questionnaire is a practical, reliable, valid, and responsive instrument to measure quality of life and symptoms in elderly patients with DVT and can be used with confidence in prospective studies to measure outcomes in such patients.

Affect and cognition measures in preference-based decisions: Validity testing of the Ottawa Decisional Conflict Scale and a decision-specific anxiety measure with men eligible for prostate cancer screening

Relevance:

30.00%

Publisher:

Abstract:

Background. At present, prostate cancer screening (PCS) guidelines require a discussion of risks, benefits, alternatives, and personal values, making decision aids an important tool to help convey information and to help clarify values. Objective. The overall goal of this study is to provide evidence of the reliability and validity of a PCS anxiety measure and the Decisional Conflict Scale (DCS). Methods. Using data from a randomized, controlled PCS decision aid trial that measured PCS anxiety at baseline and DCS at baseline (T0) and at two weeks (T2), four psychometric properties were assessed: (1) internal consistency reliability, indicated by factor analysis intraclass correlations and Cronbach's α; (2) construct validity, indicated by patterns of Pearson correlations among subscales; (3) discriminant validity, indicated by the measure's ability to discriminate between undecided men and those with a definite screening intention; and (4) factor validity and invariance using confirmatory factor analyses (CFA). Results. The PCS anxiety measure had adequate internal consistency reliability and good construct and discriminant validity. CFAs indicated that the 3-factor model did not have adequate fit. CFAs for a general PCS anxiety measure and a PSA anxiety measure indicated adequate fit. The general PCS anxiety measure was invariant across clinics. The DCS had adequate internal consistency reliability except for the support subscale and had adequate discriminant validity. Good construct validity was found at the private clinic, but was only found for the feeling informed subscale at the public clinic. The traditional DCS did not have adequate fit at T0 or at T2. The alternative DCS had adequate fit at T0 but was not identified at T2. Factor loadings indicated that two subscales, feeling informed and feeling clear about values, were not distinct factors. Conclusions. Our general PCS anxiety measure can be used in PCS decision aid studies. The alternative DCS may be appropriate for men eligible for PCS. Implications. More emphasis needs to be placed on the development of PCS anxiety items relating to testing procedures. We recommend that the two DCS versions be validated in other samples of men eligible for PCS and in other health care decisions that involve uncertainty.

AN INVESTIGATION OF THE RELIABILITY AND VALIDITY OF THE CENTER FOR EPIDEMIOLOGIC STUDIES DEPRESSION SCALE IN THREE ETHNIC GROUPS

Relevance:

30.00%

Publisher:

Abstract:

Epidemiologic studies of mental disorder have called attention to the need for identifying untreated cases and to the inadequacies of the instruments available for this purpose. Accurate case ascertainment devices are the basis of sound epidemiology. Without these, neither case classification nor analytic studies of risk factors is possible. The purpose of this research was to examine the reliability and validity of an instrument designed to measure depressive symptoms in community populations, the Center for Epidemiologic Studies Depression Scale (CES-D Scale). Two particular foci of the study were whether or not the scale had the same statistical structure across three ethnic groups and whether or not the magnitude and pattern of rates of symptoms for these groups were affected by one source of response error, that due to response tendencies. The effects of age and education on the pattern and magnitude of rates also were examined. In addition, the reliability and validity of the measures of response tendencies were assessed. The study population consisted of residents of Alameda County, California. A stratified sample of approximately 700 whites, blacks and Mexican-Americans was interviewed in the summer and fall of 1978. The results of the analysis indicated that the scale was reliable and measured a similar content domain across the three ethnic groups. The unadjusted sex- and ethnic-specific rates of depressive symptoms showed an ethnic pattern for both sexes: rates for whites were lowest, those for Mexican-Americans were highest, and those for blacks were intermediate. Measures of response tendencies (need for social approval, trait desirability, and acquiescence) affected the magnitude of the rates for most comparisons. Likewise, the pattern of rates changed somewhat from that originally observed. The one fairly consistent observation was that rates for Mexican-American women were higher than those for the other two female subgroups in most of the comparisons. These results must be considered in the context of the reliability and validity assessment of the measures of response tendencies which indicated the tenuousness of these measures. Age affected the ethnic pattern of rates for men in an inconsistent way; for women, Mexican-Americans continued to have higher rates than whites or blacks in all age categories. Education affected the magnitude of rates for women but not for men. For both men and women, Mexican-Americans had higher rates in all educational strata. Rates for women showed an inverse association with education while those for men did not.

A multi-site pilot test study to measure safety climate in the university work setting

Relevance:

30.00%

Publisher:

Abstract:

Next to leisure, sport, and household activities, the most common activity resulting in medically consulted injuries and poisonings in the United States is work, with an estimated 4 million workplace related episodes reported in 2008 (U.S. Department of Health and Human Services, 2009). To address the risks inherent to various occupations, risk management programs are typically put in place that include worker training, engineering controls, and personal protective equipment. Recent studies have shown that such interventions alone are insufficient to adequately manage workplace risks, and that the climate in which the workers and safety program exist (known as the "safety climate") is an equally important consideration. The organizational safety climate is so important that many studies have focused on developing means of measuring it in various work settings. While safety climate studies have been reported for several industrial settings, published studies on assessing safety climate in the university work setting are largely absent. Universities are unusual workplaces because of the potential exposure to a diversity of agents representing both acute and chronic risks. Universities are also unusual because readily detectable health and safety outcomes are relatively rare. The ability to measure safety climate in a work setting with rarely observed systemic outcome measures could serve as a powerful means of evaluating safety risk management programs. The goal of this research study was the development of a survey tool to measure safety climate specifically in the university work setting. The use of a standardized tool also allows for comparisons among universities throughout the United States. The first study objective, to quantitatively assess safety climate at five universities across the United States, was accomplished. At five universities, 971 participants completed an online questionnaire to measure the safety climate. The average safety climate score across the five universities was 3.92 on a scale of 1 to 5, with 5 indicating very high perceptions of safety at these universities. The two lowest overall dimensions of university safety climate were "acknowledgement of safety performance" and "department and supervisor's safety commitment". The results underscore how the perception of safety climate is significantly influenced at the local level. A second study objective, evaluating the reliability and validity of the safety climate questionnaire, was accomplished. A third objective fulfilled was to provide executive summaries resulting from the questionnaire to the participating universities' health & safety professionals and collect feedback on usefulness, relevance and perceived accuracy. Overall, the professionals found the survey and results to be very useful, relevant and accurate. Finally, the safety climate questionnaire will be offered to other universities for benchmarking purposes at the annual meeting of a nationally recognized university health and safety organization. The ultimate goal of the project was accomplished: the creation of a standardized tool that can be used for measuring safety climate in the university work setting and can facilitate meaningful comparisons amongst institutions.

The 12-Item General Health Questionnaire (GHQ-12): Reliability, external validity and factor structure in the Spanish population

Relevance:

30.00%

Publisher:

Abstract:

The purpose of this study was to analyze the internal consistency and the external and structure validity of the 12-Item General Health Questionnaire (GHQ-12) in the Spanish general population. A stratified sample of 1001 subjects, ages between 25 and 65 years, taken from the general Spanish population was employed. The GHQ-12 and the Inventory of Situations and Responses of Anxiety-ISRA were administered. A Cronbach’s alpha of .76 (Standardized Alpha: .78) and a 3-factor structure (with oblique rotation and maximum likelihood procedure) were obtained. External validity of Factor I (Successful Coping) with the ISRA is very robust (.82; Factor II, .70; Factor III, .75). The GHQ-12 shows adequate reliability and validity in the Spanish population. Therefore, the GHQ-12 can be used with efficacy to assess people’s overall psychological well-being and to detect non-psychotic psychiatric problems. Additionally, our results confirm that the GHQ-12 can best be thought of as a multidimensional scale that assesses several distinct aspects of distress, rather than just a unitary screening measure.

Psychometric Investigation of the Check-In, Check-Out Fidelity of Implementation Measure

Relevance:

30.00%

Publisher:

Abstract:

Suspension and expulsion are utilized frequently and disproportionately in schools in the United States. Many schools utilize Positive Behavioral Interventions and Supports (PBIS), a tiered framework to prevent problem behavior and reduce the use of discipline practices (Sugai et al., 2000). Check-In, Check-Out (CICO) is a targeted group behavioral intervention that is utilized within this framework in schools to prevent severe problem behavior in students who are beginning to exhibit externalizing and/or internalizing behavioral needs, thus preventing the use of exclusionary discipline practices (Crone et al., 2010; Hawken & Horner, 2003). As the use of CICO in schools continues to grow, so too does the need for an instrument measuring its fidelity of implementation. The purpose of this study was to investigate the reliability and validity of the Check-In, Check-Out Fidelity of Implementation Measure (Crone et al., 2010), an instrument created to measure the fidelity of implementation of the CICO intervention. This study assessed the psychometric properties of the instrument utilizing an archival data set collected by the statewide PBIS initiative in a western state in the U.S. The results demonstrated promising content validity, construct validity, internal consistency, and interrater reliability. A unidimensional structure was determined to be the best structure for the instrument based on parsimony and the strong results obtained from the item loadings, internal consistency, and interrater reliability. Implications for use and future research are discussed.
