17 results for convergent validity
in DigitalCommons@The Texas Medical Center
Abstract:
The Surgeon General recommends that preschoolers 3-5 years old accumulate 60 minutes of moderate-to-vigorous physical activity (MVPA) per day. However, there are limited data measuring physical activity (PA) and MVPA in this population. The purpose of this cross-sectional study is to determine the validity, reliability, and feasibility of using MVP 4 Function Walk4Life digital pedometers (MVP-4) to measure MVPA among preschoolers, using the newly modified direct observational technique, System for Observing Fitness Instruction Time-Preschool Version (SOFIT-P), as the gold standard. An ethnically diverse population of underserved 3-5-year-old children was recruited from two Harris County Department of Education (HCDE) Head Start centers. For 2 days at baseline and 2 days at post-test, the 75 enrolled children wore MVP-4 pedometers for approximately 6 hours per observation day and were observed using SOFIT-P during predominantly active times. Statistical analyses determined mean minutes of PA and MVPA and used Pearson r correlation coefficients to assess convergent and criterion validity and reliability. Significance was set at p < 0.05. Feasibility was determined through process evaluation information collected during the study via observations from data collectors and teacher input. Results show that mean minutes of PA and MVPA ranged between 30-42 and 11-14 minutes, respectively. Convergent validity comparing BMI percentiles with MVP-4 PA outcomes showed no significance at pre-test; however, both measurements at post-test showed significance for MVPA (p = 0.0247 and p = 0.0056). Criterion validity comparing percent MVPA time between SOFIT-P and MVP-4 pedometers was determined; however, the results were deemed insufficient due to inconsistency in observation times while using the newly developed SOFIT-P. Reliability measures showed no significance at pre-test, yet showed significant results for all PA outcomes at post-test (p = 0.001, p = 0.001, p = 0.0010, p = 0.003).
Finally, MVP-4 pedometers lacked feasibility due to logistical barriers in design. The researchers attribute the significant results at post-test to increased familiarity with, and more accurate placement of, the pedometers over time. They suggest that manufacturers further modify the MVP-4 pedometer for ease of use with this population, after which future studies should determine validity using objective measures or all-day direct observation techniques.
Abstract:
Loneliness is a pervasive, rather common experience in American culture, particularly notable among adolescents. However, the phenomenon is not well documented in the cross-cultural psychiatric literature. For psychiatric epidemiology to encompass a wide array of psychopathologic phenomena, it is important to develop useful measures to characterize and classify both non-clinical and clinical dysfunction in diverse subgroups and cultures. The goal of this research was to examine the cross-cultural reliability and construct validity of a scale designed to measure loneliness. The Roberts Loneliness Scale (RLS-8) was administered to 4,060 adolescents ages 10-19 years enrolled in high schools on either side of the Texas-Tamaulipas border region between the U.S. and Mexico. Data collected in 1988 from a study focusing on substance use and psychological distress among adolescents in these regions were used to examine the operating characteristics of the RLS-8. A sample stratified by nationality and language, age, gender, and grade was used for analysis. Results indicated that, in general, the RLS-8 has moderate reliability in the U.S. sample, but not in the Mexican sample. Validity analyses demonstrated evidence for convergent validity of the RLS-8 in the U.S. sample, but none in the Mexican sample. Discriminant validity of the measure could not be established in either sample. Based on the factor structure of the RLS-8, two subscales were created and analyzed for construct validity. Evidence for convergent validity was established for both subscales in both national samples. However, the discriminant validity of the measure remains unsubstantiated in both national samples. Also, the dimensionality of the scale is unresolved. One primary goal for future cross-cultural research would be to develop and test better-defined culture-specific models of loneliness within the two cultures.
From such scientific endeavor, measures of loneliness can be developed or reconstructed to classify the phenomenon in the same manner across cultures. Since estimates of prevalence and incidence are contingent upon reliable and valid screening or diagnostic measures, this objective would serve as an important foundation for future psychiatric epidemiologic inquiry into loneliness.
Abstract:
Background. Because it is important to minimize children's sun exposure to reduce skin cancer risk, much of the extensive skin cancer prevention literature consists of studies of children's sun protection, sun avoidance and ultraviolet radiation (UVR) exposure. Little attention has been focused on the measurement of psychosocial constructs in these studies. Identification of the psychosocial correlates or determinants of children's skin cancer risk or risk-reduction behavior is critical to more fully understand and predict behavior. Furthermore, psychosocial variables may be influenced by interventions to reduce risk. Thus, it is important to examine the psychosocial measures used in studies of children's skin cancer prevention. Information on the validity and reliability of psychosocial measures may increase confidence in study findings based on these measures. In particular, self-efficacy and barriers are key constructs in several major theoretical frameworks, and parental measures have been associated with children's sun protection. However, there is conceptual overlap of self-efficacy and barriers measures, and little is known about the psychometric properties of these measures. Study Aims and Methods. The overall goal of this dissertation was to examine the measurement of psychosocial constructs relevant to children's skin cancer prevention. Because children depend primarily on their parents for skin cancer prevention, measures of parents' psychosocial constructs are the focus. Study 1 was a systematic review of parental psychosocial measures used in studies of children's sun protection, sun avoidance and UVR exposure. The specific aims of Study 1 were to (1) describe psychosocial measures reported by parents, including available information on the psychometric properties of these measures and their use in analyses, and (2) provide recommendations for the development, refinement and standardized reporting of measures.
Study 2 examined the psychometric properties of measures of parental self-efficacy and barriers regarding children's sun protection. Melanoma patients (N=205) who were parents of children ≤ 12 years of age completed a telephone interview that included self-efficacy and barriers measures specific to sunscreen, clothing, shade and limiting time outdoors. The specific aims of Study 2 were to (1) use a confirmatory factor analytic approach to examine the factorial validity of parental self-efficacy and barriers measures, (2) examine the convergent and discriminant validity of behavior-specific measures of self-efficacy and barriers and (3) assess the reliability of item and scale measures. Results. In Study 1, a search of standard databases yielded 48 eligible studies. Most studies assessed only one or two psychosocial constructs. Knowledge was measured most frequently. There was little discussion of measure source, development, theoretical background or psychometric properties, besides internal consistency reliability. There was conceptual overlap of some measures. In Study 2, confirmatory factor analytic findings supported the factorial validity of the self-efficacy and barriers measures. When all eight self-efficacy and barriers measures were included in the same model, a modified eight-factor model adequately fit the data, providing preliminary evidence that the measures are distinct. Measure associations supported the convergent validity of all measures and the discriminant validity of most measures. The self-efficacy and barriers measures were reliable. Conclusions. Recommendations based on the literature review include developing and refining psychosocial measures based on theory. Describing a measure's theoretical basis and psychometric properties would facilitate critical evaluation.
Standardized reporting of source, development, theory, construct, items and analytic role would facilitate comparison of findings, continual refinement and future applications of measures. In the validation study, self-efficacy and barriers measures were examined in a sample of parents with a personal history of melanoma. Findings suggested that these measures are valid and reliable for use in studies of children's sun protection. There was preliminary evidence that these measures are distinct, but additional study is needed.
Abstract:
Introduction: Laparoscopic training models are increasingly important in urology to allow trainees to improve their laparoscopic skills prior to going to the operating room. For a training model to be valid, it must correlate with performance in a real case. The model must also discriminate between experienced and inexperienced subjects. [See PDF for complete abstract]
Abstract:
BACKGROUND: Women at increased risk of breast cancer (BC) are not widely accepting of chemopreventive interventions, and ethnic minorities are underrepresented in related trials. Furthermore, there is no validated instrument to assess the health-seeking behavior of these women with respect to these interventions. METHODS: By using constructs from the Health Belief Model, the authors developed and refined, based on pilot data, the Breast Cancer Risk Reduction Health Belief (BCRRHB) scale using a population of 265 women at increased risk of BC who were largely medically underserved, of low socioeconomic status (SES), and ethnic minorities. Construct validity was assessed using principal components analysis with oblique rotation to extract factors and to generate and interpret summary scales. Internal consistency was determined using Cronbach alpha coefficients. RESULTS: Test-retest reliability for the pilot and final data was calculated to be r = 0.85. Principal components analysis yielded 16 components that explained 64% of the total variance, with communalities ranging from 0.50-0.75. Cronbach alpha coefficients for the extracted factors ranged from 0.45-0.77. CONCLUSIONS: Evidence suggests that the BCRRHB yields reliable and valid data that allow for the identification of barriers and enhancing factors associated with use of breast cancer chemoprevention in the study population. These findings allow for tailoring treatment plans and intervention strategies to the individual. Future research is needed to validate the scale for use in other female populations. Cancer 2009. © 2009 American Cancer Society.
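The internal-consistency statistic named in this abstract, Cronbach's alpha, can be illustrated with a minimal sketch. The function name and the toy responses are hypothetical and are not taken from the BCRRHB study; the sketch assumes complete data (no missing responses) and uses sample variances.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(rows[0])
    if k < 2 or len(rows) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Transpose so that each tuple holds every respondent's score on one item.
    items = list(zip(*rows))
    item_variance_sum = sum(variance(col) for col in items)
    total_variance = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical toy data: 3 respondents answering 2 items.
toy_responses = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(toy_responses)
```

Values near 1 indicate that the items vary together; the sketch does not handle the degenerate case in which every respondent has the same total score (zero total variance).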
Abstract:
Neuromodulation is essential to many functions of the nervous system. In the simple gastropod mollusk Aplysia californica, neuromodulation of the circuits for the defensive withdrawal reflexes has been associated with several forms of learning. In the present work, the neurotransmitters and neural circuitry that contribute to the modulation of the tail-siphon withdrawal reflex were examined. A recently identified neuropeptide transmitter, buccalin A, was found to modulate the biophysical properties of the sensory neurons that mediate the reflex. The actions of buccalin A on the sensory neurons were compared with those of the well-characterized modulatory transmitter serotonin, and convergence and divergence in the actions of these two transmitters were evaluated. Buccalin A dramatically increased the excitability of sensory neurons and occluded further enhancement of excitability by serotonin. Buccalin A produced no significant change in spike duration, and it did not block serotonin-induced spike broadening. Voltage-clamp analysis revealed the currents that may be involved in the effects on spike duration and excitability. Buccalin A decreased an outward current similar to the S-K$^{+}$ current ($I_{\mathrm{K,S}}$). Buccalin A appeared to occlude further modulation of $I_{\mathrm{K,S}}$ by serotonin, but did not block serotonin-induced modulation of the voltage-dependent delayed rectifier K$^{+}$ current ($I_{\mathrm{K,V}}$). These results suggest that buccalin A converges on some, but not all, of the same subcellular modulatory pathways as serotonin. In order to begin to understand neuromodulation in a more physiological context for the tail-siphon withdrawal reflex, the modulatory circuitry for the tail-withdrawal circuit was examined. Mechanoafferent neurons in the J cluster of the cerebral ganglion were identified as elements of a modulatory circuit for the reflex.
Excitatory and inhibitory connections were observed between the J cells and the pleural sensory neurons, the tail motor neurons, and several classes of interneurons for the tail-siphon withdrawal circuit. The J cells produced both fast and slow PSPs in these neurons. Of particular interest was the ability of the J cells to produce slow EPSPs in the pleural sensory neurons. These slow EPSPs were associated with an increase in the excitability of the sensory neurons. The J cells appear to mediate both sensory and modulatory inputs to the circuit for the tail-siphon withdrawal reflex from the anterior part of the animal.
Abstract:
The Eker rat model has given researchers a unique opportunity to study the tumorigenesis of spontaneously occurring uterine leiomyoma. Animals in this line harbor a germline mutation in the tuberous sclerosis complex-2 (Tsc-2) tumor suppressor gene and develop uterine leiomyomas at a rate of ∼65%. Primary leiomyomas obtained from humans and Eker rats, along with Eker-derived leiomyoma cell lines, were used in the studies described herein to determine the effect of PPARγ ligand treatment on the proliferation of this cell type and to determine the role of tuberin and p27Kip1 in the etiology of this tumor type. Treatment of leiomyoma cells of human and rat origin with PPARγ-activating compounds resulted in decreased proliferation. Additionally, PPARγ ligands inhibited estrogen-dependent gene transactivation in Eker-derived leiomyoma cells, suggesting that nuclear receptor cross-talk may exist between PPAR and the ER and may be responsible for the inhibition of proliferation in this cell type. Loss of tuberin, the product of the TSC-2 gene, is associated with Eker rat leiomyoma development, while the role of this tumor suppressor in human leiomyoma development is unknown. Data herein show that tuberin expression is diminished in 25% of human leiomyomas tested. Additionally, we observed diminished p27Kip1 expression in 80% of human uterine leiomyomas compared to normal myometrium. Interestingly, the loss of tuberin expression in human leiomyoma was associated with cytoplasmic p27Kip1 accumulation in this cell type. Furthermore, tuberin-null Eker rat leiomyomas and derived cell lines had predominantly cytoplasmic p27Kip1 compared to tuberin-expressing normal myometrium. Taken together, our data show that human and Eker rat leiomyoma proliferation is inhibited by PPARγ ligand treatment and that the etiologies of human and Eker rat leiomyoma converge at loss of p27Kip1 function.
Furthermore, our data indicate that the loss of p27Kip1 function is mediated by loss of expression (in 80% of human leiomyoma) or cytoplasmic localization potentially resulting from the loss of tuberin.
Abstract:
In the last thirty years, increasing efforts have been made to reduce the prevalence of adolescent tobacco use in the United States. Although the prevalence has declined dramatically over the past decade, there are still sharp differences in adolescent smoking-initiation rates across racial/ethnic groups. Large-scale surveys frequently assess smoking-related attitudes, self-efficacy, and intentions to explain the differences in smoking rates between African Americans and Whites. However, there is little agreement about which constructs are significant. Moreover, the psychometric properties of smoking-related attitude, self-efficacy, and intention constructs have not been fully examined. More studies are needed to understand existing patterns of tobacco use and to validate and fully exploit the constructs' relationship to adolescent smoking initiation across racial/ethnic groups. This dissertation reports on a secondary analysis of data from a large multi-ethnic convenience sample of sixth- through eighth-grade students in 22 schools in East Texas and the city of Houston. The specific aims of this dissertation were to (1) describe smoking and alternative tobacco product use rates by race/ethnicity, gender, age, and grade level (Article 1); (2) test the factorial validity of smoking-related attitudes, self-efficacy, and intentions using confirmatory factor analysis techniques (Article 2); and (3) test the factorial invariance of smoking-related attitudes, self-efficacy, and intentions between African Americans and Whites (Article 3). The prevalence findings confirm the disparities in tobacco use among African American, Hispanic, and White adolescents that other surveys have reported (Article 1). This study also demonstrates the usefulness of examining use patterns of not only cigarettes but also alternative tobacco products in younger multiethnic populations, as well as of providing epidemiological data estimates about different phases of smoking.
The confirmatory factor analysis provides evidence of construct validity of attitude, self-efficacy, and intention scales for the multiethnic sample (Article 2). Finally, the factorial invariance analyses indicate that some measures representing smoking-related attitudes, self-efficacy, and intentions may not be appropriate for use among both African Americans and Whites (Article 3). Additional research is needed to further our understanding of the patterns and predictors of youth tobacco use initiation.
Abstract:
With substance abuse treatment expanding in prisons and jails, understanding how behavior change interacts with a restricted setting becomes more essential. The Transtheoretical Model (TTM) has been used to understand intentional behavior change in unrestricted settings; however, evidence indicates that restrictive settings can affect the measurement and structure of the TTM constructs. The present study examined data from problem drinkers at baseline and end-of-treatment from three studies: (1) Project CARE (n = 187) recruited inmates from a large county jail; (2) Project Check-In (n = 116) recruited inmates from a state prison; (3) Project MATCH, a large multi-site alcohol study, had two recruitment arms, aftercare (n = 724 pre-treatment and 650 post-treatment) and outpatient (n = 912 pre-treatment and 844 post-treatment). Structural Equation Modeling (SEM) of cross-sectional data was used to test for non-invariance of measures of the TTM constructs (readiness, confidence, temptation, and processes of change) across restricted and unrestricted settings. Two restricted groups (jail and aftercare) and one unrestricted group (outpatient) entering treatment, and one restricted group (prison) and two unrestricted groups (aftercare and outpatient) at end-of-treatment, were contrasted. In addition, TTM end-of-treatment profiles were tested as predictors of 12-month drinking outcomes (Profile Analysis). Although SEM did not indicate structural differences in the overall TTM construct model across setting types, there were factor structure differences on the confidence and temptation constructs at pre-treatment and in the factor structure of the behavioral processes at end-of-treatment. For pre-treatment temptation and confidence, differences were found in the social situations factor loadings and in the variance for the confidence and temptation latent factors.
For the end-of-treatment behavioral processes, differences across the restricted and unrestricted settings were identified in the counter-conditioning and stimulus control factor loadings. The TTM end-of-treatment profiles were not predictive of drinking outcomes in the prison sample. Both pre- and post-treatment differences in structure across setting types involved constructs operationalized with behaviors that are limited for those in restricted settings. These studies suggest the TTM is a viable model for explicating addictive behavior change in restricted settings but call for modification of subscale items that refer to specific behaviors and for caution in interpreting the mean differences across setting types for problem drinkers.
Abstract:
Background. Obesity affects not only Texas adults; it is also becoming a major problem among low-income Latino children. In Latino children between 2-5 years of age, the Pediatric Nutrition Surveillance data in 1997 found that the prevalence of obesity was 12 percent, the highest among all ethnic groups. Children learn what and how to eat from their environment. Although many mothers work, they are still the principal caregivers and source of influence on their toddler's diet. Self-efficacy, a concept introduced by Albert Bandura, is one's belief that one is capable of performing a behavior needed to reach an intended goal; it is becoming increasingly important in nutrition and health education. This study is important for understanding the degree of impact that a mother's self-efficacy has on a child's diet. Knowing whether influencing a mother's self-efficacy could improve a child's diet is useful for preventing public health problems such as obesity and diabetes. The purpose of this study was to examine the nutrition self-efficacy of Latina mothers, focusing on sweets and beverages, and whether their self-efficacy affected their child's diet. Methods. Data were collected during July-September 2008. Mothers were recruited from two federally qualified San Antonio health centers. In order to qualify, participants had to be Hispanic with children of toddler age. Mothers were informed of incentives available upon completion. The interview consisted of demographic information, a set of five self-efficacy questions that was repeated at completion to test reliability, and a 24-hour food recall of the participant's child's diet. Results. A total of 225 mothers participated across both clinics. The Cronbach alpha scores for the two administrations of the self-efficacy questions were .44 for the first and .49 for the second. The three most common beverages reported were milk, juice, and water.
Mothers who met or exceeded the milk intake recommended by the scientific community (800 mg of calcium, or 3 cups [24 oz]) had a higher self-efficacy score than those who did not meet the standard at all. Mothers who gave their children more juice than the standard recommends (4-6 oz for children 1-6 years of age) had slightly higher self-efficacy scores than mothers who simply met the standard. In general, the lower the mother's self-efficacy, the more sweets she gave her child, and vice versa. Conclusion. This study's Kappa values were adequate, and this research showed that Latina mothers did in fact have high self-efficacy. In general, some of the children's diets did not reflect the current scientific nutrition recommendations. In order to improve self-efficacy and have an impact on children's diets, the scientific community has a responsibility to make recommendations that are easily understood and can be put into practice. The public health community needs to ensure that we encourage those we serve to be more active in their health and educate them about what constitutes good health and nutrition for both themselves and their children.
Abstract:
Physical activity is a key component of the lifestyle modification process, which helps to reduce the risk of developing chronic diseases. It is important to have accurate estimates of physical activity to identify sedentary populations where interventions might be helpful. The International Physical Activity Questionnaire (IPAQ) short version has been used to estimate physical activity in diverse populations. However, there is little literature describing the use of the IPAQ short version in the Mexican American population. This study addressed the predictive validity and test-retest reliability of the IPAQ short version in Mexican American adults. The analysis was performed on 97 participants enrolled in the Cameron County Hispanic Cohort. Individuals selected for this study were 18 years of age or older. Predictive validity was evaluated by studying the relationship between physical activity and biomarkers known to be correlated with physical activity, namely TNF-α, adiponectin, and HDL. Multiple linear regression analysis was performed to assess predictive validity. To assess test-retest reliability, two IPAQ short last-seven-days questionnaires were interviewer-administered to the participants on the same day, approximately two hours apart. Test-retest reliability of the IPAQ was estimated by computing intraclass correlations between the readings at the two time points. The study showed that the IPAQ short version had acceptable test-retest reliability in the Mexican American population. It did not have acceptable predictive validity with respect to TNF-α, adiponectin, and HDL in this sample.
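The test-retest analysis described in this abstract relies on an intraclass correlation between two administrations. The abstract does not say which ICC form was used, so the following is only a sketch under the assumption of a one-way random-effects, single-measure ICC, often written ICC(1,1). The function name and the example scores are hypothetical.

```python
def icc_oneway(scores):
    """One-way random-effects, single-measure ICC (often written ICC(1,1)).

    scores: list of subjects, each a list of k repeated measurements.
    ICC = (MSB - MSW) / (MSB + (k - 1) * MSW)
    """
    n = len(scores)
    k = len(scores[0])
    if n < 2 or k < 2:
        raise ValueError("need at least 2 subjects and 2 measurements each")
    grand_mean = sum(sum(row) for row in scores) / (n * k)
    subject_means = [sum(row) / k for row in scores]
    # Between-subject mean square.
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    # Within-subject mean square.
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(scores, subject_means) for x in row
    ) / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Hypothetical example: three participants, two administrations each.
retest_icc = icc_oneway([[10, 10], [20, 20], [30, 30]])
```

Identical answers on both administrations give an ICC of 1; other ICC forms (two-way models, averaged measures) would need different mean squares.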
Abstract:
Background. At present, prostate cancer screening (PCS) guidelines require a discussion of risks, benefits, alternatives, and personal values, making decision aids an important tool to help convey information and to help clarify values. Objective. The overall goal of this study is to provide evidence of the reliability and validity of a PCS anxiety measure and the Decisional Conflict Scale (DCS). Methods. Using data from a randomized, controlled PCS decision aid trial that measured PCS anxiety at baseline and DCS at baseline (T0) and at two weeks (T2), four psychometric properties were assessed: (1) internal consistency reliability, indicated by factor analysis intraclass correlations and Cronbach's α; (2) construct validity, indicated by patterns of Pearson correlations among subscales; (3) discriminant validity, indicated by the measure's ability to discriminate between undecided men and those with a definite screening intention; and (4) factor validity and invariance using confirmatory factor analyses (CFA). Results. The PCS anxiety measure had adequate internal consistency reliability and good construct and discriminant validity. CFAs indicated that the 3-factor model did not have adequate fit. CFAs for a general PCS anxiety measure and a PSA anxiety measure indicated adequate fit. The general PCS anxiety measure was invariant across clinics. The DCS had adequate internal consistency reliability, except for the support subscale, and had adequate discriminant validity. Good construct validity was found at the private clinic, but was found only for the feeling informed subscale at the public clinic. The traditional DCS did not have adequate fit at T0 or at T2. The alternative DCS had adequate fit at T0 but was not identified at T2. Factor loadings indicated that two subscales, feeling informed and feeling clear about values, were not distinct factors. Conclusions. Our general PCS anxiety measure can be used in PCS decision aid studies.
The alternative DCS may be appropriate for men eligible for PCS. Implications. More emphasis needs to be placed on the development of PCS anxiety items relating to testing procedures. We recommend that the two DCS versions be validated in other samples of men eligible for PCS and in other health care decisions that involve uncertainty.
Abstract:
Epidemiologic studies of mental disorder have called attention to the need for identifying untreated cases and to the inadequacies of the instruments available for this purpose. Accurate case ascertainment devices are the basis of sound epidemiology. Without these, neither case classification nor analytic studies of risk factors is possible. The purpose of this research was to examine the reliability and validity of an instrument designed to measure depressive symptoms in community populations: the Center for Epidemiologic Studies Depression Scale (CES-D Scale). Two particular foci of the study were whether or not the scale had the same statistical structure across three ethnic groups and whether or not the magnitude and pattern of rates of symptoms for these groups were affected by one source of response error, that due to response tendencies. The effects of age and education on the pattern and magnitude of rates also were examined. In addition, the reliability and validity of the measures of response tendencies were assessed. The study population consisted of residents of Alameda County, California. A stratified sample of approximately 700 whites, blacks and Mexican-Americans was interviewed in the summer and fall of 1978. The results of the analysis indicated that the scale was reliable and measured a similar content domain across the three ethnic groups. The unadjusted sex- and ethnic-specific rates of depressive symptoms showed an ethnic pattern for both sexes: rates for whites were lowest, those for Mexican-Americans were highest, and those for blacks were intermediate. Measures of response tendencies (need for social approval, trait desirability, and acquiescence) affected the magnitude of the rates for most comparisons. Likewise, the pattern of rates changed somewhat from that originally observed.
The one fairly consistent observation was that rates for Mexican-American women were higher than those for the other two female subgroups in most of the comparisons. These results must be considered in the context of the reliability and validity assessment of the measures of response tendencies, which indicated the tenuousness of these measures. Age affected the ethnic pattern of rates for men in an inconsistent way; for women, Mexican-Americans continued to have higher rates than whites or blacks in all age categories. Education affected the magnitude of rates for women but not for men. For both men and women, Mexican-Americans had higher rates in all educational strata. Rates for women showed an inverse association with education while those for men did not.
Abstract:
The Work Limitations Questionnaire (WLQ) is used to determine the amount of work loss and lost productivity that stem from certain health conditions, including rheumatoid arthritis and cancer. The questionnaire is currently scored using methodology from Classical Test Theory. Item Response Theory, on the other hand, is a theory based on analyzing item responses. This study sought to determine the validity of using Item Response Theory (IRT) to analyze data from the WLQ. Item responses from 572 employed adults with dysthymia, major depressive disorder (MDD), double depressive disorder (both dysthymia and MDD), or rheumatoid arthritis, and from healthy individuals, were used to determine the validity of IRT (Adler et al., 2006). PARSCALE, which is IRT software from Scientific Software International, Inc., was used to calculate estimates of the work limitations based on item responses from the WLQ. These estimates, also known as ability estimates, were then correlated with the raw score estimates calculated from the sum of all the item responses. Concurrent validity, which claims a measurement is valid if the correlation between the new measurement and the valid measurement is greater than or equal to .90, was used to determine the validity of IRT methodology for the WLQ. Ability estimates from IRT were found to be fairly highly correlated with the raw scores from the WLQ (above .80). However, the only subscale with a correlation high enough for IRT to be considered valid was the time management subscale (r = .90). All other subscales (mental/interpersonal, physical, and output) did not produce valid IRT ability estimates. These lower-than-expected correlations may be explained by the outliers found in the sample.
Also, acquiescent responding (AR) bias, which is caused by the tendency of people to respond the same way to every question on a questionnaire, and the multidimensionality of the questionnaire (the WLQ is composed of four dimensions and thus four different latent variables) probably had a major impact on the IRT estimates. Furthermore, it is possible that the mental/interpersonal dimension violated the monotonicity assumption of IRT, causing PARSCALE to fail to run for these estimates. The monotonicity assumption needs to be checked for the mental/interpersonal dimension. Furthermore, the use of multidimensional IRT methods would most likely remove the AR bias and increase the validity of using IRT to analyze data from the WLQ.
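The concurrent-validity rule stated in this abstract (a correlation of at least .90 between the new and the established measurement) can be sketched as follows. The function names and the example numbers are hypothetical; the study's estimates came from PARSCALE, which this sketch does not reproduce.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def meets_concurrent_criterion(r, threshold=0.90):
    """Apply the abstract's rule: valid if the correlation is >= threshold."""
    return r >= threshold


# Hypothetical raw scores and ability estimates for four respondents.
raw_scores = [1, 2, 3, 4]
ability_estimates = [2, 4, 6, 8]
r = pearson_r(raw_scores, ability_estimates)
is_valid = meets_concurrent_criterion(r)
```

Under this rule, a subscale correlating at .90 would pass, while one correlating at .80 would not.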