781 resultados para external validity
Resumo:
Preference reversals are frequently observed in the lab, but almost all designs use completely transparent prospects, which are rarely features of decision making elsewhere. This raises questions of external validity. We test the robustness of the phenomenon to gambles that incorporate realistic ambiguity in both payoffs and probabilities. In addition, we test a recent explanation of preference reversals by loss aversion, which would also restrict the incidence of reversals outside the lab. According to this account, reversals occur largely because the valuation task endows subject with a gamble, activating loss aversion. This contrasts with the choice task, where the reference point is pre-experiment wealth. We test this explanation by holding the reference point constant. Our evidence suggests that reversals are only slightly diminished with ambiguity. We find no evidence supporting their explanation by loss aversion.
Resumo:
What can explain the strong euroscepticism of radical parties of both the right and the left? This article argues that the answer lies in the paradoxical role of nationalism as a central element in both party families, motivating opposition towards European integration. Conventionally, the link between nationalism and euroscepticism is understood solely as a prerogative of radical right-wing parties, whereas radical left-wing euroscepticism is associated with opposition to the neoliberal character of the European Union.This article contests this view. It argues that nationalism cuts across party lines and constitutes the common denominator of both radical right-wing and radical left-wing euroscepticism. It adopts a mixed-methods approach, combining intensive case study analysis with quantitative analysis of party manifestos. First, it traces the link between nationalism and euroscepticism in Greece and France in order to demonstrate the internal validity of the argument. It then undertakes a cross-country statistical estimation to assess the external validity of the argument and its generalisability across Europe.
Resumo:
This article reports about the development and validation of a measurement instrument assessing elementary school students' achievement emotions (Achievement Emotions Questionnaire-Elementary School, AEQ-ES). Specifically, the instrument assesses students' enjoyment, anxiety, and boredom pertaining to three types of academic settings (i.e., attending class, doing homework, and taking tests and exams). Scale construction was based on Pekrun's (2006) control-value theory of achievement emotions. The instrument was tested using samples from German and American elementary school classrooms. The results of Study 1 (German sample) corroborate the reliability and structural validity of the new emotion measure. Moreover, they show that students' achievement emotions were linked with their control and value appraisals as well as their academic performance, thus supporting the external validity of the measure as well as propositions of Pekrun's (2006) control-value theory of achievement emotions. Study 2 (American sample) corroborated the cross-cultural equivalence of the measure and the generalizability of findings across the German and American samples. Implications for research on achievement emotions and educational practice are discussed. (PsycINFO Database Record (c) 2013 APA, all rights reserved)(journal abstract)
Resumo:
Introduction Researchers have, for decades, contributed to an increased collective understanding of the physiological demands in cross-country skiing; however, almost all of these studies have used either non-elite subjects and/or performances that emulate cross-country skiing. To establish the physiological demands of cross-country skiing, it is important to relate the investigated physiological variables to the competitive performance of elite skiers. The overall aim of this doctoral thesis was, therefore, to investigate the external validity of physiological test variables to determine the physiological demands in competitive elite cross-country skiing. Methods The subjects in Study I – IV were elite male (I – III) and female (III – IV) cross-country skiers. In all studies, the relationship between test variables (general and ski-specific) and competitive performances (i.e. the results from competitions or the overall ski-ranking points of the International Ski Federation (FIS) for sprint (FISsprint) and distance (FISdist) races) were analysed. Test variables reflecting the subject’s general strength, upper-body and whole-body oxygen uptake, oxygen uptake and work intensity at the lactate threshold, mean upper-body power, lean mass, and maximal double-poling speed were investigated. Results The ability to maintain a high work rate without accumulating lactate is an indicator of distance performance, independent of sex (I, IV). Independent of sex, high oxygen uptake in whole-body and upper-body exercise was important for both sprint (II, IV) and distance (I, IV) performance. The maximal double-poling speed and 60-s double-poling mean power output were indicators of sprint (IV) and distance performance (I), respectively. Lean mass was correlated with distance performance for women (III), whereas correlations were found between lean mass and sprint performance among both male and female skiers (III). Moreover, no correlations between distance performance and test variables were derived from tests of knee-extension peak torque, vertical jumps, or double poling on a ski-ergometer with 20-s and 360-s durations (I), whereas gross efficiency while treadmill roller skiing showed no correlation with either distance or sprint performance in cross-country skiing (IV). Conclusion The results in this thesis show that, depending on discipline and sex, maximal and peak oxygen uptake, work intensity at the lactate threshold, lean mass, double-poling mean power output, and double-poling maximal speed are all externally valid physiological test variables for evaluation of performance capability among elite cross-country skiers; however, to optimally indicate performance capability different test-variable expressions should be used; in general, the absolute expression appears to be a better indicator of competitive sprint performance whereas the influence of body mass should be considered when evaluating competitive distance performance capability of elite cross-country skiers.
Resumo:
Este trabalho tem por finalidade a construção de um instrumento psicológico de medida de comportamentos criativos, o Teste de Aptidão Criativa - TAC. A construção do TAC é justificada de um lado, por sua utilidade; sobretudo para o orientador profissional e psicólogo escolar e de outro; pela inexistência no Brasil; em particular no Nordeste, de instrumentos de medida similar. A formulação do problema e seu contexto são descritos logo no início do trabalho, bem como, a fundamentação teórica. Neste particular, enfatizou-se o enfoque psicológico da criatividade v em suas abordagens personológica e cognitiva. Atenção especial foi dada ao estudo do "modelo da estrutura do intelecto" de J. P. Guilford, suporte teórico do TAC. O teste de Aptidão Criativa é apresentado em todas as etapas de sua construção , desde a forma pré-piloto ç à piloto e à experimental, salientando-se seus respectivos subtestes figural e verbal. Ilustrações dos itens, bem como, das formas do teste são também apresentadas para melhor compreensão do texto. Em prosseguimento descreve- se uma pesquisa empírica realizada com o TAC, cujas hipóteses operacionais visam comprovar sua validade de constrito sua fidedignidade. A população foi constituída de alunos, de ambos os sexos, que cursaram em 1977 a 8a. série do 1º grau, de escolas públicas ou particulares; do município do Recife. O trabalho descreve detalhadamente a determinação da amostra, os instrumentos utilizados bem como os critérios de avaliação dos dados. O tratamento estatístico consistiu em medidas de tendência ceutra e de variabilidade; análise fatorial (técnica dos componentes principais-solução Varimax) para verificação da validade de construto. Os resultados obtidos, pela análise fatorial, demonstraram a presença de três fatores: um denominado C e interpretado como Convergente e dois, mensuráveis pelo TAC (conforme mostram as cargas fatoriais) que foram chamados de F (Fluência e Flexibilidade) e 0 Originalidade. O índice de homogeneidade calculado pelo “ de Cronbach" da característica originalidade entendida como essencial para a criatividade foi de 0,72. Tal resultado permite afirmar que na amostra testada o TAC , neste aspecto, apresenta fidedignidade significativa. Uma análise de regressão múltipla por passos foi levada a efeito, visando a identificação dos itens que mais contribuíram para explicação do escore total. Os resultados permitiram apontar uma melhor forma para o TAC figural constituída pelos itens I , II, V, VI , VII e VIII, os quais comporão a forma definitiva do TAC. Calculou-se também o estatístico Z do "Teste de urna Amostra de Kolmogorou para verificação da normalidade da distribuição das três características que o de medir: Fluência, Flexibilidade e Originalidade. Explicita-se ainda uma crítica à técnica de computação das respostas do TAC em face do índice de flexibilidade e propõe-se a realização de estudos, que permitam a formulação de novos procedimentos para um tratamento mais adequado das respostas aos estímulos do teste. Sugere-se também um estudo teórico sistemático de maior profundidade sobre os pontos de convergência entre as teorias psicológicas que tratam o tema para uma melhor compreensão da natureza e dinâmica do comportamento criativa. Propõe-se ainda a continuidade da pesquisa empírica que deu origem a este trabalho, tendo por objetivo imediato a padronização do TAC, de modo a poder ser ele instrumento útil ao psicólogo, na prática da psicologia aplicada.
Resumo:
Programas de saúde e bem-estar têm sido adotados por empresas como forma de melhorar a saúde de empregados, e muitos estudos descrevem retornos econômicos positivos sobre os investimentos envolvidos. Entretanto, estudos mais recentes com metodologia melhor têm demonstrado retornos menores. O objetivo deste estudo foi investigar se características de programas de saúde e bem-estar agem como preditores de custos de internação hospitalar (em Reais correntes) e da proporção de funcionários que têm licença médica, entre Abril de 2014 e Maio de 2015, em uma amostra não-aleatória de empresas no Brasil, através de parceria com uma empresa gestora de ‘big data’ para saúde. Um questionário sobre características de programas de saúde no ambiente de trabalho foi respondida por seis grandes empresas brasileiras. Dados retirados destes seis questionários (presença e idade de programa de saúde, suas características – inclusão de atividades de screening, educação sobre saúde, ligação com outros programas da empresa, integração do programa à estrutura da empresa, e ambientes de trabalho voltado para a saúde – e a adoção de incentivos financeiros para aderência de funcionários ao programa), bem como dados individuais de idade, gênero e categoria de plano de saúde de cada empregado , foram usados para construir um banco de dados com mais de 76.000 indivíduos. Através de um modelo de regressão múltipla e seleção ‘stepwise’ de variáveis, a idade do empregado foi positivamente associada e a idade do programa de saúde e a categoria ‘premium’ de plano de saúde do funcionário foram negativamente associadas aos custos de internação hospitalar (como esperado). Inesperadamente, a inclusão de programas de screening e iniciativas de educação de saúde nos programas de saúde e bem-estar nas empresas foram identificados como preditores positivos significativos para custos de admissão hospitalar. Para evitar a inclusão errônea de licenças-maternidade, apenas os dados de licença médica de pacientes do sexo masculino foram analisados (dados disponíveis apenas para duas entre as companhias incluídas, com um total de 18.957 pacientes do sexo masculino). Analisando estes dados através de um teste Z para comparação de proporções, a empresa com programa de saúde que inclui atividades voltadas a cessação de hábitos ruins (como tabagismo e etilismo), controle de diabetes e hipertensão, e que adota incentivos financeiros para a aderência de funcionários ao programa tem menor proporção de empregados com licençca médica no período analisado, quando comparada com a outra empresa que não tem estas características (também conforme esperado). Entretanto, a companhia com menor proporção de funcionários com licença médica também foi aquela que adota programa de screening entre as atividades de seu programa de saúde. Potenciais fontes de ameaça à validade interna e externa destes resultados são discutidas, bem como possíveis explicações para a associação entre programas de screening e educação médica a piores indicadores de saúde nesta amostra de companhias são discutidas. Novos estudos com melhor desenho, com amostras maiores e randômicas são necessários para validar estes resultados e possivelmente melhorar a validade interna e externa destes resultados.
Resumo:
Prior models of the policy process have examined how human characteristics can affect policy decision-making in such a way that it leads to aggregate effects on policy outcomes as a whole. I develop a model of the policy process which suggests that emotions related to fair and unfair experiences in the same policy domain are utilized by decision-makers as policy criteria. In the lab, I empirically tested this, and find that emotions and experience related to fairness do influence the policy decision to move away from the status quo alternative. Based upon this result, I simulated the evolution of a society of agents engaged in decision-making using similar criteria. The simulation suggests that incentives have an important role in leading to cooperation and social success. The external validity of the simulation also implies that it can act as a platform for future evolutionary policy experimentation.
Resumo:
Motor symptoms in schizophrenia occur frequently and are relevant to diagnosis and antipsychotic therapy. To date motor symptoms are difficult to assess and their pathobiology is a widely unresolved issue. The Bern Psychopathology Scale for the assessment of system-specific psychotic symptoms (BPS) was designed to identify homogenous patient groups by focusing on three domains: language, affectivity and motor behavior. The present study aimed to validate the motor behavior domain of the BPS using wrist actigraphy. In total, 106 patients were rated with the BPS and underwent 24 h continuous actigraphy recording. The ratings of the global severity of the motor behavior domain (GSM) as well as the quantitative and the subjective items of the motor behavior domain of the BPS were significantly associated with actigraphic variables. In contrast, the qualitative items of the motor domain failed to show an association with actigraphy. Likewise, scores of the language and the affectivity domains were not related to actigraphic measures. In conclusion, we provided substantial external validity for global, quantitative and subjective ratings of the BPS motor behavior domain. Thus, the BPS is suitable to assess the dimension of quantitative motor behavior in the schizophrenia spectrum.
Resumo:
Arts experts are commonly skeptical of applying scientific methods to aesthetic experiencing, which remains a field of study predominantly for the humanities. Laboratory research has however indicated that artworks may elicit emotional and physiological responses. Yet, this line of aesthetics research has previously suffered from insufficient external validity. We therefore conducted a study in which aesthetic perception was monitored in a fine-art museum, unrestricting to the viewers’ freedom of aesthetic choice. Visitors were invited to wear electronic gloves through which their locomotion, heart rate and skin conductance were continuously recorded. Emotional and aesthetic responses to selected works of an exhibition were assessed using a customized questionnaire. In a sample of 373 adult participants, we found that physiological responses during perception of an artwork were significantly related to aesthetic-emotional experiencing. The dimensions ‘Aesthetic Quality’, ‘Surprise/Humor’, ‘Dominance’ and ‘Curatorial Quality’ were associated with cardiac measures (heart rate variability, heart rate level) and skin conductance variability. This is first evidence that aesthetics can be statistically grounded in viewers’ physiology in an ecologically valid environment, the art gallery, enhancing our understanding of the effects of artworks and their curatorial staging.
Resumo:
In the discussion about the rationale for spine registries, two basic questions have to be answered. The first one deals with the value of orthopaedic registries per se, considering them as observational studies and comparing the evidence they generate with that of randomised controlled trials. The second question asks if the need for registries in spine surgery is similar to that in the arthroplasty sector. The widely held view that randomised controlled trials are the 'gold standard' for evaluation and that observational methods have little or no value ignores the limitations of randomised trials. They may prove unnecessary, inappropriate, impossible, or inadequate. In addition, the external validity and hence the ability to make generalisations about the results of randomised trials is often low. Therefore, the false conflict between those who advocate randomised trials in all situations and those who believe observational data provide sufficient evidence needs to be replaced with mutual recognition of their complementary roles. The fact that many surgical techniques or technologies were introduced into the field of spine surgery without randomised trials or prospective cohort comparisons makes obvious an even increased need for spine registries compared to joint arthroplasty. An essential methodological prerequisite for a registry is a common terminology for reporting results and a sophisticated technology that networks all participants so that one central data pool is created and accessed. Recognising this need, the Spine Society of Europe has researched and developed Spine Tango, the first European spine registry, which can be accessed under www.eurospine.org.
Resumo:
A literature review of the most widely used condition specific, self administered assessment questionnaires for low back pain had been undertaken. General and historic aspects, reliability, responsiveness and minimum clinically important difference, external validity, floor and ceiling effects, and available languages were analysed. These criteria, however, are only part of the consideration. Of similar importance are the content, wording of questions and answers in each of the six questionnaires and an analysis of the different score results. The issue of score bias is discussed and suggestions are given in order to increase the construct validity in the practical use of the individual questionnaires.
Resumo:
Introduction: The Health Technology Assessment report on effectiveness, cost-effectiveness and appropriateness of homeopathy was compiled on behalf of the Swiss Federal Office for Public Health (BAG) within the framework of the 'Program of Evaluation of Complementary Medicine (PEK)'. Materials and Methods: Databases accessible by Internet were systematically searched, complemented by manual search and contacts with experts, and evaluated according to internal and external validity criteria. Results: Many high-quality investigations of pre-clinical basic research proved homeopathic high-potencies inducing regulative and specific changes in cells or living organisms. 20 of 22 systematic reviews detected at least a trend in favor of homeopathy. In our estimation 5 studies yielded results indicating clear evidence for homeopathic therapy. The evaluation of 29 studies in the domain 'Upper Respiratory Tract Infections/Allergic Reactions' showed a positive overall result in favor of homeopathy. 6 out of 7 controlled studies were at least equivalent to conventional medical interventions. 8 out of 16 placebocontrolled studies were significant in favor of homeopathy. Swiss regulations grant a high degree of safety due to product and training requirements for homeopathic physicians. Applied properly, classical homeopathy has few side-effects and the use of high-potencies is free of toxic effects. A general health-economic statement about homeopathy cannot be made from the available data. Conclusion: Taking internal and external validity criteria into account, effectiveness of homeopathy can be supported by clinical evidence and professional and adequate application be regarded as safe. Reliable statements of cost-effectiveness are not available at the moment. External and model validity will have to be taken more strongly into consideration in future studies.
Resumo:
Objective: A summary of main aspects from a Health Technology Assessment report on Traditional Chinese Medicine (TCM) in Switzerland concerning effectiveness and safety is given. Materials and Methods: Literature search was performed through 13 databases, by scanning reference lists of articles and by contacting experts. Assessed were quality of documentation, internal and external validity. Results: Effectiveness: 43 articles concerning 'gastrointestinal tract and liver' were assessed. The studies covering 7,436 patients were undertaken in China (35), Japan (3), USA (2) and Australia (3); 33/43 being controlled studies. 34/40 show significantly better results in the TCM-treated group. A comparison of studies on results of treatment based on a diagnosis according to TCM criteria and studies on results of treatment according to Western diagnosis shows that treatment based on TCM diagnosis improves the result. The comparison of treatment by individual medication and standard medication showed a trend in favor of individual medication. Safety: TCM training and practice for physicians in Switzerland are officially regulated. Side effects occur, but no severe effects have been registered up to now in Switzerland. TCM medicinals are imported; admission regulations are being installed. Problems due to production abroad, Internet trade, self-medication or admixtures are possible. Conclusion: The evaluation of the literature search provides evidence for a basic clinical effectiveness of TCM therapy. Severe side effects were not observed in Switzerland. Regulations for trading and use of medicinals prevent treatment risks. Further clinical studies in a Western context are required.
Resumo:
In randomized controlled trials with high internal validity, pharmacotherapy using acamprosate, naltrexone, and, to a somewhat lesser extent, disulfiram has proved effective in preventing relapse in patients with alcohol use disorders (AUD). There remains, however, a paucity of studies with sufficient external validity in which the effectiveness of pharmacotherapy in clinical practice is investigated. This study aimed to make a contribution to close this gap in research.
Methods and representativeness of a European survey in children and adolescents: the KIDSCREEN study
Resumo:
BACKGROUND: The objective of the present study was to compare three different sampling and questionnaire administration methods used in the international KIDSCREEN study in terms of participation, response rates, and external validity. METHODS: Children and adolescents aged 8-18 years were surveyed in 13 European countries using either telephone sampling and mail administration, random sampling of school listings followed by classroom or mail administration, or multistage random sampling of communities and households with self-administration of the survey materials at home. Cooperation, completion, and response rates were compared across countries and survey methods. Data on non-respondents was collected in 8 countries. The population fraction (PF, respondents in each sex-age, or educational level category, divided by the population in the same category from Eurostat census data) and population fraction ratio (PFR, ratio of PF) and their corresponding 95% confidence intervals were used to analyze differences by country between the KIDSCREEN samples and a reference Eurostat population. RESULTS: Response rates by country ranged from 18.9% to 91.2%. Response rates were highest in the school-based surveys (69.0%-91.2%). Sample proportions by age and gender were similar to the reference Eurostat population in most countries, although boys and adolescents were slightly underrepresented (PFR <1). Parents in lower educational categories were less likely to participate (PFR <1 in 5 countries). Parents in higher educational categories were overrepresented when the school and household sampling strategies were used (PFR = 1.78-2.97). CONCLUSION: School-based sampling achieved the highest overall response rates but also produced slightly more biased samples than the other methods. The results suggest that the samples were sufficiently representative to provide reference population values for the KIDSCREEN instrument.