63 results for Test reliability


Relevance:

30.00%

Publisher:

Abstract:

Swiftlets are small insectivorous birds, many of which nest in caves and are known to echolocate. Due to a lack of distinguishing morphological characters, the taxonomy of swiftlets is primarily based on the presence or absence of echolocating ability, together with nest characters. To test the reliability of these behavioral characters, we constructed an independent phylogeny using cytochrome b mitochondrial DNA sequences from swiftlets and their relatives. This phylogeny is broadly consistent with the higher classification of swifts but does not support the monophyly of swiftlets. Echolocating swiftlets (Aerodramus) and the nonecholocating "giant swiftlet" (Hydrochous gigas) group together, but the remaining nonecholocating swiftlets belonging to Collocalia are not sister taxa to these swiftlets. While echolocation may be a synapomorphy of Aerodramus (perhaps secondarily lost in Hydrochous), no character of Aerodramus nests showed a statistically significant fit to the molecular phylogeny, indicating that nest characters are not phylogenetically reliable in this group.

Relevance:

30.00%

Publisher:

Abstract:

Interobserver reliability for the classification of proximal humeral fractures is limited. The aim of this study was to test the null hypothesis that interobserver reliability of the AO classification of proximal humeral fractures, the preferred treatment, and fracture characteristics is the same for two-dimensional (2-D) and three-dimensional (3-D) computed tomography (CT). Members of the Science of Variation Group (fully trained practicing orthopaedic and trauma surgeons from around the world) were randomized to evaluate radiographs and either 2-D CT or 3-D CT images of fifteen proximal humeral fractures via a web-based survey and respond to the following four questions: (1) Is the greater tuberosity displaced? (2) Is the humeral head split? (3) Is the arterial supply compromised? (4) Is the glenohumeral joint dislocated? They also classified the fracture according to the AO system and indicated their preferred treatment of the fracture (operative or nonoperative). Agreement among observers was assessed with use of the multirater kappa (κ) measure. Interobserver reliability of the AO classification, fracture characteristics, and preferred treatment generally ranged from "slight" to "fair." A few small but statistically significant differences were found. Observers randomized to the 2-D CT group had slightly but significantly better agreement on displacement of the greater tuberosity (κ = 0.35 compared with 0.30, p < 0.001) and on the AO classification (κ = 0.18 compared with 0.17, p = 0.018). A subgroup analysis of the AO classification results revealed that shoulder and elbow surgeons, orthopaedic trauma surgeons, and surgeons in the United States had slightly greater reliability on 2-D CT, whereas surgeons in practice for ten years or less and surgeons from other subspecialties had slightly greater reliability on 3-D CT.
Proximal humeral fracture classifications may be helpful conceptually, but they have poor interobserver reliability even when 3-D rather than 2-D CT is utilized. This may contribute to the similarly poor interobserver reliability that was observed for selection of the treatment for proximal humeral fractures. The lack of a reliable classification confounds efforts to compare the outcomes of treatment methods among different clinical trials and reports.
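
The abstract above does not say which multirater kappa variant was used. As a minimal sketch, assuming a Fleiss-type kappa and assuming the quoted labels "slight" and "fair" follow the Landis and Koch bands, the agreement statistic could be computed as follows. The function names and the input layout (a table of category counts per fracture) are illustrative choices, not taken from the study.

```python
from typing import Sequence


def fleiss_kappa(counts: Sequence[Sequence[int]]) -> float:
    """Fleiss' kappa; counts[i][j] = number of raters who put subject i in category j.

    Every subject must be rated by the same number of raters.
    Raises ZeroDivisionError if all ratings fall into a single category.
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("every subject needs the same number of ratings")
    n_categories = len(counts[0])

    # Share of all ratings that fall into each category.
    p_cat = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]
    # Observed pairwise agreement per subject, averaged over subjects.
    p_obs = sum(
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ) / n_subjects
    # Agreement expected by chance.
    p_exp = sum(p * p for p in p_cat)
    return (p_obs - p_exp) / (1 - p_exp)


def landis_koch_label(kappa: float) -> str:
    """Verbal label for a kappa value (Landis and Koch bands)."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"), (0.60, "moderate"),
                         (0.80, "substantial"), (1.00, "almost perfect")]:
        if kappa <= upper:
            return label
    return "almost perfect"
```

Under these assumptions, the reported κ = 0.35 for tuberosity displacement maps to "fair" and κ = 0.18 for the AO classification maps to "slight", which matches the range the abstract describes.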

Relevance:

30.00%

Publisher:

Abstract:

The primary aim of this study was to develop and validate a golf-specific approach-iron test for use with elite and high-level amateur golfers. Elite (n=26) and high-level amateur (n=23) golfers were recruited for this study. The ‘Approach-Iron Skill Test’ requires players to hit a total of 27 shots. Specifically, three shots are hit at each of nine targets on a specially constructed driving range in a randomised order. A real-time launch monitor positioned behind the player measured the carry distance for each of these shots. A scoring system was developed based on the percentage error index of each shot, meaning that 81 points was the maximum score possible (with a maximum of three points per shot). Two rounds of the test were performed. For both rounds of the test, elite-level golfers scored significantly higher than their high-level amateur counterparts (56.3±5.6 and 58.5±4.6 points versus 46.0±6.3 and 46.1±6.7 points, respectively) (P<0.05). For both elite and high-level players, 95% limits of agreement statistics also indicated that the test showed good test–retest reliability (2.1±7.9 and 0.2±10.8, respectively). Due to the clinimetric properties of the test, we conclude that the Approach-Iron Skill Test is suitable for further examination with the players examined in this study.
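
The limits-of-agreement figures in this abstract (for example 2.1±7.9) read as a mean round-to-round difference plus or minus a half-width. That reading is an assumption, as is the use of the common Bland-Altman form (mean difference ± 1.96 standard deviations of the differences). A minimal sketch with hypothetical names:

```python
import statistics
from typing import Sequence, Tuple


def limits_of_agreement(round1: Sequence[float],
                        round2: Sequence[float]) -> Tuple[float, float, float]:
    """Return (bias, lower limit, upper limit) for paired test-retest scores.

    bias is the mean of (round2 - round1); the limits are bias +/- 1.96 times
    the sample standard deviation of the paired differences.
    """
    if len(round1) != len(round2) or len(round1) < 2:
        raise ValueError("need at least two paired scores")
    diffs = [b - a for a, b in zip(round1, round2)]
    bias = statistics.mean(diffs)
    half_width = 1.96 * statistics.stdev(diffs)
    return bias, bias - half_width, bias + half_width
```

Read this way, a result such as 2.1±7.9 on an 81-point scale would mean that a player's second-round score is expected to fall roughly between 5.8 points below and 10.0 points above the first-round score.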

Relevance:

30.00%

Publisher:

Abstract:

Despite a recent increase in the amount of research investigating performance in golf, a comprehensive putting skill test has not been reported in the peer-reviewed literature. In this study, the Golf Australia Putting Test (GAPT) was developed and a series of measurement properties were assessed. Elite (n = 18) and high-level amateur (HLA; n = 22) participants completed six single putts from various areas on six concentric circles (circle radii = 0.9, 1.5, 3.0, 4.6, 6.1 and 7.6 m). Using a scoring system that rewarded participants for holing putts from longer distances, the maximum score from a single round of the test (i.e. 36 putts) was 27 points. After two rounds of the test were completed by all players, a subsample of participants (elite, n = 15; HLA, n = 7) had their putting performance recorded during tournament play for a period of 90 days to assess criterion (predictive) validity of the test. The reliability, sensitivity and discriminative validity of the GAPT were also assessed. Better agreement between Rounds 1 and 2 scores was noted in the elite group, whilst reliability values were similar for both groups. Further, the GAPT scores were shown to predict players from the elite and high-ability groups with a low classification error. An equation for predicting on-course performance from GAPT scores was also developed. Findings from this study indicate that the GAPT is a valid and reliable tool for high-level players and the GAPT may be used for player evaluation in the field.

Relevance:

30.00%

Publisher:

Abstract:

No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ).

Relevance:

30.00%

Publisher:

Abstract:

To develop a valid and reliable video-based decision-making test to examine and monitor the decision-making performance of Australian football umpires.

Relevance:

30.00%

Publisher:

Abstract:

Many difficulties exist in directly following the static recrystallization of metals, particularly during hot working. Indirect measurement of static recrystallization has been extensively performed in the literature where, for example, the recrystallization behavior of austenite in steels has commonly been measured indirectly using the fractional softening method. This method relies on the yield stress changes during recrystallization which are physically simulated by hot torsion or compression tests. However, the inherent heterogeneity of deformation during a mechanical test leads to a non-uniform static recrystallization distribution in the test sample. This, in turn, poses a serious question concerning the reliability of the measurement since the stress calculation techniques during recrystallization are not adequately developed in the existing literature. This paper develops a computer-based method to account for heterogeneous deformation during fractional softening measurements based on the hot torsion test data. The importance of the fractional softening gradient in determining the kinetics is emphasized and deficiencies in our understanding of the basic mechanisms are highlighted. A computer-based method is introduced to generate the experimental and computational components in a cost function. The cost function is then utilized by an inverse solution to calibrate the design parameters in a static recrystallization model.
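
For orientation, the fractional softening measure is commonly defined from a two-pass (double-hit) test as the drop in yield stress between passes relative to the work hardening accumulated in the first pass. The sketch below uses that common textbook definition; the abstract does not give the authors' exact formulation, and the function and parameter names are hypothetical.

```python
def fractional_softening(sigma_m: float, sigma_y1: float, sigma_y2: float) -> float:
    """Fractional softening X = (sigma_m - sigma_y2) / (sigma_m - sigma_y1).

    sigma_m  : flow stress at the end of the first pass (before unloading)
    sigma_y1 : yield stress of the first pass
    sigma_y2 : yield stress on reloading after the hold time
    X = 0 means no softening; X = 1 means the yield stress has fully recovered.
    """
    if sigma_m == sigma_y1:
        raise ValueError("no work hardening in the first pass; X is undefined")
    return (sigma_m - sigma_y2) / (sigma_m - sigma_y1)
```

This single-value form treats the sample as uniformly deformed, which is the simplification the abstract questions for torsion specimens with heterogeneous strain.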

Relevance:

30.00%

Publisher:

Abstract:

The detection of avian viruses in wild populations has considerable conservation implications. For DNA-based studies, feathers may be a convenient sample type for virus screening, and feather sampling is therefore an increasingly common technique. This is despite recent concerns about DNA quality, ethics, and a paucity of data comparing the reliability and sensitivity of feather sampling to other common sample types such as blood. Alternatively, skeletal muscle tissue may offer a convenient sample to collect from dead birds, which may reveal viraemia. Here, we describe a probe-based quantitative real-time PCR for the relative quantification of beak and feather disease virus (BFDV), a pathogen of serious conservation concern for parrots globally. We used this method to test for BFDV in wild crimson rosellas (Platycercus elegans), and compared three different sample types. We detected BFDV in samples from 29 out of 84 individuals (34.5%). However, feather samples provided discordant results concerning virus presence when compared with muscle tissue and blood, and estimates of viral load varied somewhat between different sample types. This study provides evidence for widespread infection of BFDV in wild crimson rosellas, but highlights the importance of sample type when generating and interpreting qualitative and quantitative avian virus data.

Relevance:

30.00%

Publisher:

Abstract:

Purpose: Assessing health-related quality of life (HRQoL) via Computerized Adaptive Tests (CAT) provides greater measurement precision coupled with a lower test burden compared to conventional tests. Currently, there are no European pediatric HRQoL CATs available. This manuscript aims to describe the development of an HRQoL CAT for children and adolescents: the Kids-CAT, which was developed based on the established KIDSCREEN-27 HRQoL domain structure. Methods: The Kids-CAT was developed combining classical test theory and item response theory methods and using large archival data of European KIDSCREEN norm studies (n = 10,577–19,580). Methods were applied in line with the US PROMIS project. Item bank development included the investigation of unidimensionality, local independence, exploration of Differential Item Functioning (DIF), evaluation of Item Response Curves (IRCs), estimation and norming of item parameters as well as first CAT simulations. Results: The Kids-CAT was successfully built covering five item banks (with 26–46 items each) to measure physical well-being, psychological well-being, parent relations, social support and peers, and school well-being. The Kids-CAT item banks showed excellent psychometric properties: high content validity, unidimensionality, local independence, low DIF, and model-conforming IRCs. In CAT simulations, seven items were needed to achieve a measurement precision between .8 and .9 (reliability). It has a child-friendly design, is easily accessible online and gives immediate feedback reports of scores. Conclusions: The Kids-CAT has the potential to advance pediatric HRQoL measurement by making it less burdensome and enhancing the patient–doctor communication.
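
Adaptive tests usually stop once the standard error of the trait estimate falls below a target, so a reliability band such as .8 to .9 can be read as a standard-error band. The sketch below uses the standard relation reliability = 1 − SE² / trait variance. It assumes the trait is scaled to unit variance, and the abstract does not state which conversion the authors actually used. Function names are hypothetical.

```python
import math


def reliability_from_se(se: float, trait_variance: float = 1.0) -> float:
    """Reliability implied by a standard error of the trait estimate."""
    return 1.0 - (se * se) / trait_variance


def se_for_reliability(reliability: float, trait_variance: float = 1.0) -> float:
    """Standard error a CAT would need to reach for a target reliability."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    return math.sqrt(trait_variance * (1.0 - reliability))
```

Under the unit-variance assumption, reliabilities of .8 and .9 correspond to standard errors of about 0.45 and 0.32 on the trait scale.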

Relevance:

30.00%

Publisher:

Abstract:

Purpose: To evaluate the reliability, agreement and smallest detectable change in a measurement instrument for pain and function in knee osteoarthritis: the Dynamic weight-bearing Assessment of Pain (DAP). Methods: The sample size was set to 20 persons, recruited from the outpatient osteoarthritis clinic at Frederiksberg Hospital, Copenhagen. Two physiotherapists tested all participants during two visits; at the first visit, one single DAP (including four scores) was conducted by rater one; at the second visit, DAP was conducted by both raters one and two in randomized order with concealed allocation. The time interval was approximately 1.5 h. Measurement error was estimated by standard error of measurement (SEM). The intra- and inter-rater reliability was estimated by Intra-class Correlation Coefficients for agreement based on a two-way ANOVA with random effects (single measures, ICC(2,1)). Smallest detectable change (SDC) and limits of agreement were calculated. Results: The pain score showed excellent reliability in terms of ICC (intra-rater 0.93, CI 0.83–0.97, inter-rater 0.91, CI 0.78–0.96), low SEM (intra-rater 0.70, inter-rater 0.86, on a scale from 0 to 10), and acceptable SDC for intra-rater test (1.95). The three knee bend scores all had ICC above 0.50, showing fair-to-good reliability. None of the knee bend scores showed acceptable SEM and SDC. Conclusions: The reproducibility of the DAP pain score meets the demands for use in clinical practice and research. The total knee bend could be useful for motivational purposes in clinical use. Testing of other psychometric properties of the DAP is pending.
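
The two statistics named in this abstract can be sketched as follows. The ICC function implements the standard two-way random-effects, absolute-agreement, single-measures form (ICC(2,1) in the Shrout and Fleiss notation). The SDC function uses the common convention SDC = 1.96 × √2 × SEM, which is an assumption because the abstract does not state the formula. Names and data layout are illustrative.

```python
import math
from typing import Sequence


def icc_2_1(scores: Sequence[Sequence[float]]) -> float:
    """ICC(2,1); scores[i][j] = score of subject i from rater (or session) j."""
    n = len(scores)        # subjects
    k = len(scores[0])     # raters or sessions
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


def smallest_detectable_change(sem: float) -> float:
    """SDC at the 95% level, assuming SDC = 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2) * sem
```

With the reported intra-rater SEM of 0.70, this convention gives an SDC of about 1.94, close to the 1.95 in the abstract; the small gap would be consistent with the SEM having been rounded before reporting.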

Relevance:

30.00%

Publisher:

Abstract:

Background: It has been suggested that young people should develop competence in a variety of ‘lifelong physical activities’ to ensure that they can be active across the lifespan. Objective: The primary aim of this systematic review is to report the methodological properties, validity, reliability, and test duration of field-based measures that assess movement skill competency in lifelong physical activities. A secondary aim was to clearly define those characteristics unique to lifelong physical activities. Data Sources: A search of four electronic databases (Scopus, SPORTDiscus, ProQuest, and PubMed) was conducted between June 2014 and April 2015 with no date restrictions. Study Selection: Studies addressing the validity and/or reliability of lifelong physical activity tests were reviewed. Included articles were required to assess lifelong physical activities using process-oriented measures, as well as report either one type of validity or reliability. Study Appraisal and Synthesis Methods: Assessment criteria for methodological quality were adapted from a checklist used in a previous review of sport skill outcome assessments. Results: Movement skill assessments for eight different lifelong physical activities (badminton, cycling, dance, golf, racquetball, resistance training, swimming, and tennis) in 17 studies were identified for inclusion. Methodological quality, validity, reliability, and test duration (time to assess a single participant), for each article were assessed. Moderate to excellent reliability results were found in 16 of 17 studies, with 71 % reporting inter-rater reliability and 41 % reporting intra-rater reliability. Only four studies in this review reported test–retest reliability. Ten studies reported validity results; content validity was cited in 41 % of these studies. Construct validity was reported in 24 % of studies, while criterion validity was only reported in 12 % of studies. 
Limitations: Numerous assessments for lifelong physical activities may exist, yet only assessments for eight lifelong physical activities were included in this review. Generalizability of results may be more applicable if more heterogeneous samples are used in future research. Conclusion: Moderate to excellent levels of inter- and intra-rater reliability were reported in the majority of studies. However, future work should look to establish test–retest reliability. Validity was less commonly reported than reliability, and further types of validity other than content validity need to be established in future research. Specifically, predictive validity of ‘lifelong physical activity’ movement skill competency is needed to support the assertion that such activities provide the foundation for a lifetime of activity.

Relevance:

30.00%

Publisher:

Abstract:

We adapted/developed and examined the test–retest reliability and internal consistency of eight parent-report measures of home and neighborhood environmental correlates of physical activity appropriate for Chinese preschool-aged children and their parents/primary caregivers living in densely populated urban environments. This study consisted of a qualitative (cognitive interviews) and a quantitative (test–retest reliability) component. Chinese versions of the measures were pilot-tested on 20 parents of Hong Kong preschool-aged children using cognitive interviews. Measures were then administered to 61 parents twice, 1 week apart. Test–retest reliability and internal consistency were computed. Except for two items, the test–retest reliability of items and scale summary scores ranged from moderate to excellent. The internal consistency of the measures exceeded recommended minimal values (Cronbach’s α > .70). The parent-report measures examined in this study are potentially appropriate for use in investigations of environmental correlates of the physical activity of Chinese preschool-aged children living in densely populated urban environments. However, their predictive validity with respect to Chinese preschool-aged children’s physical activity needs to be assessed in future studies.
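
Internal consistency in this abstract is summarised with Cronbach's α. A minimal sketch of the standard formula, α = k/(k − 1) × (1 − Σ item variances / variance of the total score), is given below. The function name and the respondent-by-item input layout are illustrative choices.

```python
import statistics
from typing import Sequence


def cronbach_alpha(items: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha; items[i][j] = response of respondent i to item j."""
    k = len(items[0])
    if k < 2 or len(items) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_vars = [statistics.variance([row[j] for row in items]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = statistics.variance([sum(row) for row in items])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

A scale would meet the threshold quoted in the abstract when this value exceeds .70.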

Relevance:

30.00%

Publisher:

Abstract:

Mental health triage scales are clinical tools used at point of entry to specialist mental health service to provide a systematic way of categorizing the urgency of clinical presentations, and determining an appropriate service response and an optimal timeframe for intervention. The aim of the present study was to test the interrater reliability of a mental health triage scale developed for use in UK mental health triage and crisis services. An interrater reliability study was undertaken. Triage clinicians from England and Wales (n = 66) used the UK Mental Health Triage Scale (UK MHTS) to rate the urgency of 21 validated mental health triage scenarios derived from real occasions of triage. Interrater reliability was calculated using Kendall's coefficient of concordance (w) and intraclass correlation coefficient (ICC) statistics. The average ICC was 0.997 (95% confidence interval (CI): 0.996-0.999; F(20, 1300) = 394.762, P < 0.001). The single measure ICC was 0.856 (95% CI: 0.776-0.926; F(20, 1300) = 394.762, P < 0.001). The overall Kendall's w was 0.88 (P < 0.001). The UK MHTS shows substantial levels of interrater reliability. Reliable mental health triage scales employed within effective mental health triage systems offer possibilities for not only improved patient outcomes and experiences, but also for efficient use of finite specialist mental health services.
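
Kendall's coefficient of concordance measures how consistently several raters order the same set of items. Because a triage scale has few categories, many scenarios receive the same rating from a given clinician, so the sketch below uses average ranks and the standard tie correction. Whether the study applied a tie correction is not stated, and the names and input layout are illustrative.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks, with tied values sharing the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of positions i+1 .. j+1
        for pos in range(i, j + 1):
            ranks[order[pos]] = shared
        i = j + 1
    return ranks


def kendalls_w(ratings: Sequence[Sequence[float]]) -> float:
    """Kendall's W; ratings[r][s] = rating given by rater r to scenario s.

    Raises ZeroDivisionError if every rater gives all scenarios the same rating.
    """
    m = len(ratings)      # raters
    n = len(ratings[0])   # scenarios
    rank_rows = [average_ranks(row) for row in ratings]
    totals = [sum(row[s] for row in rank_rows) for s in range(n)]
    mean_total = m * (n + 1) / 2
    s_stat = sum((t - mean_total) ** 2 for t in totals)

    # Tie correction: sum of (t^3 - t) over each rater's groups of tied ratings.
    tie_term = 0
    for row in ratings:
        group_sizes = {}
        for v in row:
            group_sizes[v] = group_sizes.get(v, 0) + 1
        tie_term += sum(t ** 3 - t for t in group_sizes.values())

    return 12 * s_stat / (m * m * (n ** 3 - n) - m * tie_term)
```

For the design in the abstract, `ratings` would have 66 rows (clinicians) and 21 columns (scenarios); W ranges from 0 (no concordance) to 1 (identical orderings).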

Relevance:

30.00%

Publisher:

Abstract:

Objective: This paper describes the process employed to adapt the Problem Gambling Severity Index (PGSI) for use with Indigenous Australian populations. Methods: This study comprised a two-stage process: an initial consultation with Indigenous health workers, informing the textual and conceptual adaptation of items, followed by trial of the adjusted instrument with Indigenous community members (n=301). Results: Internal reliability was demonstrated: Australian Indigenous Problem Gambling Index (AIPGI) Cronbach's alpha α = 0.92 (Original PGSI, α = 0.84). Item-rest correlations confirmed that responses to items were consistent and related to the total score of remaining items. The AIPGI could predict gambling severity based on gambling frequency, when controlling for age and gender (OR=1.28, 95% CI 1.17–1.40). Conclusions: The adapted instrument is accessible to a cross-section of Indigenous Australians and has demonstrated properties of reliability and validity. An extended trial is needed to test the application of the instrument to a broader Indigenous audience and to further explore and confirm psychometric properties of the adapted instrument. Implications: This study introduces a culturally adapted tool for measuring rates of disordered gambling among Indigenous Australians.
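
An item-rest correlation, as mentioned in this abstract, correlates each item with the sum of all the other items, so that an item is not correlated with itself. A minimal sketch using Pearson's r follows. The function names and the respondent-by-item layout are illustrative, and the abstract does not say which correlation coefficient the authors used.

```python
import math
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; raises ZeroDivisionError if either input is constant."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def item_rest_correlations(items: Sequence[Sequence[float]]) -> List[float]:
    """items[i][j] = response of respondent i to item j; one r per item."""
    k = len(items[0])
    result = []
    for j in range(k):
        item = [row[j] for row in items]
        rest = [sum(row) - row[j] for row in items]  # total score without item j
        result.append(pearson(item, rest))
    return result
```

Items with low or negative values would be candidates for rewording or removal, since they do not track the rest of the scale.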