99 results for Intraclass Correlation Coefficient

at Université de Lausanne, Switzerland


Relevance: 100.00%

Abstract:

RATIONALE AND OBJECTIVES: Dose reduction may compromise patient care because image quality decreases. Therefore, the dose savings achievable with new dose-reduction techniques need to be thoroughly assessed. To avoid repeated studies in one patient, chest computed tomography (CT) scans at different dose levels were performed in cadavers, comparing model-based iterative reconstruction (MBIR) as a tool to enhance image quality with current standard full-dose imaging. MATERIALS AND METHODS: Twenty-five human cadavers were scanned (CT HD750) after contrast medium injection at decreasing dose levels D0-D5, and each scan was reconstructed with MBIR. The data at the full-dose level, D0, were additionally reconstructed with standard adaptive statistical iterative reconstruction (ASIR), which represented the full-dose baseline reference (FDBR). Two radiologists independently compared image quality (IQ) in 3-mm multiplanar reformations for soft-tissue evaluation of D0-D5 to FDBR (-2, diagnostically inferior; -1, inferior; 0, equal; +1, superior; and +2, diagnostically superior). For statistical analysis, the intraclass correlation coefficient (ICC) and the Wilcoxon test were used. RESULTS: Mean CT dose index values (mGy) were as follows: D0/FDBR = 10.1 ± 1.7, D1 = 6.2 ± 2.8, D2 = 5.7 ± 2.7, D3 = 3.5 ± 1.9, D4 = 1.8 ± 1.0, and D5 = 0.9 ± 0.5. Mean IQ ratings were as follows: D0 = +1.8 ± 0.2, D1 = +1.5 ± 0.3, D2 = +1.1 ± 0.3, D3 = +0.7 ± 0.5, D4 = +0.1 ± 0.5, and D5 = -1.2 ± 0.5. All values differed significantly from baseline (P < .05), except mean IQ for D4 (P = .61). ICC was 0.91. CONCLUSIONS: Compared to ASIR, MBIR allowed for a significant dose reduction of 82% without impairment of IQ. This resulted in a calculated mean effective dose below 1 mSv.
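
The 82% figure in the conclusions appears to correspond to the D4 level (the lowest dose whose mean IQ did not differ significantly from baseline) compared with D0/FDBR. A minimal Python sketch of that arithmetic, using the mean values reported above; the function and variable names are illustrative and not taken from the study.

```python
# Mean CT dose index values (mGy) as reported in the abstract above.
mean_ctdi_mgy = {"D0": 10.1, "D1": 6.2, "D2": 5.7, "D3": 3.5, "D4": 1.8, "D5": 0.9}


def dose_reduction_percent(reduced: float, baseline: float) -> float:
    """Relative dose reduction versus the baseline dose, in percent."""
    return 100.0 * (baseline - reduced) / baseline


# Reduction of each dose level relative to the full-dose level D0/FDBR.
reductions = {
    level: round(dose_reduction_percent(dose, mean_ctdi_mgy["D0"]), 1)
    for level, dose in mean_ctdi_mgy.items()
}

print(reductions["D4"])  # reduction at the D4 level, in percent
```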

Abstract:

OBJECTIVE: To determine the psychometric properties of an adapted version of the Falls Efficacy Scale (FES) in older rehabilitation patients. DESIGN: Cross-sectional survey. SETTING: Postacute rehabilitation facility in Switzerland. PARTICIPANTS: Seventy elderly persons aged 65 years and older receiving postacute, inpatient rehabilitation. INTERVENTIONS: Not applicable. MAIN OUTCOME MEASURES: FES questions asked about subject's confidence (range, 0 [none]-10 [full]) in performing 12 activities of daily living (ADLs) without falling. Construct validity was assessed using correlation with measures of physical (basic ADLs [BADLs]), cognitive (Mini-Mental State Examination [MMSE]), affective (15-item Geriatric Depression Scale [GDS]), and mobility (Performance Oriented Mobility Assessment [POMA]) performance. Predictive validity was assessed using the length of rehabilitation stay as the outcome. To determine test-retest reliability, FES administration was repeated in a random subsample (n=20) within 72 hours. RESULTS: FES scores ranged from 10 to 120 (mean, 88.7+/-26.5). Internal consistency was optimal (Cronbach alpha=.90), and item-to-total correlations were all significant, ranging from .56 (toilet use) to .82 (reaching into closets). Test-retest reliability was high (intraclass correlation coefficient, .97; 95% confidence interval, .95-.99; P<.001). Subjects reporting a fall in the previous year had lower FES scores than nonfallers (85.0+/-25.2 vs 94.4+/-27.9, P=.054). The FES correlated with POMA (Spearman rho=.40, P<.001), MMSE (rho=.37, P=.001), BADL (rho=.43, P<.001), and GDS (rho=-.53, P<.001) scores. These relationships remained significant in multivariable analysis for BADLs and GDS, confirming FES construct validity. There was a significant inverse relationship between FES score and the length of rehabilitation stay, independent of sociodemographic, functional, cognitive, and fall status. 
CONCLUSIONS: This adapted FES is reliable and valid in older patients undergoing postacute rehabilitation. The independent association between poor falls efficacy and increased length of stay has not been previously described and needs further investigation.
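
Internal consistency is reported above as Cronbach's alpha. As a hedged illustration of how that statistic is usually computed (alpha = k/(k-1) × (1 − sum of item variances / variance of total scores)), here is a small Python sketch. The ratings are hypothetical and use only three items for brevity; they are not data from the study.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha; scores[s][i] is the rating of subject s on item i."""
    k = len(scores[0])  # number of items
    # Sample variance of each item across subjects.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the per-subject total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical confidence ratings (0-10) of five subjects on three activities.
example = [
    [10, 9, 10],
    [7, 6, 8],
    [4, 5, 3],
    [9, 9, 8],
    [2, 3, 2],
]
alpha = cronbach_alpha(example)
print(round(alpha, 3))
```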

Abstract:

BACKGROUND: Anterior shoulder stabilization surgery with the arthroscopic Bankart procedure can have a high recurrence rate in certain patients. Identifying these patients to modify outcomes has become a focal point of research. PURPOSE: The Instability Shoulder Index Score (ISIS) was developed to predict the success of arthroscopic Bankart repair. Scores range from 0 to 10, with higher scores predicting a higher risk of recurrence after stabilization. The interobserver reliability of the score is not known. STUDY DESIGN: Cohort study (diagnosis); Level of evidence, 2. METHODS: This is a prospective multicenter (North America and Europe) study of patients suffering from shoulder instability and waiting for stabilization surgery. Five pairs of independent evaluators were asked to score patient instability severity with the ISIS. Patients also completed functional scores (Western Ontario Shoulder Instability Index [WOSI], Disabilities of the Arm, Shoulder and Hand-short version [QuickDASH], and Walch-Duplay test). Data on age, sex, number of dislocations, and type of surgery were collected. The test-retest method and intraclass correlation coefficient (ICC: >0.75 = good, >0.85 = very good, and >0.9 = excellent) were used for analysis. RESULTS: A total of 114 patients with anterior shoulder instability were included, of whom 89 (78%) were men. The mean age was 28 years. The ISIS was very reliable, with an ICC of 0.933. The mean number of dislocations per patient was higher in patients who had an ISIS of ≥6 (25 vs 14; P = .05). Patients who underwent more complex arthroscopic procedures such as Hill-Sachs remplissage or open Latarjet had higher preoperative ISIS scores (mean, 4.8 vs 3.4; P = .002). There was no correlation between the ISIS and the quality-of-life questionnaires, with P values for the Pearson correlations all >.05 (WOSI, P = .39; QuickDASH, P = .97; Walch-Duplay, P = .08).
CONCLUSION: Our results show that the ISIS is reliable when used in a multicenter study with anterior traumatic instability populations. There was no correlation between the ISIS and the quality-of-life questionnaires, but surgical decisions reflected its increased use.
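
The ICC cut-offs stated in the methods above (>0.75 good, >0.85 very good, >0.9 excellent) can be written as a small lookup. This is only a sketch of that stated scale; the function name and the label for values at or below 0.75 are assumptions, since the abstract does not name that range.

```python
def interpret_icc(icc: float) -> str:
    """Map an ICC to the qualitative labels used in the abstract above."""
    if icc > 0.9:
        return "excellent"
    if icc > 0.85:
        return "very good"
    if icc > 0.75:
        return "good"
    # Not defined in the abstract; hypothetical label for the remaining range.
    return "below good"


print(interpret_icc(0.933))  # the ISIS value reported above
```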

Abstract:

BACKGROUND: Straylight gives the appearance of a veil of light thrown over a person's retinal image when there is a strong light source present. We examined the reproducibility of the measurements by C-Quant, and assessed its correlation to characteristics of the eye and subjects' age. PARTICIPANTS AND METHODS: Five repeated straylight measurements were taken using the dominant eye of 45 healthy subjects (age 21-59) with a BCVA of 20/20: 14 emmetropic, 16 myopic, eight hyperopic and seven with astigmatism. We assessed the extent of reproducibility of straylight measures using the intraclass correlation coefficient. RESULTS: The mean straylight value of all measurements was 1.01 (SD 0.23, median 0.97, interquartile range 0.85-1.1). Per 10 years of age, straylight increased on average by 0.10 (95% CI 0.04 to 0.16, p < 0.01). We found no independent association of refraction (range -5.25 dpt to +2 dpt) with straylight values (0.001; 95% CI -0.022 to 0.024, p = 0.92). Compared to emmetropic subjects, myopia reduced straylight (-.011; -0.024 to 0.02, p = 0.11), whereas higher straylight values (0.09; -0.01 to 0.20, p = 0.09) were observed in subjects with blue irises as compared to dark-colored irises when correcting for age. The intraclass correlation coefficient (ICC) of repeated measurements was 0.83 (95% CI 0.76 to 0.90). CONCLUSIONS: Our study showed that straylight measurements with the C-Quant had a high reproducibility, i.e. a lack of large intra-observer variability, making it appropriate for long-term follow-up studies assessing the long-term effect of surgical procedures on the quality of vision.
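
The abstract does not say which ICC form was used for the repeated measurements. Purely as an illustration, the sketch below computes the one-way random-effects ICC(1,1) from the between-subject and within-subject mean squares; the choice of this form and the sample values are assumptions, not details from the study.

```python
from statistics import mean


def icc_oneway(data: list[list[float]]) -> float:
    """One-way random-effects ICC(1,1); data[s] holds the k repeats of subject s."""
    n = len(data)      # number of subjects
    k = len(data[0])   # repeated measurements per subject
    grand = mean(x for row in data for x in row)
    row_means = [mean(row) for row in data]
    # Between-subject mean square.
    msb = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    # Within-subject mean square.
    msw = sum(
        (x - m) ** 2 for row, m in zip(data, row_means) for x in row
    ) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical straylight-like values: three subjects, three repeats each.
sample = [
    [0.90, 1.00, 0.95],
    [1.20, 1.25, 1.15],
    [0.80, 0.85, 0.80],
]
print(round(icc_oneway(sample), 2))
```

An ICC close to 1 means that most of the variance lies between subjects rather than between repeats on the same subject.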

Abstract:

The clinical demand for a device to monitor Blood Pressure (BP) in ambulatory scenarios with minimal use of inflation cuffs is increasing. Based on the so-called Pulse Wave Velocity (PWV) principle, this paper introduces and evaluates a novel concept of BP monitor that can be fully integrated within a chest sensor. After a preliminary calibration, the sensor provides non-occlusive beat-by-beat estimations of Mean Arterial Pressure (MAP) by measuring the Pulse Transit Time (PTT) of arterial pressure pulses travelling from the ascending aorta towards the subcutaneous vasculature of the chest. In a cohort of 15 healthy male subjects, a total of 462 simultaneous readings consisting of reference MAP and chest PTT were acquired. Each subject was recorded on three different days: D, D+3 and D+14. Overall, the implemented protocol induced MAP values ranging from 80 ± 6 mmHg at baseline to 107 ± 9 mmHg during isometric handgrip maneuvers. Agreement between reference and chest-sensor MAP values was tested by using the intraclass correlation coefficient (ICC = 0.78) and Bland-Altman analysis (mean error = 0.7 mmHg, standard deviation = 5.1 mmHg). The cumulative percentage of MAP values provided by the chest sensor falling within ±5 mmHg of the reference MAP readings was 70%, within ±10 mmHg was 91%, and within ±15 mmHg was 98%. These results indicate that the chest sensor complies with the British Hypertension Society (BHS) requirements for Grade A BP monitors when applied to MAP readings. Grade A performance was maintained even two weeks after the initial subject-dependent calibration. In conclusion, this paper introduces a sensor and a calibration strategy to perform MAP measurements at the chest. The encouraging performance of the presented technique paves the way towards an ambulatory-compliant, continuous and non-occlusive BP monitoring system.
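
The Grade A claim above rests on cumulative percentages of errors within ±5, ±10 and ±15 mmHg. The sketch below shows how such percentages can be computed and compared against Grade A limits. The limits of 60%, 85% and 95% are the commonly cited BHS Grade A criteria and are an assumption here, as the abstract does not list them; the error values in the usage line are hypothetical.

```python
def cumulative_within(errors: list[float], limits=(5, 10, 15)) -> dict[int, float]:
    """Percentage of absolute errors (mmHg) falling within each limit."""
    n = len(errors)
    return {lim: 100.0 * sum(abs(e) <= lim for e in errors) / n for lim in limits}


# Assumed BHS Grade A minimum percentages for the 5/10/15 mmHg limits.
BHS_GRADE_A = {5: 60.0, 10: 85.0, 15: 95.0}


def meets_grade_a(percentages: dict[int, float]) -> bool:
    """True if every cumulative percentage reaches the assumed Grade A minimum."""
    return all(percentages[lim] >= thr for lim, thr in BHS_GRADE_A.items())


# Cumulative percentages reported in the abstract above.
reported = {5: 70.0, 10: 91.0, 15: 98.0}
print(meets_grade_a(reported))

# Hypothetical per-reading errors (sensor MAP minus reference MAP, mmHg).
print(cumulative_within([1, -4, 6, 12, -20]))
```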

Abstract:

PURPOSE: To conduct a cross-cultural adaptation of the Core Outcome Measures Index (COMI) into French according to established guidelines. METHODS: Seventy outpatients with chronic low back pain were recruited from six spine centres in Switzerland and France. They completed the newly translated COMI, and the Roland Morris disability (RMQ), Dallas Pain (DPQ), adjectival pain rating scale, WHO Quality of Life, and EuroQoL-5D questionnaires. After ~14 days, the RMQ and COMI were completed again to assess reproducibility; a transition question (7-point Likert scale; "very much worse" through "no change" to "very much better") indicated any change in status since the first questionnaire. RESULTS: COMI whole scores displayed no floor effects and just 1.5% ceiling effects. The scores for the individual COMI items correlated with their corresponding full-length reference questionnaire with varying strengths of correlation (0.33-0.84, P < 0.05). COMI whole scores showed a very good correlation with the "multidimensional" DPQ global score (Rho = 0.71). Fifty-five patients (79%) returned a second questionnaire with no/minimal change in their back status. The reproducibility of individual COMI 5-point items was good, with test-retest differences within one grade ranging from 89% for 'social/work disability' to 98% for 'symptom-specific well-being'. The intraclass correlation coefficient for the COMI whole score was 0.85 (95% CI 0.76-0.91). CONCLUSIONS: The French version of this short, multidimensional questionnaire showed good psychometric properties, comparable to those reported for the German and Spanish versions. The French COMI represents a valuable tool for future multicentre clinical studies and surgical registries (e.g. SSE Spine Tango) in French-speaking countries.

Abstract:

OBJECTIVE: Surface magnetic resonance imaging (MRI) for aortic plaque assessment is limited by the trade-off between penetration depth and signal-to-noise ratio (SNR). For imaging the deep-seated aorta, a combined surface and transesophageal MRI (TEMRI) technique was developed. The aims were (1) to determine the individual contribution of TEMRI and surface coils to the combined signal, (2) to measure the signal improvement of combined surface and TEMRI over surface MRI, and (3) to assess the reproducibility of plaque dimension analysis. METHODS AND RESULTS: In 24 patients, six black blood proton-density/T2-weighted fast-spin echo images were obtained using three surface coils and one TEMRI coil for SNR measurements. Reproducibility of plaque dimensions (combined surface and TEMRI) was measured in 10 patients. TEMRI contributed 68% of the signal in the aortic arch and descending aorta, whereas the overall signal gain using the combined technique was up to 225%. Plaque volume measurements had an intraclass correlation coefficient of up to 0.97. CONCLUSION: Plaque volume measurements for the quantification of aortic plaque size are highly reproducible for combined surface and TEMRI. The TEMRI coil contributes considerably to the aortic MR signal. The combined surface and TEMRI approach improves aortic signal significantly as compared to surface coils alone. CONDENSED ABSTRACT: Conventional MRI aortic plaque visualization is limited by the penetration depth of MRI surface coils and may lead to suboptimal image quality with insufficient reproducibility. By combining a transesophageal MRI (TEMRI) coil with surface MRI coils, we enhanced local and overall image SNR for improved image quality and reproducibility.

Abstract:

INTRODUCTION. The assessment of pain in critically ill brain-injured patients is challenging for health professionals. In addition to being unable to self-report, these patients show confused and stereotyped behaviors that are likely to alter their "normal" pain responses. Therefore, the pain indicators observed in the general critically ill population may not be appropriate. OBJECTIVES. To identify behavioral and physiological indicators used by clinicians to assess pain in critically ill brain-injured patients who are unable to self-report. METHODS. A mixed-method design was used, with the first step being the combination of the results of an integrative literature review with the results of nominal groups of 12 nurses and four physicians. The second step involved a web-based survey to establish content validity. Fourteen experts (clinicians and academics) from three French-speaking European countries rated the relevance of each indicator. A content validity index (CVI) was computed for each indicator (I-CVI) and for each category (S-CVI). RESULTS. The first step generated 52 indicators. These indicators were classified into six categories: facial expressions, position/movement, muscle tension, vocalization, compliance with ventilator, and physiological indicators. In the second step, the agreement between raters was high, with an Intraclass Correlation Coefficient of 0.88 (95% CI 0.83-0.92). The I-CVIs ranged from 0.07 to 1. Indicators with an I-CVI below 0.5 (n = 12) were not retained, resulting in a final list of 30 indicators. The CVI for this final list was 0.75, with categories ranging from 0.67 (compliance with ventilation) to 0.87 (vocalization). CONCLUSIONS. This process identified specific pain indicators for critically ill brain-injured patients. Further evaluation is in progress to test the validity and relevance of these indicators in the clinical setting.
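
An item-level content validity index is usually the proportion of experts who rate an indicator as relevant. The sketch below assumes a 4-point relevance scale on which ratings of 3 or 4 count as relevant, and treats a category-level index as the mean of its I-CVIs; the abstract states neither convention, so both are assumptions. With 14 experts, a single "relevant" vote gives 1/14, which is consistent with the lowest I-CVI of 0.07 reported above.

```python
from statistics import mean


def item_cvi(ratings: list[int], relevant_from: int = 3) -> float:
    """I-CVI: share of experts rating the indicator at or above relevant_from."""
    return sum(r >= relevant_from for r in ratings) / len(ratings)


def scale_cvi(items: list[list[int]]) -> float:
    """Category-level CVI taken as the average of the item-level CVIs (assumed)."""
    return mean(item_cvi(ratings) for ratings in items)


def retained(items: dict[str, list[int]], cutoff: float = 0.5) -> list[str]:
    """Names of indicators whose I-CVI is not below the cutoff used in the abstract."""
    return [name for name, ratings in items.items() if item_cvi(ratings) >= cutoff]


# Hypothetical ratings from 14 experts for two made-up indicators.
survey = {
    "indicator_a": [4] * 12 + [2] * 2,  # 12 of 14 experts rate it relevant
    "indicator_b": [4] + [1] * 13,      # 1 of 14 experts rates it relevant
}
print(retained(survey))
```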

Abstract:

BACKGROUND: Hyperoxaluria is a major risk factor for kidney stone formation. Although urinary oxalate measurement is part of every basic stone risk assessment, there is no standardized method for this measurement. METHODS: Urine samples from 24-h urine collections covering a broad range of oxalate concentrations were aliquoted and sent, in duplicate, to six blinded international laboratories for oxalate, sodium and creatinine measurement. In a second set of experiments, ten pairs of native urine and urine spiked with 10 mg/L of oxalate were sent for oxalate measurement. Three laboratories used a commercially available oxalate oxidase kit, two laboratories used a high-performance liquid chromatography (HPLC)-based method and one laboratory used both methods. RESULTS: Intra-laboratory reliability for oxalate measurement, expressed as the intraclass correlation coefficient (ICC), varied between 0.808 [95% confidence interval (CI): 0.427-0.948] and 0.998 (95% CI: 0.994-1.000), with lower values for HPLC-based methods. Acidification of urine samples prior to analysis led to significantly higher oxalate concentrations. ICC for inter-laboratory reliability varied between 0.745 (95% CI: 0.468-0.890) and 0.986 (95% CI: 0.967-0.995). Recovery of the 10 mg/L oxalate-spiked samples varied between 8.7 ± 2.3 and 10.7 ± 0.5 mg/L. Overall, HPLC-based methods showed more variability than the oxalate oxidase kit-based methods. CONCLUSIONS: Significant variability was noted in the quantification of urinary oxalate concentration by different laboratories, which may partially explain the differences in hyperoxaluria prevalence reported in the literature. Our data stress the need for standardization of the method of oxalate measurement.

Abstract:

INTRODUCTION: Quantitative sensory testing (QST) is widely used in human research to investigate the integrity of sensory function in patients with pain of neuropathic origin or of other causes, such as low back pain. Reliability of QST has been evaluated on both sides of the face, hands and feet as well as on the trunk (Th3-L3). In order to apply these tests to other body parts such as the lower lumbar spine, it is important first to establish reliability in healthy individuals. The aim of this study was to investigate intra-rater reliability of thermal QST in healthy adults, at two sites within the L5 dermatome: the lumbar spine and the lower extremity. METHODS: Test-retest reliability of thermal QST was determined at the L5 level of the lumbar spine and in the same dermatome on the lower extremity in 30 healthy persons under 40 years of age. Results were analyzed using descriptive statistics and the intraclass correlation coefficient (ICC). Values were compared to normative data, using Z-transformation. RESULTS: Mean intraindividual differences were small for cold and warm detection thresholds but larger for pain thresholds. ICC values showed excellent reliability for warm detection and heat pain threshold, good-to-excellent reliability for cold pain threshold and fair-to-excellent reliability for cold detection threshold. The 95% confidence intervals of the ICCs were wide. CONCLUSION: In healthy adults, thermal QST on the lumbar spine and lower extremity demonstrated fair-to-excellent test-retest reliability.

Abstract:

Twitch mouth pressure (Pmo,tw) during magnetic phrenic nerve stimulation and sniff nasal inspiratory pressure (SNIP) were recently proposed as alternative noninvasive methods for assessing inspiratory muscle strength. This study aimed to compare their reproducibility with that of maximal inspiratory pressure (MIP) in normal subjects. Ten healthy subjects were studied at functional residual capacity in a semirecumbent position. Cervical magnetic phrenic nerve stimulation was performed during gentle expiration against an occlusion incorporating a small leak. Constancy of stimulation was controlled by recording the diaphragmatic electromyogram. Within- and between-session reproducibility of pressure was studied for Pmo,tw, SNIP, and MIP. The subjects were studied during a session of 10 manoeuvres repeated after 1 day and 1 month. The mean values were 16 cmH2O for Pmo,tw, 118 cmH2O for SNIP, and 115 cmH2O for MIP. For the three tests, the within-subject variation was small in relation to the between-subject variation, with the intraclass correlation coefficient ranging from 0.79 to 0.90 for Pmo,tw, 0.85 to 0.92 for SNIP, and 0.88 to 0.92 for MIP. At a 1-day interval, the coefficient of repeatability (2 SD of differences) was 3.6 cmH2O for Pmo,tw, 32 cmH2O for SNIP and 28 cmH2O for MIP. At a 1-month interval, the coefficient of repeatability was 5.8 cmH2O for Pmo,tw, 23 cmH2O for SNIP and 21 cmH2O for MIP. We conclude that the within-session reproducibility of the new tests, twitch mouth pressure and sniff nasal inspiratory pressure, is sufficient to be clinically useful. For sniff nasal inspiratory pressure, the between-session reproducibility established after 1 day was maintained after 1 month. For twitch mouth pressure, the between-session reproducibility declined slightly after 1 month. These characteristics should be considered when using these methods to follow an individual patient over time.
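
The coefficient of repeatability is defined above as 2 SD of the between-session differences. A minimal Python sketch of that definition follows; the use of the sample standard deviation and the pressure values are assumptions for illustration, not data from the study.

```python
from statistics import stdev


def coefficient_of_repeatability(first: list[float], second: list[float]) -> float:
    """Two times the sample SD of the paired differences between two sessions."""
    diffs = [a - b for a, b in zip(first, second)]
    return 2 * stdev(diffs)


# Hypothetical pressures (cmH2O) for four subjects measured in two sessions.
session_1 = [10, 12, 14, 16]
session_2 = [11, 11, 15, 15]
cr = coefficient_of_repeatability(session_1, session_2)
print(round(cr, 2))
```

Because the coefficient is expressed in the units of the measurement, it should be read against the mean value of each test: the same absolute value is a larger relative change for Pmo,tw (mean 16 cmH2O) than for SNIP or MIP.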

Abstract:

BACKGROUND: Due to the distinct cultural and language differences that exist in Switzerland, there is little information on dietary intake in the general Swiss population. Adequately assessing dietary intake is thus paramount if nutritional epidemiological studies are to be conducted. OBJECTIVE: To assess the reproducibility and validity of a food-frequency questionnaire (FFQ) developed for French-speaking Swiss adults. DESIGN: A total of 23 men and 17 women (43.1 ± 2.0 years) filled out one FFQ and completed one 24-hour dietary recall at baseline and 1 month afterward. RESULTS: Crude Pearson's correlation coefficients between the first and the second FFQ ranged from 0.58 to 0.90, and intraclass correlation coefficients (ICCs) ranged between 0.53 and 0.92. Lin's concordance coefficients ranged between 0.55 and 0.87. Over 80% of participants were classified in the same or adjacent tertile using each FFQ. Macronutrient intakes estimated by both FFQs were significantly higher than those estimated from the 24-hour recall for protein and water, while no significant differences were found for energy, carbohydrate, fats (five groups), and alcohol. De-attenuated Pearson's correlation coefficients between the 24-hour recall and the first FFQ ranged between 0.31 and 0.49, while for the second FFQ the values ranged between 0.38 and 0.59. Over 40 and 95% of participants fell into the same or the adjacent energy and nutrient tertiles, respectively, using the FFQs and the 24-hour recall. CONCLUSIONS: This FFQ shows good reproducibility and can be used to determine macronutrient intake in a French-speaking Swiss population in an epidemiological setting.
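
Lin's concordance coefficient, reported above alongside Pearson's correlation, penalizes systematic shifts between two administrations as well as scatter. A short sketch of the usual formula, 2·cov(x, y) / (var(x) + var(y) + (mean(x) − mean(y))²) with population (1/n) moments, is given below; the intake values are hypothetical.

```python
from statistics import mean


def lin_ccc(x: list[float], y: list[float]) -> float:
    """Lin's concordance correlation coefficient using 1/n (population) moments."""
    n = len(x)
    mx, my = mean(x), mean(y)
    var_x = sum((a - mx) ** 2 for a in x) / n
    var_y = sum((b - my) ** 2 for b in y) / n
    cov_xy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    return 2 * cov_xy / (var_x + var_y + (mx - my) ** 2)


# Hypothetical intakes from a first and second questionnaire; the second is
# shifted by a constant, so Pearson's r would be 1 but concordance is lower.
ffq_1 = [1.0, 2.0, 3.0]
ffq_2 = [2.0, 3.0, 4.0]
print(round(lin_ccc(ffq_1, ffq_2), 3))
```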

Abstract:

Purpose: To evaluate inter- and intraobserver variability of indices crucial for detection of keratoconus progression derived from the Pentacam HR® (high-resolution) tomographer (OCULUS Optikgeräte GmbH, Wetzlar, Germany) in patients with mild to moderate keratoconus. Methods: Three repeated corneal topography measurements in the 25-picture mode by two independent observers were performed. The extent of variability across a large range of measurement parameters was analyzed including anterior and posterior corneal surface measurements, pachymetry values, corneal volume, anterior chamber volume and depth, and iridocorneal angle. The intraclass correlation coefficient (ICC) between and within each investigator was calculated to assess reproducibility and repeatability, respectively. Results: 31 eyes of 20 patients (mean age 31.6, SD ± 8.6) were included. Overall, the repeatability and reproducibility were excellent. The range of variability was reported by calculating the standard deviation of measurements. The detailed results are shown in Table 1. Conclusions: This study shows that the Pentacam HR® tomographer provides reliable measurements in patients with mild to moderate keratoconus. However, all parameters showed a certain range of variability. This should be taken into account when assessing keratoconus progression in order to distinguish true progression from variability in measurements. In addition, the excellent reproducibility suggests that the measurements can be reliably performed by different individuals from one visit to another.

Abstract:

PURPOSE: To evaluate the technical quality and the diagnostic performance of a protocol with use of low volumes of contrast medium (25 mL) at 64-detector spiral computed tomography (CT) in the diagnosis and management of adult, nontraumatic subarachnoid hemorrhage (SAH). MATERIALS AND METHODS: This study was performed outside the United States and was approved by the institutional review board. Intracranial CT angiography was performed in 73 consecutive patients with nontraumatic SAH diagnosed at nonenhanced CT. Image quality was evaluated by two observers using two criteria: degree of arterial enhancement and venous contamination. The two independent readers evaluated diagnostic performance (lesion detection and correct therapeutic decision-making process) by using rotational angiographic findings as the standard of reference. Sensitivity, specificity, and positive and negative predictive values were calculated for patients who underwent CT angiography and three-dimensional rotational angiography. The intraclass correlation coefficient was calculated to assess interobserver concordance concerning aneurysm measurements and therapeutic management. RESULTS: All aneurysms were detected, either ruptured or unruptured. Arterial opacification was excellent in 62 cases (85%), and venous contamination was absent or minor in 61 cases (84%). In 95% of cases, CT angiographic findings allowed optimal therapeutic management. The intraclass correlation coefficient ranged between 0.93 and 0.95, indicating excellent interobserver agreement. CONCLUSION: With only 25 mL of iodinated contrast medium focused on the arterial phase, 64-detector CT angiography allowed satisfactory diagnostic and therapeutic management of nontraumatic SAH.
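
Sensitivity, specificity, and the predictive values mentioned in the methods above all derive from a 2×2 table of index-test results against the reference standard. The abstract does not report the underlying counts, so the numbers below are hypothetical and serve only to show the calculation.

```python
def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> dict[str, float]:
    """Standard accuracy measures from true/false positive and negative counts."""
    return {
        "sensitivity": tp / (tp + fn),  # detected among those with the lesion
        "specificity": tn / (tn + fp),  # negative among those without it
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }


# Hypothetical counts, not taken from the study.
metrics = diagnostic_metrics(tp=45, fp=5, fn=0, tn=50)
print(metrics)
```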

Abstract:

BACKGROUND: To compare morphological gross tumor volumes (GTVs), defined as pre- and postoperative gadolinium enhancement on T1-weighted magnetic resonance imaging, to biological tumor volumes (BTVs), defined by the uptake of 18F-fluoroethyltyrosine (FET), for the radiotherapy planning of high-grade glioma, using a dedicated positron emission tomography (PET)-CT scanner equipped with three triangulation lasers for patient positioning. METHODS: Nineteen patients with malignant glioma were included in a prospective protocol using FET PET-CT for radiotherapy planning. To be eligible, patients had to present with residual disease after surgery. Planning was performed using the clinical target volume (CTV = GTV ∪ BTV) and planning target volume (PTV = CTV + 20 mm). First, the interrater reliability for BTV delineation was assessed among three observers. Second, the BTV and GTV were quantified and compared. Finally, the geometrical relationships between GTV and BTV were assessed. RESULTS: Interrater agreement for BTV delineation was excellent (intraclass correlation coefficient 0.9). Although BTVs and GTVs were not significantly different (p = 0.9), CTVs (mean 57.8 ± 30.4 cm³) were significantly larger than BTVs (mean 42.1 ± 24.4 cm³; p < 0.01) or GTVs (mean 38.7 ± 25.7 cm³; p < 0.01). In 13 (68%) and 6 (32%) of 19 patients, FET uptake extended ≥10 and ≥20 mm, respectively, from the margin of the gadolinium enhancement. CONCLUSION: With FET, interrater agreement for BTV delineation was excellent. With FET PET-CT planning, the size and geometrical location of GTVs and BTVs differed in a majority of patients.
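
The CTV above is defined as the union of the GTV and the BTV, which explains why it can be significantly larger than either volume even when the two do not differ in size: they need only differ in location. The sketch below illustrates this with voxel sets; the voxel coordinates, voxel size, and function name are hypothetical.

```python
Voxel = tuple[int, int, int]


def union_volume_cm3(gtv: set[Voxel], btv: set[Voxel], voxel_volume_cm3: float) -> float:
    """Volume of the CTV taken as the set union of the GTV and BTV voxels."""
    ctv = gtv | btv
    return len(ctv) * voxel_volume_cm3


# Hypothetical masks: equal size (two voxels each) but only partly overlapping.
gtv_voxels = {(0, 0, 0), (0, 0, 1)}
btv_voxels = {(0, 0, 1), (0, 1, 1)}
print(union_volume_cm3(gtv_voxels, btv_voxels, voxel_volume_cm3=0.5))
```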