63 results for Interobserver Agreement
Abstract:
PURPOSE Different international target volume delineation guidelines exist and different treatment techniques are available for salvage radiation therapy (RT) for recurrent prostate cancer, but less is known regarding their respective applicability in clinical practice. METHODS AND MATERIALS A randomized phase III trial testing 64 Gy vs 70 Gy salvage RT was accompanied by an intense quality assurance program including a site-specific and study-specific questionnaire and a dummy run (DR). Target volume delineation was performed according to the European Organisation for Research and Treatment of Cancer guidelines, and a DR-based treatment plan was established for 70 Gy. Major and minor protocol deviations were noted, interobserver agreement of delineated target contours was assessed, and dose-volume histogram (DVH) parameters of different treatment techniques were compared. RESULTS Thirty European centers participated, 43% of which were using 3-dimensional conformal RT (3D-CRT), with the remaining centers using intensity modulated RT (IMRT) or volumetric modulated arc technique (VMAT). The first submitted version of the DR contained major deviations in 21 of 30 (70%) centers, mostly caused by an inappropriately defined or missing prostate bed (PB). All but 5 centers completed the DR successfully with their second submitted version. The interobserver agreement of the PB was moderate and was improved by the DR review, as indicated by an increased κ value (0.59 vs 0.55), mean sensitivity (0.64 vs 0.58), volume of total agreement (3.9 vs 3.3 cm³), and decrease in the union volume (79.3 vs 84.2 cm³). Rectal and bladder wall DVH parameters of IMRT and VMAT vs 3D-CRT plans were not significantly different. CONCLUSIONS The interobserver agreement of PB delineation was moderate but was improved by the DR. Major deviations could be identified for the majority of centers. The DR has improved the acquaintance of the participating centers with the trial protocol.
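The "volume of total agreement" and "union volume" reported above can be illustrated with a minimal sketch. It assumes each observer's contour is available as a set of voxel indices on a common grid; the function name `agreement_volumes` and the voxel representation are hypothetical and are not taken from the trial's software.

```python
def agreement_volumes(contours, voxel_volume_cm3):
    """Return (volume of total agreement, union volume) in cm^3.

    contours: list of sets of (x, y, z) voxel indices, one set per observer
              (hypothetical representation, assumed for illustration).
    voxel_volume_cm3: volume of a single voxel in cm^3.
    """
    if not contours:
        raise ValueError("at least one contour is required")
    # Voxels that every observer included in the delineated volume.
    common = set.intersection(*contours)
    # Voxels that at least one observer included.
    union = set.union(*contours)
    return len(common) * voxel_volume_cm3, len(union) * voxel_volume_cm3
```

A larger volume of total agreement together with a smaller union volume, as reported after the DR review, indicates that the observers' contours moved closer together.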
Abstract:
Histopathologic tumor regression grades (TRGs) after neoadjuvant chemotherapy predict survival in different cancers. In bladder cancer, corresponding studies have not been conducted. Fifty-six patients with advanced invasive urothelial bladder cancer received neoadjuvant chemotherapy before cystectomy and lymphadenectomy. TRGs were defined as follows: TRG1: complete tumor regression; TRG2: >50% tumor regression; TRG3: 50% or less tumor regression. Separate TRGs were assigned for primary tumors and corresponding lymph nodes. The prognostic impact of these 2 TRGs, the highest (dominant) TRG per patient, and competing tumor features reflecting tumor regression (ypT/ypN stage, maximum diameter of the residual tumor) were determined. Tumor characteristics in initial transurethral resection of the bladder specimens were tested for response prediction. The frequencies of TRGs 1, 2, and 3 in the primary tumors were n=16, n=19, and n=21; corresponding data from the lymph nodes were n=31, n=9, and n=16. Interobserver agreement in determination of the TRG was strong (κ=0.8). Univariately, all evaluated parameters were significantly (P≤0.001) related to overall survival; however, the segregation of the Kaplan-Meier curves was best for the dominant TRG. In multivariate analysis, only dominant TRG predicted overall survival independently (P=0.035). In transurethral resection specimens of the chemotherapy-naive bladder cancer, the only tumor feature with significant (P<0.03) predictive value for therapy response was a high proliferation rate. In conclusion, among all parameters reflecting tumor regression, the dominant TRG was the only independent risk factor. A favorable chemotherapy response is associated with a high proliferation rate in the initial chemotherapy-naive bladder cancer. This feature might help personalize neoadjuvant chemotherapy.
Abstract:
Histopathologic determination of tumor regression provides important prognostic information for locally advanced gastroesophageal carcinomas after neoadjuvant treatment. Regression grading systems mostly refer to the amount of therapy-induced fibrosis in relation to residual tumor or the estimated percentage of residual tumor in relation to the former tumor site. Although these methods are generally accepted, currently there is no common standard for reporting tumor regression in gastroesophageal cancers. We compared the application of these 2 major principles for assessment of tumor regression: hematoxylin and eosin-stained slides from 89 resection specimens of esophageal adenocarcinomas following neoadjuvant chemotherapy were independently reviewed by 3 pathologists from different institutions. Tumor regression was determined by the 5-tiered Mandard system (fibrosis/tumor relation) and the 4-tiered Becker system (residual tumor in %). Interobserver agreement for the Becker system showed better weighted κ values compared with the Mandard system (0.78 vs. 0.62). Evaluation of the whole embedded tumor site showed improved results (Becker: 0.83; Mandard: 0.73) as compared with only 1 representative slide (Becker: 0.68; Mandard: 0.71). Modification into simplified 3-tiered systems showed comparable interobserver agreement but better prognostic stratification for both systems (log rank Becker: P=0.015; Mandard P=0.03), with independent prognostic impact for overall survival (modified Becker: P=0.011, hazard ratio=3.07; modified Mandard: P=0.023, hazard ratio=2.72). In conclusion, both systems provide substantial to excellent interobserver agreement for estimation of tumor regression after neoadjuvant chemotherapy in esophageal adenocarcinomas. A simple 3-tiered system with the estimation of residual tumor in % (complete regression/1% to 50% residual tumor/>50% residual tumor) maintains the highest reproducibility and prognostic value.
Abstract:
OBJECTIVE: To develop a behavioural observation method to simultaneously assess distractors and communication/teamwork during surgical procedures through direct, on-site observations; to establish the reliability of the method for long (>3 h) procedures. METHODS: Observational categories for an event-based coding system were developed based on expert interviews, observations and a literature review. Using Cohen's κ and the intraclass correlation coefficient, interobserver agreement was assessed for 29 procedures. Agreement was calculated for the entire surgery, and for the 1st hour. In addition, interobserver agreement was assessed between two tired observers and between a tired and a non-tired observer after 3 h of surgery. RESULTS: The observational system has five codes for distractors (door openings, noise distractors, technical distractors, side conversations and interruptions), eight codes for communication/teamwork (case-relevant communication, teaching, leadership, problem solving, case-irrelevant communication, laughter, tension and communication with external visitors) and five contextual codes (incision, last stitch, personnel changes in the sterile team, location changes around the table and incidents). Based on 5-min intervals, Cohen's κ was good to excellent for distractors (0.74-0.98) and for communication/teamwork (0.70-1). Based on frequency counts, intraclass correlation coefficient was excellent for distractors (0.86-0.99) and good to excellent for communication/teamwork (0.45-0.99). After 3 h of surgery, Cohen's κ was 0.78-0.93 for distractors, and 0.79-1 for communication/teamwork. DISCUSSION: The observational method developed allows a single observer to simultaneously assess distractors and communication/teamwork. Even for long procedures, high interobserver agreement can be achieved. Data collected with this method allow for investigating separate or combined effects of distractions and communication/teamwork on surgical performance and patient outcomes.
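Cohen's κ, used above on 5-min interval codings, corrects the raw proportion of agreement for the agreement expected by chance. The following is a minimal sketch for two observers, assuming each observer's codes are stored as equal-length lists with one entry per interval; the function name and data layout are illustrative assumptions and do not come from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who coded the same sequence of items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal category frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical category throughout
    return (observed - expected) / (1 - expected)
```

For example, with ten intervals coded 1 (distractor present) or 0 (absent), two observers who disagree on a single interval and have marginals of 5/5 and 4/6 reach an observed agreement of 0.9, a chance agreement of 0.5, and therefore κ = 0.8.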
Abstract:
AIMS Information on tumour border configuration (TBC) in colorectal cancer (CRC) is currently not included in most pathology reports, owing to lack of reproducibility and/or established evaluation systems. The aim of this study was to investigate whether an alternative scoring system based on the percentage of the infiltrating component may represent a reliable method for assessing TBC. METHODS AND RESULTS Two hundred and fifteen CRCs with complete clinicopathological data were evaluated by two independent observers, both 'traditionally' by assigning the tumours into pushing/infiltrating/mixed categories, and alternatively by scoring the percentage of infiltrating margin. With the pushing/infiltrating/mixed pattern method, interobserver agreement (IOA) was moderate (κ = 0.58), whereas with the percentage of infiltrating margins method, IOA was excellent (intraclass correlation coefficient of 0.86). A higher percentage of infiltrating margin correlated with adverse features such as higher grade (P = 0.0025), higher pT (P = 0.0007), pN (P = 0.0001) and pM classification (P = 0.0063), high-grade tumour budding (P < 0.0001), lymphatic invasion (P < 0.0001), vascular invasion (P = 0.0032), and shorter survival (P = 0.0008), and was significantly associated with an increased probability of lymph node metastasis (P < 0.001). CONCLUSIONS Information on TBC gives additional prognostic value to pathology reports on CRC. The novel proposed scoring system, by using the percentage of infiltrating margin, outperforms the 'traditional' way of reporting TBC. Additionally, it is reproducible and simple to apply, and can therefore be easily integrated into daily diagnostic practice.
Abstract:
Autophagy ensures cellular homeostasis and is gaining importance in cancer, where it affects carcinogenesis, propagation of the malignant phenotype and development of resistance. To date, its tissue-based analysis by immunohistochemistry remains poorly standardized. Here we show the feasibility of specifically and reliably assessing the autophagy markers LC3B and p62 (SQSTM1) in formalin-fixed and paraffin-embedded human tissue by immunohistochemistry. Preceding functional experiments consisted of depleting LC3B and p62 in H1299 lung cancer cells with subsequent induction of autophagy. Western blot and immunofluorescence validated antibody specificity, knockdown efficiency and autophagy induction prior to fixation in formalin and embedding in paraffin. LC3B and p62 antibodies were validated on formalin-fixed and paraffin-embedded cell pellets of treated and control cells and finally applied on a tissue microarray with 80 human malignant and non-neoplastic lung and stomach formalin-fixed and paraffin-embedded tissue samples. Dot-like staining of various degrees was observed in cell pellets and 18/40 (LC3B) and 22/40 (p62) tumors, respectively. Seventeen tumors were double positive for LC3B and p62. p62 displayed additional significant cytoplasmic and nuclear staining of unknown significance. Interobserver agreement for grading of staining intensities and patterns was substantial to excellent (κ values 0.60-0.83). In summary, we present a specific and reliable IHC staining of LC3B and p62 on formalin-fixed and paraffin-embedded human tissue. Our presented protocol is designed to aid reliable investigation of dysregulated autophagy in solid tumors and may be used on large tissue collections.
Abstract:
A common debate among dermatopathologists is whether prior knowledge of the clinical picture of melanocytic skin neoplasms may introduce a potential bias in the histopathologic examination. Histologic slides from 99 melanocytic skin neoplasms were circulated among 10 clinical dermatologists, all of them formally trained and board-certified dermatopathologists: 5 dermatopathologists had clinical images available after a 'blind' examination (Group 1); the other 5 had clinical images available before microscopic examination (Group 2). Data from the two groups were compared regarding 'consensus' (a diagnosis in agreement by ≥4 dermatopathologists/group), chance-corrected interobserver agreement (Fleiss' κ) and level of diagnostic confidence (LDC: a 1-5 arbitrary scale indicating 'increasing reliability' of any given diagnosis). Compared with Group 1 dermatopathologists, Group 2 achieved fewer consensus diagnoses (84 vs. 90) but a higher κ value (0.74 vs. 0.69) and a greater mean LDC value (4.57 vs. 4.32). The same consensus was achieved by the two groups in 81/99 cases. Spitzoid neoplasms were most frequently controversial for both groups. The histopathologic interpretation of melanocytic neoplasms does not appear to be biased by knowledge of the clinical picture before histopathologic examination.
Abstract:
INTRODUCTION Conventional 2-dimensional radiography uses defined criteria for outcome assessment of apical surgery. However, these radiographic healing criteria are not applicable for 3-dimensional radiography. The present study evaluated the repeatability and reproducibility of new cone-beam computed tomographic (CBCT)-based healing criteria for the judgment of periapical healing 1 year after apical surgery. METHODS CBCT scans taken 1 year after apical surgery (61 roots of 54 teeth in 54 patients, mean age = 54.4 years) were evaluated by 3 blinded and calibrated observers using 4 different indices. Reformatted buccolingual CBCT sections through the longitudinal axis of the treated roots were analyzed. Radiographic healing was assessed at the resection plane (R index), within the apical area (A index), of the cortical plate (C index), and regarding a combined apical-cortical area (B index). All readings were performed twice to calculate the intraobserver agreement (repeatability). Second-time readings were used for analyzing the interobserver agreement (reproducibility). Various statistical tests (Cohen kappa, Fisher, and Spearman) were performed to measure the intra- and interobserver concurrence, the variability of score ratios, and the correlation of indices. RESULTS For all indices, the rates of identical first- and second-time scores were always higher than 80% (intraobserver Cohen κ values ranging from 0.793 to 0.963). The B index (94.0%) showed the highest intraobserver agreement. Regarding interobserver agreement, the highest rate was found for the B index (72.1%). The Fleiss' κ values for R and B indices exhibited substantial agreement (0.626 and 0.717, respectively), whereas the values for A and C indices showed moderate agreement (0.561 and 0.573, respectively). The Spearman correlation coefficients for R, A, C, and B indices all exhibited a moderate to very strong correlation with the highest correlation found between C and B indices (rs = 0.8069). CONCLUSIONS All indices showed an excellent intraobserver agreement (repeatability). With regard to interobserver agreement (reproducibility), the B index (healing of apical and cortical defects combined) and the R index (healing on the resection plane) showed substantial congruence and thus are to be recommended in future studies when using buccolingual CBCT sections for radiographic outcome assessment of apical surgery.
Abstract:
The purpose of this study was to evaluate the reproducibility of a new software-based analysis system for ventilation/perfusion single-photon emission computed tomography/computed tomography (V/P SPECT/CT) in patients with pulmonary emphysema and to compare it to the visual interpretation. PATIENTS, MATERIAL AND METHODS 19 patients (mean age: 68.1 years) with pulmonary emphysema who underwent V/P SPECT/CT were included. Data were analysed by two independent observers in visual interpretation (VI) and by software-based analysis system (SBAS). SBAS PMOD version 3.4 (Technologies Ltd, Zurich, Switzerland) was used to assess counts and volume per lung lobe/per lung and to calculate the count density per lung, lobe ratio of counts and ratio of count density. VI was performed using a visual scale to assess the mean counts per lung lobe. Interobserver variability and association for SBAS and VI were analysed using Spearman's rho correlation coefficient. RESULTS Interobserver agreement correlated highly in perfusion (rho: 0.982, 0.957, 0.90, 0.979) and ventilation (rho: 0.972, 0.924, 0.941, 0.936) for count/count density per lobe and ratio of counts/count density in SBAS. Interobserver agreement correlated clearly for perfusion (rho: 0.655) and weakly for ventilation (rho: 0.458) in VI. CONCLUSIONS SBAS provides more reproducible measures than VI for the relative tracer uptake in V/P SPECT/CTs in patients with pulmonary emphysema. However, SBAS has to be improved for routine clinical use.
Abstract:
PURPOSE To investigate whether the effects of hybrid iterative reconstruction (HIR) on coronary artery calcium (CAC) measurements using the Agatston score lead to changes in assignment of patients to cardiovascular risk groups compared to filtered back projection (FBP). MATERIALS AND METHODS 68 patients (mean age 61.5 years; 48 male; 20 female) underwent prospectively ECG-gated, non-enhanced, cardiac 256-MSCT for coronary calcium scoring. Scanning parameters were as follows: Tube voltage, 120 kV; Mean tube current time-product 63.67 mAs (50 - 150 mAs); collimation, 2 × 128 × 0.625 mm. Images were reconstructed with FBP and with HIR at all levels (L1 to L7). Two independent readers measured Agatston scores of all reconstructions and assigned patients to cardiovascular risk groups. Scores of HIR and FBP reconstructions were correlated (Spearman). Interobserver agreement and variability were assessed with κ-statistics and Bland-Altman plots. RESULTS Agatston scores of HIR reconstructions were closely correlated with FBP reconstructions (L1, R = 0.9996; L2, R = 0.9995; L3, R = 0.9991; L4, R = 0.986; L5, R = 0.9986; L6, R = 0.9987; and L7, R = 0.9986). In comparison to FBP, HIR led to reduced Agatston scores between 97 % (L1) and 87.4 % (L7) of the FBP values. Using HIR iterations L1 - L3, all patients were assigned to identical risk groups as after FBP reconstruction. In 5.4 % of patients the risk group after HIR with the maximum iteration level was different from the group after FBP reconstruction. CONCLUSION There was an excellent correlation of Agatston scores after HIR and FBP with identical risk group assignment at levels 1 - 3 for all patients. Hence it appears that the application of HIR in routine calcium scoring does not entail any disadvantages. Further studies are needed to demonstrate whether HIR is a reliable method for reducing radiation dose in coronary calcium scoring.
Abstract:
BACKGROUND: Integrity of the abdominal aortic aneurysm (AAA) neck is crucial for the long-term success of endovascular AAA repair (EVAR). However, suitable tools for reliable assessment of changes in small aortic volumes are lacking. The purpose of this study was to assess the intraobserver and interobserver variability of software-enhanced 64-row computed tomographic angiography (CTA) AAA neck volume measurements in patients after EVAR. METHODS: A total of 25 consecutive patients successfully treated by EVAR underwent 64-row follow-up CTA in 1.5-mm collimation. Manual CTA measurements were performed twice by three blinded and independent readers in random order with at least a 4-week interval between readings. Maximum and minimum transverse aortic neck diameters were measured twice on two different levels within the proximal neck. Volumetry of the proximal aortic neck was performed by using dedicated software. Variability was calculated as 1.96 SD of the mean arithmetic difference according to Bland and Altman. Two-sided and paired t tests were used to compare measurements. P values <.05 were considered to indicate statistical significance. RESULTS: Intraobserver agreement was excellent for dedicated aneurysmal neck volumetry, with mean differences of less than 1 mL (P > .05), whereas it was poor for transverse aortic neck diameter measurements (P < .05). However, interobserver variability was statistically significant for both neck volumetry (P < .005) and neck diameter measurements (P < .015). CONCLUSIONS: The reliability of dedicated AAA neck volumetry by using 64-row CTA is excellent for serial measurements by individual readers, but not between different readers. Therefore, studies should be performed with aortic neck volumetry by a single experienced reader.
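The variability measure described above (1.96 SD of the differences around their mean, following Bland and Altman) can be sketched as follows. This is an illustrative sketch only: the function name is hypothetical, and the inputs are assumed to be paired measurements of the same patients from two readings or two readers.

```python
from statistics import mean, stdev


def bland_altman_limits(first, second):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the pairwise differences; the limits of agreement
    are bias -/+ 1.96 times the sample standard deviation of the differences.
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need at least two paired measurements")
    diffs = [a - b for a, b in zip(first, second)]
    bias = mean(diffs)
    half_width = 1.96 * stdev(diffs)
    return bias, bias - half_width, bias + half_width
```

Under the usual assumption of approximately normally distributed differences, about 95% of differences between two measurements of the same patient are expected to fall between the lower and upper limit.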
Abstract:
PURPOSE: To quantify the interobserver variability of abdominal aortic aneurysm (AAA) neck length and angulation measurements. MATERIALS AND METHODS: A total of 25 consecutive patients scheduled for endovascular AAA repair underwent follow-up 64-row computed tomographic (CT) angiography in 0.625-mm collimation. AAA neck length and angulation were determined by four blinded, independent readers. AAA neck length was defined as the longitudinal distance between the first transverse CT slice directly distal to the lowermost renal artery and the first transverse CT slice that showed at least a 15% larger outer aortic wall diameter versus the diameter measured directly below the lowermost renal artery. Infrarenal AAA neck angulation was defined as the true angle between the longitudinal axis of the proximal AAA neck and the longitudinal axis of the AAA lumen as analyzed on three-dimensional CT reconstructions. RESULTS: Mean deviation in aortic neck length determination was 32.3% and that in aortic neck angulation was 32.1%. Interobserver variability of aortic neck length and angulation measurements was considerable: in any reader combination, at least one measurement difference was outside the predefined limits of agreement. CONCLUSIONS: Assessment of the longitudinal extension and angulation of the infrarenal aortic neck is associated with substantial observer variability, even if measurement is carried out according to a standardized protocol. Further studies are mandatory to assess dedicated technical approaches to minimize variance in the determination of the longitudinal extension and angulation of the infrarenal aortic neck.
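The neck length definition above is essentially a threshold rule on the outer aortic diameter, which the following minimal sketch makes explicit. It assumes per-slice diameters are already available as a list ordered from the first slice distal to the lowermost renal artery; the function name, parameters and data layout are hypothetical, and the readers in the study measured manually.

```python
def neck_length_mm(slice_positions_mm, outer_diameters_mm, growth=0.15):
    """Estimate AAA neck length from per-slice outer aortic diameters.

    slice_positions_mm: longitudinal position of each transverse slice,
        starting at the first slice distal to the lowermost renal artery.
    outer_diameters_mm: outer aortic wall diameter measured on each slice.
    growth: relative diameter increase that marks the end of the neck
        (0.15 corresponds to the 15% criterion in the definition).
    Returns the distance in mm, or None if the threshold is never reached.
    """
    if len(slice_positions_mm) != len(outer_diameters_mm) or not outer_diameters_mm:
        raise ValueError("positions and diameters must be non-empty and aligned")
    # Reference diameter directly below the lowermost renal artery.
    threshold = outer_diameters_mm[0] * (1 + growth)
    for position, diameter in zip(slice_positions_mm, outer_diameters_mm):
        if diameter >= threshold:
            return abs(position - slice_positions_mm[0])
    return None
```

Because the result depends on which slice each reader takes as the reference and on small differences in diameter measurement near the threshold, a rule of this kind is sensitive to exactly the observer variability the abstract reports.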
Abstract:
Clinical manifestations of lactase (LCT) deficiency include intestinal and extra-intestinal symptoms. Lactose hydrogen breath test (H2-BT) is considered the gold standard to evaluate LCT deficiency (LD). Recently, the single-nucleotide polymorphism C/T(-13910) has been associated with LD. The objectives of the present study were to evaluate the agreement between genetic testing of LCT C/T(-13910) and lactose H2-BT, and the diagnostic value of extended symptom assessment. Of the 201 patients included in the study, 194 (139 females; mean age 38, range 17-79 years, and 55 males, mean age 38, range 18-68 years) patients with clinical suspicion of LD underwent a 3-4 h H2-BT and genetic testing for LCT C/T(-13910). Patients rated five intestinal and four extra-intestinal symptoms during the H2-BT and then at home for the following 48 h. Declaring H2-BT as the gold standard, the CC(-13910) genotype had a sensitivity of 97% and a specificity of 95% with a κ of 0.9 in diagnosing LCT deficiency. Patients with LD had more intense intestinal symptoms 4 h following the lactose challenge included in the H2-BT. We found no difference in the intensity of extra-intestinal symptoms between patients with and without LD. Symptom assessment yielded differences for intestinal symptoms abdominal pain, bloating, borborygmi and diarrhoea between 120 min and 4 h after oral lactose challenge. Extra-intestinal symptoms (dizziness, headache and myalgia) and extension of symptom assessment up to 48 h did not consistently show different results. In conclusion, genetic testing has an excellent agreement with the standard lactose H2-BT, and it may replace breath testing for the diagnosis of LD. Extended symptom scores and assessment of extra-intestinal symptoms have limited diagnostic value in the evaluation of LD.
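Sensitivity, specificity and chance-corrected agreement of an index test against a gold standard can all be derived from one 2×2 table. The sketch below shows the arithmetic under that assumption; the function name and argument names are hypothetical, and the study's actual cell counts are not given in the abstract, so none are reproduced here.

```python
def diagnostic_agreement(tp, fp, fn, tn):
    """Return (sensitivity, specificity, kappa) for a 2x2 table.

    tp: index test positive, gold standard positive
    fp: index test positive, gold standard negative
    fn: index test negative, gold standard positive
    tn: index test negative, gold standard negative
    """
    n = tp + fp + fn + tn
    if n == 0 or (tp + fn) == 0 or (fp + tn) == 0:
        raise ValueError("table needs both gold-standard positives and negatives")
    sensitivity = tp / (tp + fn)
    specificity = tn / (fp + tn)
    # Cohen's kappa: observed agreement corrected for chance agreement.
    observed = (tp + tn) / n
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / (n * n)
    if expected == 1.0:
        # Degenerate table (one test result throughout): kappa is undefined.
        raise ValueError("kappa is undefined when chance agreement equals 1")
    kappa = (observed - expected) / (1 - expected)
    return sensitivity, specificity, kappa
```

With made-up counts of tp=40, fp=5, fn=10, tn=45, this gives a sensitivity of 0.8, a specificity of 0.9 and κ = 0.7.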
Abstract:
To assess the interobserver variability (IOV) in indicating retreatment for neovascular age-related macular degeneration 4 weeks after three ranibizumab loading doses using spectral domain OCT (SD-OCT) as the primary objective diagnostic tool.