RESUMO
BACKGROUND: Securing sufficient blood perfusion to the anastomotic area after low-anterior resection is a crucial factor in preventing anastomotic leakage (AL). Intra-operative indocyanine green fluorescent imaging (ICG-FI) has been suggested as a tool to assess perfusion. However, knowledge of inter-observer variation among surgeons in the interpretation of ICG-FI is sparse. Our primary objective was to evaluate inter-observer variation among surgeons in the interpretation of bowel blood-perfusion assessed visually by ICG-FI. Our secondary objective was to compare the results both from the visual assessment of ICG and from computer-based quantitative analyses of ICG-FI between patients with and without the development of AL. METHOD: A multicenter study, including patients undergoing robot-assisted low anterior resection with stapled anastomosis. ICG-FI was evaluated visually by the surgeon intra-operatively. Postoperatively, recorded videos were anonymized and exchanged between centers for inter-observer evaluation. Time to visibility (TTV), time to maximum visibility (TMV), and time to wash-out (TWO) were visually assessed. In addition, the ICG-FI video-recordings were analyzed using validated pixel analysis software to quantify blood perfusion. RESULTS: Fifty-five patients were included, and five developed clinical AL. Bland-Altman plots (BA plots) demonstrated wide inter-observer variation for visually assessed fluorescence on all parameters (TTV, TMV, and TWO). Comparing leak-group with no-leak group, we found no significant differences for TTV: Hazard Ratio; HR = 0.82 (CI 0.32; 2.08), TMV: HR = 0.62 (CI 0.24; 1.59), or TWO: HR = 1.11 (CI 0.40; 3.11). In the quantitative pixel analysis, a lower slope of the fluorescence time-curve was found in patients with a subsequent leak: median 0.08 (0.07;0.10) compared with non-leak patients: median 0.13 (0.10;0.17) (p = 0.04). CONCLUSION: The surgeon's visual assessment of the ICG-FI demonstrated wide inter-observer variation, there were no differences between patients with and without AL. However, quantitative pixel analysis showed a significant difference between groups. TRIAL REGISTRATION: ClinicalTrials.gov Identifier: NCT04766060.
Assuntos
Neoplasias Colorretais , Laparoscopia , Robótica , Humanos , Verde de Indocianina , Variações Dependentes do Observador , Neoplasias Colorretais/cirurgia , Laparoscopia/métodos , Anastomose Cirúrgica/métodos , Fístula Anastomótica/etiologia , Fístula Anastomótica/prevenção & controle , Fístula Anastomótica/cirurgia , Perfusão , AngiofluoresceinografiaRESUMO
BACKGROUND: Pediatric hepatic steatosis is a global public health concern, as an increasing number of children are affected by this condition. Liver biopsy is the gold standard diagnostic method; however, this procedure is invasive. Magnetic resonance imaging (MRI)-derived proton density fat fraction has been accepted as an alternative to biopsy. However, this method is limited by cost and availability. Ultrasound (US) attenuation imaging is an upcoming tool for noninvasive quantitative assessment of hepatic steatosis in children. A limited number of publications have focused on US attenuation imaging and the stages of hepatic steatosis in children. OBJECTIVE: To analyze the usefulness of ultrasound attenuation imaging for the diagnosis and quantification of hepatic steatosis in children. MATERIAL AND METHODS: Between July and November 2021, 174 patients were included and divided into two groups: group 1, patients with risk factors for steatosis (n = 147), and group 2, patients without risk factors for steatosis (n = 27). In all cases, age, sex, weight, body mass index (BMI), and BMI percentile were determined. B-mode US (two observers) and US attenuation imaging with attenuation coefficient acquisition (two independent sessions, two different observers) were performed in both groups. Steatosis was classified into four grades (0: absent, 1: mild, 2: moderate and 3: severe) using B-mode US. Attenuation coefficient acquisition was correlated with steatosis score according to Spearman's correlation. Attenuation coefficient acquisition measurements' interobserver agreement was assessed using intraclass correlation coefficients (ICC). RESULTS: All attenuation coefficient acquisition measurements were satisfactory without technical failures. The median values for group 1 for the first session were 0.64 (0.57-0.69) dB/cm/MHz and 0.64 (0.60-0.70) dB/cm/MHz for the second session. The median values for group 2 for the first session were 0.54 (0.51-0.56) dB/cm/MHz and 0.54 (0.51-0.56) dB/cm/MHz for the second. The average attenuation coefficient acquisition was 0.65 (0.59-0.69) dB/cm/MHz for group 1 and 0.54 (0.52-0.56) dB/cm/MHz for group 2. There was excellent interobserver agreement at 0.94 (95% CI 0.92-0.96). There was substantial agreement between both observers (κ = 0.77, with a P < 0.001). There was a positive correlation between ultrasound attenuation imaging and B-mode scores for both observers (r = 0.87, P < 0.001 for observer 1; r = 0.86, P < 0.001 for observer 2). Attenuation coefficient acquisition median values were significantly different for each steatosis grade (P < 0.001). In the assessment of steatosis by B-mode US, the agreement between the two observers was moderate (κ = 0.49 and κ = 0.55, respectively, with a P < 0.001 in both cases). CONCLUSION: US attenuation imaging is a promising tool for the diagnosis and follow-up of pediatric steatosis, which provides a more repeatable form of classification, especially at low levels of steatosis detectable in B-mode US.
Assuntos
Fígado Gorduroso , Hepatopatia Gordurosa não Alcoólica , Humanos , Criança , Fígado/diagnóstico por imagem , Fígado/patologia , Fígado Gorduroso/diagnóstico por imagem , Fígado Gorduroso/patologia , Ultrassonografia/métodos , Biópsia , Imageamento por Ressonância Magnética/métodos , Curva ROCRESUMO
BACKGROUND: The Danish Neuro Oncology Group (DNOG) has established national consensus guidelines for the delineation of organs at risk (OAR) structures based on published literature. This study was conducted to finalise these guidelines and evaluate the inter-observer variability of the delineated OAR structures by expert observers. MATERIAL AND METHODS: The DNOG delineation guidelines were formed by participants from all Danish centres that treat brain tumours with radiotherapy. In a two-day workshop, guidelines were discussed and finalised based on a pilot study. Following this, the ten participants contoured the following OARs on T1-weighted gadolinium enhanced MRI from 13 patients with brain tumours: optic tracts, optic nerves, chiasm, spinal cord, brainstem, pituitary gland and hippocampus. The metrics used for comparison were the Dice similarity coefficient (Dice), mean surface distance (MSD) and others. RESULTS: A total of 968 contours were delineated across the 13 patients. On average eight (range six to nine) individual contour sets were made per patient. Good agreement was found across all structures with a median MSD below 1 mm for most structures, with the chiasm performing the best with a median MSD of 0.45 mm. The Dice was as expected highly volume dependent, the brainstem (the largest structure) had the highest Dice value with a median of 0.89 whereas smaller volumes such as the chiasm had a Dice of 0.71. CONCLUSION: Except for the caudal definition of the spinal cord, the variances observed in the contours of OARs in the brain were generally low and consistent. Surface mapping revealed sub-regions of higher variance for some organs. The data set is being prepared as a validation data set for auto-segmentation algorithms for use within the Danish Comprehensive Cancer Centre - Radiotherapy and potential collaborators.
Assuntos
Órgãos em Risco , Planejamento da Radioterapia Assistida por Computador , Encéfalo/diagnóstico por imagem , Humanos , Variações Dependentes do Observador , Projetos PilotoRESUMO
INTRODUCTION: Universal screening for preterm delivery by adding transvaginal ultrasound measurement of cervical length to routine second trimester ultrasound has been proposed. The aim is to estimate inter- and intraobserver agreement and reliability of second trimester transvaginal ultrasound measurements of cervical length performed by specially trained midwife sonographers. MATERIAL AND METHODS: This is a prospective reliability and agreement study performed in seven Swedish ultrasound centers. In total, 18 midwife sonographers specially trained to perform ultrasound measurements of cervical length and 286 women in the second trimester were included. In each center, two midwife sonographers measured cervical length a few minutes apart in the same woman, the number of women examined per examiner pair varying between 24 and 30 (LIVE study). Sixteen midwife sonographers measured cervical length twice ≥2 months apart on 93 video clips (CLIPS study). The main outcome measures were mean difference, limits of agreement, intraclass correlation coefficient, intra-individual standard deviation, repeatability, Cohen's kappa and Fleiss kappa. RESULTS: The limits of agreement and intraclass correlation coefficient of the best examiner pair in the LIVE study were -4.06 to 4.72 mm and 0.91, and those of the poorest were -11.11 to 11.39 mm and 0.31. In the CLIPS study, median (range) intra-individual standard deviation was 2.14 mm (1.40-3.46), repeatability 5.93 mm (3.88-9.58), intraclass correlation coefficient 0.84 (0.66-0.94). Median (range) interobserver agreement for cervical length ≤25 mm in the CLIPS study was 94.6% (84.9%-98.9%) and Cohen's kappa 0.56 (0.12-0.92), median (range) intraobserver agreement was 95.2% (87.1%-98.9%) and Cohen's kappa 0.68 (0.27-0.93). CONCLUSIONS: Agreement and reliability of cervical length measurements differed substantially between examiner pairs and examiners. If cervical length measurements are used to guide management there is potential for both over- and under-treatment. Uniform training and rigorous supervision and quality control are advised.
Assuntos
Medida do Comprimento Cervical , Colo do Útero/diagnóstico por imagem , Segundo Trimestre da Gravidez , Incompetência do Colo do Útero/diagnóstico por imagem , Adulto , Feminino , Humanos , Tocologia , Variações Dependentes do Observador , Gravidez , Estudos Prospectivos , Reprodutibilidade dos TestesRESUMO
Microscopic colitis (MC) comprising lymphocytic colitis (LC), collagenous colitis (CC) and the incomplete forms of microscopic colitis (MCi) are frequent causes of chronic watery diarrhea. The diagnosis is based on specific histological features in colonic biopsies. Especially regarding MCi, the histological features may be subtle. The PRO-MC collaboration was established in 2016 with the aims to systematically describe the disease course and to validate the diagnostic criteria of MC. In the present study, we analysed pathologists' initial approach to diagnose MC. Five pathologists with expertise in gastro-intestinal pathology reviewed the first 10 cases enrolled in the PRO-MC registry in six of the participating centres. Despite considerable differences in strategies in biopsy sampling, in choice of stains and in minimum number of biopsies and segments required for diagnosing MC, inter-observer agreement between the participating centres and expert pathologists as well as among the latter was substantial. Disagreed cases most often related to difficulties in distinguishing between MC subgroups. We recommend that pathologists as well as clinicians reach consensus in their diagnostic approach to MC, which is a prerequisite to compare MC cohorts internationally and to facilitate clinical MC trials and follow-up studies.
Assuntos
Colite/diagnóstico , Patologia Clínica/métodos , Patologia Clínica/normas , Europa (Continente) , Humanos , Variações Dependentes do Observador , Patologistas , Padrões de Prática Médica/normasRESUMO
PURPOSE AND HYPOTHESIS: In preoperative sizing for meniscal transplantation, most authors take into consideration the length and width of the original meniscus, but not its height. This study aimed at evaluating (1) whether the meniscal height is associated with the meniscal length and width, (2) whether the heights of the meniscal segments are associated with the individual's anthropometric data, (3) whether the heights of the meniscal segments are associated with each other in the same meniscus, and (4) the degree of symmetry of the meniscal dimensions between the right and left knees. METHODS: In this cross-sectional, observational study, two independent radiologists measured the meniscal length, width and height in knee magnetic resonance imaging scans obtained from 25 patients with patello-femoral pain syndrome. Reproducibility of measurements was calculated with intraclass correlation coefficients. Associations between the anthropometric data and the meniscal measurements, the meniscal length and width versus height, and the heights of the meniscal segments in the same meniscus were examined with Pearson's correlation. RESULTS: Inter-observer reliability was excellent (>0.8) for length and height and good (0.6-0.8) for width measurements. There was also excellent agreement (>0.8) for the length and width of the menisci in the right and left knees. The heights of the horns of the lateral meniscus showed good agreement (0.6-0.8), while the heights of the other meniscal segments had excellent agreement between the sides (>0.8). There were significant associations with generally low (r < 0.5) correlation between the heights of the meniscal segments and the lengths and widths of the menisci, between the meniscal height and anthropometric data, and between the heights of the meniscal segments in the same meniscus. Correlations between anthropometric data and meniscal length and width were generally high (r > 0.7). CONCLUSIONS: There was excellent agreement between the meniscal dimensions of the right and left knees, and a weak association between the meniscal height with the meniscal width and length, between the height of the menisci with anthropometric data and between the heights of the segments in the same meniscus. The height of the meniscal segments may be a new variable in preoperative meniscal measurement.
Assuntos
Imageamento por Ressonância Magnética , Meniscos Tibiais/anatomia & histologia , Meniscos Tibiais/transplante , Procedimentos Ortopédicos , Síndrome da Dor Patelofemoral/cirurgia , Cuidados Pré-Operatórios/métodos , Adolescente , Adulto , Estudos Transversais , Feminino , Humanos , Masculino , Meniscos Tibiais/diagnóstico por imagem , Variações Dependentes do Observador , Síndrome da Dor Patelofemoral/diagnóstico por imagem , Reprodutibilidade dos Testes , Adulto JovemRESUMO
OBJECTIVE: To investigate if pathologic biopsy reevaluation and implementation of immunohistochemical biomarkers could improve prediction of radical prostatectomy outcome in men initially on active surveillance. METHODS: Biopsy specimens from diagnosis until switching to radical prostatectomy in men initially on active surveillance in the Dutch part of the Prostate cancer Research International Active Surveillance (PRIAS) study were collected and revised by a single pathologist. Original and revised biopsy Gleason score were compared and correlated with radical prostatectomy Gleason score. Biopsy specimens were immunohistochemically stained for Ki67 and ERG. Predictive ability of clinical characteristics and biomarkers on Gleason ≥7 or ≥pT3 on radical prostatectomy was tested using logistic regression and ROC curve analysis. RESULTS: A total of 150 biopsies in 95 men were revised. In 13% of diagnostic or second-to-last biopsies and 20% of the last biopsies on active surveillance revision of Gleason score resulted in change of recommendation (ie, active treatment or active surveillance). Concordance with Gleason score on radical prostatectomy was however similar for both the revised and original Gleason on biopsy. Ki67 and ERG were not statistically significant predictors of Gleason ≥7 or ≥pT3 on radical prostatectomy. CONCLUSIONS: Although interobserver differences in pathology reporting on biopsy could result in a change of management strategy in approximately 13-20% of men on active surveillance, both pathological revision and tested biomarkers (Ki67 and ERG) did not improve prediction of outcome on radical prostatectomy. Undersampling of most aggressive tumor remains the main focus in order to increase accurate grading at time of treatment decision making.
Assuntos
Biópsia/métodos , Antígeno Ki-67/análise , Próstata/patologia , Prostatectomia/métodos , Neoplasias da Próstata , Idoso , Biópsia/estatística & dados numéricos , Gerenciamento Clínico , Progressão da Doença , Humanos , Imuno-Histoquímica , Masculino , Pessoa de Meia-Idade , Gradação de Tumores , Países Baixos/epidemiologia , Variações Dependentes do Observador , Valor Preditivo dos Testes , Prognóstico , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/epidemiologia , Neoplasias da Próstata/patologia , Neoplasias da Próstata/cirurgia , Regulador Transcricional ERG/análiseRESUMO
OBJECTIVES: To elucidate the accuracy and inter-observer agreement of non-tuberculous mycobacterial lung disease (NTM-LD) diagnosis based on chest CT findings. METHODS: Two chest radiologists and two pulmonologists interpreted chest CTs of 66 patients with NTM-LD, 33 with pulmonary tuberculosis and 33 with non-cystic fibrosis bronchiectasis. These observers selected one of these diagnoses for each case without knowing any clinical information except age and sex. Sensitivity and specificity were calculated according to degree of observer confidence. Inter-observer agreement was assessed using Fleiss' κ values. Multiple logistic regression was performed to elucidate which radiological features led to the correct diagnosis. RESULTS: The sensitivity of NTM-LD diagnosis was 56.4 % (95 % CI 47.9-64.7) and specificity 80.3 % (73.1-86.0). The specificity of NTM-LD diagnosis increased with confidence: 44.4 % (20.5-71.3) for possible, 77.4 % (67.4-85.0) for probable, 95.2 % (87.2-98.2) for definite (P < 0.001) diagnoses. Inter-observer agreement for NTM-LD diagnosis was moderate (κ = 0.453). Tree-in-bud pattern (adjusted odds ratio [aOR] 6.24, P < 0.001), consolidation (aOR 1.92, P = 0.036) and atelectasis (aOR 3.73, P < 0.001) were associated with correct NTM-LD diagnoses, whereas presence of pleural effusion (aOR 0.05, P < 0.001) led to false diagnoses. CONCLUSIONS: NTM-LD diagnosis based on chest CT findings is specific but not sensitive. KEY POINTS: ⢠Diagnosis of NTM-LD based on radiological findings showed high specificity. ⢠Sensitivity of NTM-LD diagnosis was around 50 %. ⢠Inter- observer agreement was moderate. ⢠Identification of tree-in-bud pattern, consolidation and atelectasis led to correct diagnoses.
Assuntos
Bronquiectasia/diagnóstico por imagem , Infecções por Mycobacterium não Tuberculosas/diagnóstico por imagem , Tuberculose Pulmonar/diagnóstico por imagem , Idoso , Feminino , Humanos , Modelos Logísticos , Masculino , Pessoa de Meia-Idade , Análise Multivariada , Variações Dependentes do Observador , Razão de Chances , Atelectasia Pulmonar/diagnóstico por imagem , Estudos Retrospectivos , Sensibilidade e Especificidade , Tomografia Computadorizada por Raios X/métodosRESUMO
BACKGROUND: The Bosniak classification is a diagnostic tool for the differentiation of cystic changes in the kidney. The process of categorizing renal cysts may be challenging, involving a series of decisions that may affect the final diagnosis and clinical outcome such as surgical management. PURPOSE: To investigate the inter- and intra-observer agreement among experienced uroradiologists when categorizing complex renal cysts according to the Bosniak classification. MATERIAL AND METHODS: The original categories of 100 cystic renal masses were chosen as "Gold Standard" (GS), established in consensus by two experienced uroradiologists. Three experienced uroradiological readers were blinded from the previous CT reports. Weighted κ was calculated to assess agreement, defined as: fair, 0.21-0.40; moderate, 0.41-0.60; good, 0.61-0.80; and very good, 0.81-1.00. RESULTS: For readers the distribution of correctly classified lesions were as follows: BI, 95-100%; BII, 59-93%; BIIF, 54-92%; BIII, 58-95%; and B IV, 77-100% for the first review. Weighted κ for inter-observer/intra-observer variation was for Reader A: 0.85/0.99, Reader B: 0.97/0.99, and Reader C: 0.98/0.99, (P ≤ 0.001). CONCLUSION: According to the calculated weighted κ all readers performed "very good" for both inter-observer and intra-observer variation. Most variation was seen in cysts catagorized as Bosniak II, IIF, and III. These results show that radiologists who evaluate complex renal cysts routinely may apply the Bosniak classification reproducibly.
Assuntos
Doenças Renais Císticas/diagnóstico por imagem , Neoplasias Renais/diagnóstico por imagem , Tomografia Computadorizada por Raios X/métodos , Idoso , Meios de Contraste , Diagnóstico Diferencial , Feminino , Humanos , Rim/diagnóstico por imagem , Masculino , Pessoa de Meia-Idade , Variações Dependentes do Observador , Intensificação de Imagem Radiográfica , Interpretação de Imagem Radiográfica Assistida por Computador/métodos , Reprodutibilidade dos Testes , Estudos Retrospectivos , Tomografia Computadorizada por Raios X/normas , Ácidos Tri-IodobenzoicosRESUMO
Extremely well-differentiated gastric-type adenocarcinoma (EWDGA) is a rare type of gastric cancer composed of deceptively bland-looking malignant cells resembling normal foveolar or pyloric epithelium. The histological features of this tumor have not been recognized by many pathologists, and inter-observer variation studies are lacking. Here, we report seven EWDGAs and inter-observer variation of six preoperative biopsies was evaluated by 11 pathologists in a single institute. Based on the pathological diagnosis of the endoscopic biopsy slides, the average rate of definite malignancy diagnosis was 15.2â¯%, and the overall diagnostic concordance rate was 34.9â¯% among 11 pathologists. Microscopically, the surface epithelium was preserved and only a few atypical tumor glands were scattered in most endoscopic biopsies. Structural atypia was minimal, and the tumor glands were barely distinguishable from normal glands. Although nuclear atypia was minimal, enlarged nuclei, relatively large glands with irregular shapes, and abundant cytoplasmic mucin were observed in gastric pinch biopsies. In preoperative biopsies, no cases showed p53 overexpression, and Ki-67 labeling index ranged from 3â¯% to 35â¯% and was higher compared to non-neoplastic glands in 3 cases. After gastrectomy, four (57.1â¯%) patients had advanced gastric cancer and three (42.9â¯%) had lymph node metastasis. Genomic profiling of the four patients revealed mutations of TP53, BRAF, KRAS, STK11, and MDM2/CCND1 amplification. Immunohistochemistry for p53 was not helpful while Ki-67 may be helpful when staining pattern is distinct from the non-neoplastic mucosa. In conclusion, it is challenging to diagnose EWDGA using biopsy specimens. Recognizing and addressing this rare entity will increase diagnostic accuracy to ensure the early diagnosis of cancer.
RESUMO
PURPOSE: To quantify interobserver variation (IOV) in target volume and organs-at-risk (OAR) contouring across 31 institutions in breast cancer cases and to explore the clinical utility of deep learning (DL)-based auto-contouring in reducing potential IOV. METHODS AND MATERIALS: In phase 1, two breast cancer cases were randomly selected and distributed to multiple institutions for contouring six clinical target volumes (CTVs) and eight OAR. In Phase 2, auto-contour sets were generated using a previously published DL Breast segmentation model and were made available for all participants. The difference in IOV of submitted contours in phases 1 and 2 was investigated quantitatively using the Dice similarity coefficient (DSC) and Hausdorff distance (HD). The qualitative analysis involved using contour heat maps to visualize the extent and location of these variations and the required modification. RESULTS: Over 800 pairwise comparisons were analysed for each structure in each case. Quantitative phase 2 metrics showed significant improvement in the mean DSC (from 0.69 to 0.77) and HD (from 34.9 to 17.9 mm). Quantitative analysis showed increased interobserver agreement in phase 2, specifically for CTV structures (5-19 %), leading to fewer manual adjustments. Underlying IOV differences causes were reported using a questionnaire and hierarchical clustering analysis based on the volume of CTVs. CONCLUSION: DL-based auto-contours improved the contour agreement for OARs and CTVs significantly, both qualitatively and quantitatively, suggesting its potential role in minimizing radiation therapy protocol deviation.
Assuntos
Neoplasias da Mama , Aprendizado Profundo , Humanos , Feminino , Neoplasias da Mama/diagnóstico por imagem , Planejamento da Radioterapia Assistida por Computador/métodos , Órgãos em Risco , Mama/diagnóstico por imagemRESUMO
BACKGROUND AND PURPOSE: Studies investigating the application of Artificial Intelligence (AI) in the field of radiotherapy exhibit substantial variations in terms of quality. The goal of this study was to assess the amount of transparency and bias in scoring articles with a specific focus on AI based segmentation and treatment planning, using modified PROBAST and TRIPOD checklists, in order to provide recommendations for future guideline developers and reviewers. MATERIALS AND METHODS: The TRIPOD and PROBAST checklist items were discussed and modified using a Delphi process. After consensus was reached, 2 groups of 3 co-authors scored 2 articles to evaluate usability and further optimize the adapted checklists. Finally, 10 articles were scored by all co-authors. Fleiss' kappa was calculated to assess the reliability of agreement between observers. RESULTS: Three of the 37 TRIPOD items and 5 of the 32 PROBAST items were deemed irrelevant. General terminology in the items (e.g., multivariable prediction model, predictors) was modified to align with AI-specific terms. After the first scoring round, further improvements of the items were formulated, e.g., by preventing the use of sub-questions or subjective words and adding clarifications on how to score an item. Using the final consensus list to score the 10 articles, only 2 out of the 61 items resulted in a statistically significant kappa of 0.4 or more demonstrating substantial agreement. For 41 items no statistically significant kappa was obtained indicating that the level of agreement among multiple observers is due to chance alone. CONCLUSION: Our study showed low reliability scores with the adapted TRIPOD and PROBAST checklists. Although such checklists have shown great value during development and reporting, this raises concerns about the applicability of such checklists to objectively score scientific articles for AI applications. When developing or revising guidelines, it is essential to consider their applicability to score articles without introducing bias.
Assuntos
Inteligência Artificial , Lista de Checagem , Técnica Delphi , Planejamento da Radioterapia Assistida por Computador , Humanos , Planejamento da Radioterapia Assistida por Computador/métodos , Planejamento da Radioterapia Assistida por Computador/normas , Guias de Prática Clínica como Assunto , Viés , Reprodutibilidade dos Testes , Neoplasias/radioterapiaRESUMO
PURPOSE: To investigate geometric and dosimetric inter-observer variability in needle reconstruction for temporary prostate brachytherapy. To assess the potential of registrations between transrectal ultrasound (TRUS) and cone-beam computed tomography (CBCT) to support implant reconstructions. METHODS AND MATERIALS: The needles implanted in 28 patients were reconstructed on TRUS by three physicists. Corresponding geometric deviations and associated dosimetric variations to prostate and organs at risk (urethra, bladder, rectum) were analyzed. To account for the found inter-observer variability, various approaches (template-based, probe-based, marker-based) for registrations of CBCT to TRUS were investigated regarding the respective needle transfer accuracy in a phantom study. Three patient cases were examined to assess registration accuracy in-vivo. RESULTS: Geometric inter-observer deviations >1 mm and >3 mm were found for 34.9% and 3.5% of all needles, respectively. Prostate dose coverage (changes up to 7.2%) and urethra dose (partly exceeding given dose constraints) were most affected by associated dosimetric changes. Marker-based and probe-based registrations resulted in the phantom study in high mean needle transfer accuracies of 0.73 mm and 0.12 mm, respectively. In the patient cases, the marker-based approach was the superior technique for CBCT-TRUS fusions. CONCLUSION: Inter-observer variability in needle reconstruction can substantially affect dosimetry for individual patients. Especially marker-based CBCT-TRUS registrations can help to ensure accurate reconstructions for improved treatment planning.
Assuntos
Braquiterapia , Tomografia Computadorizada de Feixe Cônico , Agulhas , Variações Dependentes do Observador , Imagens de Fantasmas , Neoplasias da Próstata , Dosagem Radioterapêutica , Humanos , Masculino , Neoplasias da Próstata/radioterapia , Neoplasias da Próstata/diagnóstico por imagem , Braquiterapia/métodos , Tomografia Computadorizada de Feixe Cônico/métodos , Planejamento da Radioterapia Assistida por Computador/métodos , Ultrassonografia/métodos , Próstata/diagnóstico por imagem , Órgãos em Risco/efeitos da radiação , Radioterapia Guiada por Imagem/métodos , Reto/diagnóstico por imagemRESUMO
Hemoglobin and hematocrit are parameters widely used. They can be obtained from an automated hematology analyzer or from an arterial blood gas analyzer. Its variability is shown in the article "Variability of hemoglobin and hematocrit determined in blood gas equipment." Clinical and statistical information requested is extended for a better understanding of the article and its conclusions. It is suggested to carry out an analysis of variability in parameters and laboratory equipment.
La hemoglobina y el hematocrito son parámetros de amplio uso. Pueden ser obtenidos de un analizador automatizado de hematología o de un analizador de gases arteriales. Su variabilidad se muestra en el artículo "Variabilidad de la hemoglobina y hematocrito determinados en equipo de gases sanguíneos". Se amplía la información clínica y estadística solicitada para la mejor comprensión del trabajo y sus conclusiones. Se sugiere hacer un análisis de variabilidad en parámetros y equipos de laboratorios.
Assuntos
Hemoglobinas , Humanos , HematócritoRESUMO
BACKGROUND: Normal tissue complication probability (NTCP) models can be useful to estimate the risk of fibrosis after breast-conserving surgery (BCS) and radiotherapy (RT) to the breast. However, they are subject to uncertainties. We present the impact of contouring variation on the prediction of fibrosis. MATERIALS AND METHODS: 280 breast cancer patients treated BCS-RT were included. Nine Clinical Target Volume (CTV) contours were created for each patient: i) CTV_crop (reference), cropped 5 mm from the skin and ii) CTV_skin, uncropped and including the skin, iii) segmenting the 95% isodose (Iso95%) and iv) 3 different auto-contouring atlases generating uncropped and cropped contours (Atlas_skin/Atlas_crop). To illustrate the impact of contour variation on NTCP estimates, we applied two equations predicting fibrosis grade ≥ 2 at 5 years, based on Lyman-Kutcher-Burman (LKB) and Relative Seriality (RS) models, respectively, to each contour. Differences were evaluated using repeated-measures ANOVA. For completeness, the association between observed fibrosis events and NTCP estimates was also evaluated using logistic regression. RESULTS: There were minimal differences between contours when the same contouring approach was followed (cropped and uncropped). CTV_skin and Atlas_skin contours had lower NTCP estimates (-3.92%, IQR 4.00, p < 0.05) compared to CTV_crop. No significant difference was observed for Atlas_crop and Iso95% contours compared to CTV_crop. For the whole cohort, NTCP estimates varied between 5.3% and 49.5% (LKB) or 2.2% and 49.6% (RS) depending on the choice of contours. NTCP estimates for individual patients varied by up to a factor of 4. Estimates from "skin" contours showed higher agreement with observed events. CONCLUSION: Contour variations can lead to significantly different NTCP estimates for breast fibrosis, highlighting the importance of standardising breast contours before developing and/or applying NTCP models.
Assuntos
Neoplasias da Mama , Doença da Mama Fibrocística , Feminino , Humanos , Dosagem Radioterapêutica , Neoplasias da Mama/radioterapia , Neoplasias da Mama/cirurgia , Mama/diagnóstico por imagem , Planejamento da Radioterapia Assistida por Computador , Probabilidade , FibroseRESUMO
Background: During the last decade, radiotherapy using MR Linac has gone from research to clinical implementation for different cancer locations. For head and neck cancer (HNC), target delineation based only on MR images is not yet standard, and the utilisation of MRI instead of PET/CT in radiotherapy planning is not well established. We aimed to analyse the inter-observer variation (IOV) in delineating GTV (gross tumour volume) on MR images only for patients with HNC. Material/methods: 32 HNC patients from two independent departments were included. Four clinical oncologists from Denmark and four radiation oncologists from Australia had independently contoured primary tumour GTVs (GTV-T) and nodal GTVs (GTV-N) on T2-weighted MR images obtained at the time of treatment planning. Observers were provided with sets of images, delineation guidelines and patient synopsis. Simultaneous truth and performance level estimation (STAPLE) reference volumes were generated for each structure using all observer contours. The IOV was assessed using the DICE Similarity Coefficient (DSC) and mean absolute surface distance (MASD). Results: 32 GTV-Ts and 68 GTV-Ns were contoured per observer. The median MASD for GTV-Ts and GTV-Ns across all patients was 0.17 cm (range 0.08-0.39 cm) and 0.07 cm (range 0.04-0.33 cm), respectively. Median DSC relative to a STAPLE volume for GTV-Ts and GTV-Ns across all patients were 0.73 and 0.76, respectively. A significant correlation was seen between median DSCs and median volumes of GTV-Ts (Spearman correlation coefficient 0.76, p < 0.001) and of GTV-Ns (Spearman correlation coefficient 0.55, p < 0.001). Conclusion: Contouring GTVs in patients with HNC on MRI showed that the median IOV for GTV-T and GTV-N was below 2 mm, based on observes from two separate radiation departments. However, there are still specific regions in tumours that are difficult to resolve as either malignant tissue or oedema that potentially could be improved by further training in MR-only delineation.
RESUMO
BACKGROUND AND PURPOSE: Neoadjuvant chemoradiotherapy (nCRT) is used in locally recurrent rectal cancer (LRRC) to increase chances of a radical surgical resection. Delineation in LRRC is hampered by complex disease presentation and limited clinical exposure. Within the PelvEx II trial, evaluating the benefit of chemotherapy preceding nCRT for LRRC, a delineation guideline was developed by an expert LRRC team. MATERIALS AND METHODS: Eight radiation oncologists, from Dutch and Swedish expert centres, participated in two meetings, delineating GTV and CTV in six cases. Regions at-risk for re-recurrence or irradical resection were identified by eleven expert surgeons and one expert radiologist. Target volumes were evaluated multidisciplinary. Inter-observer variation was analysed. RESULTS: Inter-observer variation in delineation of LRRC appeared large. Multidisciplinary evaluation per case is beneficial in determining target volumes. The following consensus regarding target volumes was reached. GTV should encompass all tumour, including extension into OAR if applicable. If the tumour is in fibrosis, GTV should encompass the entire fibrotic area. Only if tumour can clearly be distinguished from fibrosis, GTV may be reduced, as long as the entire fibrotic area is covered by the CTV. CTV is GTV with a 1 cm margin and should encompass all at-risk regions for irradical resection or re-recurrence. CTV should not be adjusted towards other organs. Multifocal recurrences should be encompassed in one CTV. Elective nodal delineation is only advised in radiotherapy-naïve patients. CONCLUSION: This study provides a first consensus-based delineation guideline for LRRC. Analyses of re-recurrences is needed to understand disease behaviour and to optimize delineation guidelines accordingly.
Assuntos
Recidiva Local de Neoplasia , Neoplasias Retais , Humanos , Consenso , Neoplasias Retais/diagnóstico por imagem , Neoplasias Retais/terapia , Variações Dependentes do Observador , Fibrose , Planejamento da Radioterapia Assistida por ComputadorRESUMO
BACKGROUND AND PURPOSE: To evaluate the inter-observer variation (IOV) in pharyngeal constrictor muscle (PCM) contouring, and resultant impact on dosimetry and estimated toxicity, as part of the pre-trial radiotherapy trial quality assurance (RTQA) within DARS, a multicenter phase III randomized controlled trial investigating the functional benefits of dysphagia-optimized intensity-modulated radiotherapy (Do-IMRT) in pharyngeal cancers. METHODS AND MATERIALS: Outlining accuracy of 15 clinicians' superior and middle PCM (SMPCM) and inferior PCM (IPCM) were retrospectively assessed against gold standards (GS) using volume, location, and conformity indices (CIs) on a pre-trial benchmark case of oropharyngeal cancer. The influence of delineation variability on dose delivered to the constrictor muscles with Do-IMRT and resultant normal tissue complication probability (NTCP) for physician-scored radiation-associated dysphagia at 6 months was evaluated. RESULTS: For GS, SMPCM, and IPCM volumes were 13.51 and 1.67 cm3; corresponding clinician mean volumes were 12.18 cm3 (SD 3.0) and 2.40 cm3 (SD 0.9) respectively. High IOV in SMPCM and IPCM delineation was observed by the low DICE similarity coefficient value, along with high geographical miss index and discordance index values. Delineation variability did not significantly affect the mean dose delivered to the constrictors, relative to the GS plan. Mean clinician NTCP was 24.6% (SD 0.6), compared to the GS-NTCP of 24.7%. CONCLUSIONS: Results from this benchmark case demonstrate that inaccurate PCM delineation existed, even with protocol guidelines. This did not impact on delivered dose to this structure with Do-IMRT, or on estimated swallowing toxicity, in this single benchmark case.
RESUMO
PURPOSE: Selection of a meniscus allograft with a similar three-dimensional (3D) size is essential for good clinical results in meniscus allograft surgery. Direct meniscus sizing by MRI scan is not possible in total meniscectomy and indirect sizing by conventional radiography is often inaccurate. The purpose of this study was to develop a new indirect sizing method, based on the 3D shape of the ipsilateral tibia plateau, which is independent of the meniscus condition. METHODS: MRI and CT scans of fifty healthy knee joints were used to create 3D surface models of both menisci (MRI) and tibia plateau (CT). 3D bone models of the proximal 10 mm of the entire and half tibia plateau (with / without intercondylar area) were created in a standardized fashion. For each meniscus, the best fitting "allograft" couple out of all other 49 menisci were assessed by the surface distance of the 3D meniscus (best available allograft), of the 3D tibia plateau (3D-CT) and by the radiographic method of Pollard (2D-RX). RESULTS: 3D-CT sizing was significantly better by using only the half tibia plateau without the intercondylar area (p < 0.001). But neither sizing by 3D-CT, nor by 2D-RX could select the best available allograft. Compared to 2D-RX, 3D-CT sizing was significantly better for the medial, but not for the lateral meniscus. CONCLUSIONS: Automatized, indirect meniscus sizing using the 3D bone models of the tibia plateau is feasible and more precise than the previously described 2D-RX method.. However, further technical improvement is needed to select always the best available allograft.
RESUMO
BACKGROUND: Hippocampal avoidance prophylactic cranial irradiation (HA-PCI) techniques have been developed to reduce radiation damage to the hippocampus. An inter-observer hippocampus delineation analysis was performed and the influence of the delineation variability on dose to the hippocampus was studied. MATERIALS AND METHODS: For five patients, seven observers delineated both hippocampi on brain MRI. The intra-class correlation (ICC) with absolute agreement and the generalized conformity index (CIgen) were computed. Median surfaces over all observers' delineations were created for each patient and regional outlining differences were analysed. HA-PCI dose plans were made from the median surfaces and we investigated whether dose constraints in the hippocampus could be met for all delineations. RESULTS: The ICC for the left and right hippocampus was 0.56 and 0.69, respectively, while the CIgen ranged from 0.55 to 0.70. The posterior and anterior-medial hippocampal regions had most variation with SDs ranging from approximately 1 to 2.5 mm. The mean dose (Dmean) constraint was met for all delineations, but for the dose received by 1% of the hippocampal volume (D1%) violations were observed. CONCLUSION: The relatively low ICC and CIgen indicate that delineation variability among observers for both left and right hippocampus was large. The posterior and anterior-medial border have the largest delineation inaccuracy. The hippocampus Dmean constraint was not violated.