RESUMEN
Background Digital breast tomosynthesis (DBT) is often inadequate for screening women with a personal history of breast cancer (PHBC). The ongoing prospective Tomosynthesis or Contrast-Enhanced Mammography, or TOCEM, trial includes three annual screenings with both DBT and contrast-enhanced mammography (CEM). Purpose To perform interim assessment of cancer yield, stage, and recall rate when CEM is added to DBT in women with PHBC. Materials and Methods From October 2019 to December 2022, two radiologists interpreted both examinations: Observer 1 reviewed DBT first and then CEM, and observer 2 reviewed CEM first and then DBT. Effects of adding CEM to DBT on incremental cancer detection rate (ICDR), cancer type and node status, recall rate, and other performance characteristics of the primary radiologist decisions were assessed. Results Among the participants (mean age at entry, 63.6 years ± 9.6 [SD]), 1273, 819, and 227 women with PHBC completed year 1, 2, and 3 screening, respectively. For observer 1, year 1 cancer yield was 20 of 1273 (15.7 per 1000 screenings) for DBT and 29 of 1273 (22.8 per 1000 screenings; ICDR, 7.1 per 1000 screenings [95% CI: 3.2, 13.4]) for DBT plus CEM (P < .001). Year 2 plus 3 cancer yield was four of 1046 (3.8 per 1000 screenings) for DBT and eight of 1046 (7.6 per 1000 screenings; ICDR, 3.8 per 1000 screenings [95% CI: 1.0, 7.6]) for DBT plus CEM (P = .001). Year 1 recall rate for observer 1 was 103 of 1273 (8.1%) for (incidence) DBT alone and 187 of 1273 (14.7%) for DBT plus CEM (difference = 84 of 1273, 6.6% [95% CI: 5.3, 8.1]; P < .001). Year 2 plus 3 recall rate was 40 of 1046 (3.8%) for DBT and 92 of 1046 (8.8%) for DBT plus CEM (difference = 52 of 1046, 5.0% [95% CI: 3.7, 6.3]; P < .001). In 18 breasts with cancer detected only at CEM after integration of both observers, 13 (72%) cancers were invasive (median tumor size, 0.6 cm) and eight of nine (88%) with staging were N0. Among 1883 screenings with adequate reference standard, there were three interval cancers (one at the scar, two in axillae). Conclusion CEM added to DBT increased early breast cancer detection each year in women with PHBC, with an accompanying approximately 5.0%-6.6% recall rate increase. Clinical trial registration no. NCT04085510 © RSNA, 2024 Supplemental material is available for this article.
Asunto(s)
Neoplasias de la Mama , Medios de Contraste , Mamografía , Humanos , Femenino , Neoplasias de la Mama/diagnóstico por imagen , Mamografía/métodos , Estudios Prospectivos , Persona de Mediana Edad , Detección Precoz del Cáncer/métodos , Anciano , Intensificación de Imagen Radiográfica/métodos , Mama/diagnóstico por imagenRESUMEN
Background Staging newly diagnosed breast cancer by using dynamic contrast material-enhanced MRI is limited by access, high cost, and false-positive findings. The utility of contrast-enhanced mammography (CEM) and 99mTc sestamibi-based molecular breast imaging (MBI) in this setting is largely unknown. Purpose To compare extent-of-disease assessments by using MRI, CEM, and MBI versus pathology in women with breast cancer. Materials and Methods In this HIPAA-compliant prospective study, women with biopsy-proven breast cancer underwent MRI, CEM, and MBI between October 2014 and April 2018. Eight radiologists independently interpreted each examination result prospectively and were blinded to interpretations of findings with the other modalities. Visibility of index malignancies, lesion size, and additional suspicious lesions (malignant or benign) were compared during pathology review. Accuracy of index lesion sizing and detection of additional lesions in women without neoadjuvant chemotherapy were compared. Results A total of 102 women were enrolled and 99 completed the study protocol (mean age, 51 years ± 11 [standard deviation]; range, 32-77 years). Lumpectomy or mastectomy was performed in 71 women (79 index malignancies) without neoadjuvant chemotherapy and in 28 women (31 index malignancies) with neoadjuvant chemotherapy. Of the 110 index malignancies, MRI, CEM, and MBI depicted 102 (93%; 95% confidence interval [CI]: 86%, 97%), 100 (91%; 95% CI: 84%, 96%), and 101 (92%; 95% CI: 85%, 96%) malignancies, respectively. In patients without neoadjuvant chemotherapy, pathologic size of index malignancies was overestimated with all modalities (P = .02). MRI led to overestimation of 24% (17 of 72) of malignancies by more than 1.5 cm compared with 11% (eight of 70) with CEM and 15% (11 of 72) with MBI. MRI depicted more (P = .007) nonindex lesions, with sensitivity similar to that of CEM or MBI, resulting in lower positive predictive value of additional biopsies (13 of 46 [28%; 95% CI: 17%, 44%] for MRI; 14 of 27 [52%; 95% CI: 32%, 71%] for CEM; and 11 of 25 [44%; 95% CI: 24%, 65%] for MBI (overall P = .01). Conclusion Contrast-enhanced mammography, molecular breast imaging, and MRI showed similar detection of all malignancies. MRI depicted more nonindex suspicious benign lesions than did contrast-enhanced mammography or molecular breast imaging, leading to lower positive predictive value of additional biopsies. All three modalities led to overestimation of index tumor size, particularly MRI. © RSNA, 2019 Online supplemental material is available for this article.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Adulto , Anciano , Medios de Contraste , Femenino , Humanos , Imagen por Resonancia Magnética , Mamografía , Persona de Mediana Edad , Imagen Molecular , Estadificación de Neoplasias , Estudios Prospectivos , Radiofármacos , Sensibilidad y Especificidad , Tecnecio Tc 99m SestamibiRESUMEN
PURPOSE: To assess the effect of and interaction between the availability of prior images and digital breast tomosynthesis (DBT) images in decisions to recall women during mammogram interpretation. MATERIALS AND METHODS: Verbal informed consent was obtained for this HIPAA-compliant institutional review board-approved protocol. Eight radiologists independently interpreted twice deidentified mammograms obtained in 153 women (age range, 37-83 years; mean age, 53.7 years ± 9.3 [standard deviation]) in a mode by reader by case-balanced fully crossed study. Each study consisted of current and prior full-field digital mammography (FFDM) images and DBT images that were acquired in our facility between June 2009 and January 2013. For one reading, sequential ratings were provided by using (a) current FFDM images only, (b) current FFDM and DBT images, and (c) current FFDM, DBT, and prior FFDM images. The other reading consisted of (a) current FFDM images only, (b) current and prior FFDM images, and (c) current FFDM, prior FFDM, and DBT images. Fifty verified cancer cases, 60 negative and benign cases (clinically not recalled), and 43 benign cases (clinically recalled) were included. Recall recommendations and interaction between the effect of prior FFDM and DBT images were assessed by using a generalized linear model accounting for case and reader variability. RESULTS: Average recall rates in noncancer cases were significantly reduced with the addition of prior FFDM images by 34% (145 of 421) and 32% (106 of 333) without and with DBT images, respectively (P < .001). However, this recall reduction was achieved at the cost of a corresponding 7% (23 of 345) and 4% (14 of 353) reduction in sensitivity (P = .006). In contrast, availability of DBT images resulted in a smaller reduction in recall rates (false-positive interpretations) of 19% (76 of 409) and 26% (71 of 276) without and with prior FFDM images, respectively (P = .001). Availability of DBT images resulted in 4% (15 of 338) and 8% (25 of 322) increases in sensitivity, respectively (P = .007). The effects of the availability of prior FFDM images or DBT images did not significantly change regardless of the sequence in presentation (P = .81 and P = .47 for specificity and sensitivity, respectively). CONCLUSION: The availability of prior FFDM or DBT images is a largely independent contributing factor in reducing recall recommendations during mammographic interpretation.
Asunto(s)
Enfermedades de la Mama/diagnóstico por imagen , Mamografía/métodos , Intensificación de Imagen Radiográfica , Interpretación de Imagen Radiográfica Asistida por Computador , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Humanos , Persona de Mediana Edad , Estudios RetrospectivosRESUMEN
PURPOSE: To assess interpretation performance and radiation dose when two-dimensional synthesized mammography (SM) images versus standard full-field digital mammography (FFDM) images are used alone or in combination with digital breast tomosynthesis images. MATERIALS AND METHODS: A fully crossed, mode-balanced multicase (n = 123), multireader (n = 8), retrospective observer performance study was performed by using deidentified images acquired between 2008 and 2011 with institutional review board approved, HIPAA-compliant protocols, during which each patient signed informed consent. The cohort included 36 cases of biopsy-proven cancer, 35 cases of biopsy-proven benign lesions, and 52 normal or benign cases (Breast Imaging Reporting and Data System [BI-RADS] score of 1 or 2) with negative 1-year follow-up results. Accuracy of sequentially reported probability of malignancy ratings and seven-category forced BI-RADS ratings was evaluated by using areas under the receiver operating characteristic curve (AUCs) in the random-reader analysis. RESULTS: Probability of malignancy-based mean AUCs for SM and FFDM images alone was 0.894 and 0.889, respectively (difference, -0.005; 95% confidence interval [CI]: -0.062, 0.054; P = .85). Mean AUC for SM with tomosynthesis and FFDM with tomosynthesis was 0.916 and 0.939, respectively (difference, 0.023; 95% CI: -0.011, 0.057; P = .19). In terms of the reader-specific AUCs, five readers performed better with SM alone versus FFDM alone, and all eight readers performed better with combined FFDM and tomosynthesis (absolute differences from 0.003 to 0.052). Similar results were obtained by using a nonparametric analysis of forced BI-RADS ratings. CONCLUSION: SM alone or in combination with tomosynthesis is comparable in performance to FFDM alone or in combination with tomosynthesis and may eliminate the need for FFDM as part of a routine clinical study.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Mamografía/métodos , Intensificación de Imagen Radiográfica/métodos , Adulto , Anciano , Biopsia , Neoplasias de la Mama/patología , Femenino , Humanos , Persona de Mediana Edad , Variaciones Dependientes del Observador , Dosis de Radiación , Interpretación de Imagen Radiográfica Asistida por Computador/métodos , Estudios RetrospectivosRESUMEN
PURPOSE: To assess diagnostic performance of digital breast tomosynthesis (DBT) alone or combined with technologist-performed handheld screening ultrasound (US) in women with dense breasts. METHODS: In an institutional review board-approved, Health Insurance Portability and Accountability Act-compliant multicenter protocol in western Pennsylvania, 6,179 women consented to three rounds of annual screening, interpreted by two radiologist observers, and had appropriate follow-up. Primary analysis was based on first observer results. RESULTS: Mean participant age was 54.8 years (range, 40-75 years). Across 17,552 screens, there were 126 cancer events in 125 women (7.2/1,000; 95% CI, 5.9 to 8.4). In year 1, DBT-alone cancer yield was 5.0/1,000, and of DBT+US, 6.3/1,000, difference 1.3/1,000 (95% CI, 0.3 to 2.1; P = .005). In years 2 + 3, DBT cancer yield was 4.9/1,000, and of DBT+US, 5.9/1,000, difference 1.0/1,000 (95% CI, 0.4 to 1.5; P < .001). False-positive rate increased from 7.0% for DBT in year 1 to 11.5% for DBT+US and from 5.9% for DBT in year 2 + 3 to 9.7% for DBT+US (P < .001 for both). Nine cancers were seen only by double reading DBT and one by double reading US. Ten interval cancers (0.6/1,000 [95% CI, 0.2 to 0.9]) were identified. Despite reduction in specificity, addition of US improved receiver operating characteristic curves, with area under receiver operating characteristic curve increasing from 0.83 for DBT alone to 0.92 for DBT+US in year 1 (P = .01), with smaller improvements in subsequent years. Of 6,179 women, across all 3 years, 172/6,179 (2.8%) unique women had a false-positive biopsy because of DBT as did another 230/6,179 (3.7%) women because of US (P < .001). CONCLUSION: Overall added cancer detection rate of US screening after DBT was modest at 19/17,552 (1.1/1,000; CI, 0.5- to 1.6) screens but potentially overcomes substantial increases in false-positive recalls and benign biopsies.
Asunto(s)
Neoplasias de la Mama , Mamografía , Humanos , Femenino , Adulto , Persona de Mediana Edad , Anciano , Masculino , Mamografía/métodos , Densidad de la Mama , Estudios Prospectivos , Detección Precoz del Cáncer/métodos , Tamizaje Masivo/métodosRESUMEN
OBJECTIVE: The purpose of our study was to assess diagnostic performance when retrospectively interpreting full-field digital mammography (FFDM) and breast tomosynthesis examinations under a free-response receiver operating characteristic (FROC) paradigm. MATERIALS AND METHODS: We performed FROC analysis of a previously reported study in which eight experienced radiologists interpreted 125 examinations, including 35 with verified cancers. The FROC paradigm involves detecting, locating, and rating each suspected abnormality. Radiologists reviewed and rated both FFDM alone and a combined display mode of FFDM and digital breast tomosynthesis (DBT) (combined). Observer performance levels were assessed and compared with respect to the fraction of correctly identified abnormalities, the number of reported location-specific findings (both true and false), and their associated ratings. The analysis accounts for the number and locations of findings and the location-based ratings using a summary performance index (Λ), which is the FROC analog of the area between the receiver operating characteristic curve and the diagonal (chance) line. RESULTS: Under the FROC paradigm, each reader detected more true abnormalities associated with cancer, or a higher true-positive fraction, under the combined mode. In an analysis focused on both the number of findings and associated location-based ratings, each of the radiologists performed better under the combined mode compared with FFDM alone, with increases in Λ ranging from 5% to 34%. On average, under the combined mode radiologists achieved a 16% improvement in Λ compared with the FFDM alone mode (95% CI, 7-26%; p < 0.01). CONCLUSION: We showed that DBT-based breast imaging in combination with FFDM could result in better performance under the FROC paradigm.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Mamografía/métodos , Intensificación de Imagen Radiográfica/métodos , Femenino , Humanos , Curva ROC , Interpretación de Imagen Radiográfica Asistida por Computador/métodos , Estudios RetrospectivosRESUMEN
OBJECTIVE: The purpose of this article is to compare the ability of digital breast tomosynthesis and full field digital mammography (FFDM) to detect and characterize calcifications. MATERIALS AND METHODS: One hundred paired examinations were performed utilizing FFDM and digital breast tomosynthesis. Twenty biopsy-proven cancers, 40 biopsy-proven benign calcifications, and 40 randomly selected negative screening studies were retrospectively reviewed by five radiologists in a crossed multireader multimodal observer performance study. Data collected included the presence of calcifications and forced BI-RADS scores. Receiver operator curve analysis using BI-RADS was performed. RESULTS: Overall calcification detection sensitivity was higher for FFDM (84% [95% CI, 79-88%]) than for digital breast tomosynthesis (75% [95% CI, 70-80%]). [corrected] In the cancer cohort, 75 (76%) of 99 interpretations identified calcification in both modes. Of those, a BI-RADS score less than or equal to 2 was rendered in three (4%) and nine (12%) cases with FFDM and digital breast tomosynthesis, respectively. In the benign cohort, 123 (62%) of 200 interpretations identified calcifications in both modes. Of those, a BI-RADS score greater than or equal to 3 was assigned in 105 (85%) and 93 (76%) cases with FFDM and digital breast tomosynthesis, respectively. There was no significant difference in the nonparametric computed area under the receiver operating characteristic curves (AUC) using the BI-RADS scores (FFDM, AUC = 0.76 and SD = 0.03; digital breast tomosynthesis, AUC = 0.72 and SD = 0.04 [p = 0.1277]). CONCLUSION: In this small data set, FFDM appears to be slightly more sensitive than digital breast tomosynthesis for the detection of calcification. However, diagnostic performance as measured by area under the curve using BI-RADS was not significantly different. With improvements in processing algorithms and display, digital breast tomosynthesis could potentially be improved for this purpose.
Asunto(s)
Enfermedades de la Mama/diagnóstico por imagen , Calcinosis/clasificación , Calcinosis/diagnóstico por imagen , Mamografía/métodos , Intensificación de Imagen Radiográfica/métodos , Adulto , Neoplasias de la Mama/diagnóstico por imagen , Estudios de Cohortes , Diagnóstico Diferencial , Femenino , Enfermedad Fibroquística de la Mama/diagnóstico por imagen , Humanos , Curva ROC , Estudios Retrospectivos , Sensibilidad y Especificidad , Estadísticas no ParamétricasRESUMEN
OBJECTIVE: The purpose of our study was to subjectively compare additional mammographic views to digital breast tomosynthesis (DBT) in the characterizing of known masses, architectural distortions, or asymmetries. MATERIALS AND METHODS: Four experienced radiologists serially reviewed the imaging studies of 25 women with known masses, including full-field digital mammography (FFDM), additional views, and DBT. After review of the examinations, radiologists rated their relative preference in terms of classifying the finding in question when aided by the additional views versus aided by DBT, their combined diagnostic BI-RADS rating of the finding when both examinations were available, and whether or not they felt comfortable eliminating ultrasound in the specific cases being evaluated as a result of the DBT. RESULTS: FFDM and DBT (combined) were perceived to be better for diagnosis in 50% (50/100) of the ratings (25 cases x four readers = 100 ratings) compared with FFDM and additional diagnostic views. Over all readers, 92% of the ratings for verified cancer cases and 50% of the ratings for high-risk cases were rated as BI-RADS 4 or 5. In 12% (12/100) of the ratings, radiologists indicated that the availability of DBT would have eliminated the need for ultrasound as a part of the diagnostic process. CONCLUSION: DBT may be an alternative to obtaining additional mammographic views in most but not all cases of patients with a lesion that is not solely calcifications. In a fraction of cases, the use of DBT may eliminate the need for ultrasound.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Mamografía/métodos , Intensificación de Imagen Radiográfica/métodos , Adulto , Anciano , Anciano de 80 o más Años , Femenino , Humanos , Persona de Mediana Edad , Variaciones Dependientes del Observador , Reproducibilidad de los Resultados , Sensibilidad y Especificidad , Adulto JovenRESUMEN
OBJECTIVE: To assess prospectively the interpretative performance of automated breast ultrasound (ABUS) as a supplemental screening after digital breast tomosynthesis (DBT) or as a standalone screening of women with dense breast tissue. METHODS: Under an IRB-approved protocol (written consent required), women with dense breasts prospectively underwent concurrent baseline DBT and ABUS screening. Examinations were independently evaluated, in opposite order, by two of seven Mammography Quality Standards Act-qualified radiologists, with the primary radiologist arbitrating disagreements and making clinical management recommendations. We report results for 1111 screening examinations (598 first year and 513 second year) for which all diagnostic workups are complete. Imaging was also retrospectively reviewed for all cancers. Statistical assessments used a 0.05 significance level and accounted for correlation between participants' examinations. RESULTS: Of 1111 women screened, primary radiologists initially "recalled" based on DBT alone (6.6%, 73/1111, CI: 5.2%-8.2%), of which 20 were biopsied, yielding 6/8 total cancers. Automated breast ultrasound increased recalls overall to 14.4% (160/1111, CI: 12.4%-16.6%), with 27 total biopsies, yielding 1 additional cancer. Double reading of DBT alone increased the recall rate to 10.7% (119/1111), with 21 biopsies, with no improvement in cancer detection. Double reading ABUS increased the recall rate to 15.2% (169/1111, CI: 13.2%-17.5%) of women, of whom 22 were biopsied, yielding the detection of 7 cancers, including one seen only on double reading ABUS. Inter-radiologist agreement was similar for recall recommendations from DBT (κ = 0.24, CI: 0.14-0.34) and ABUS (κ = 0.23, CI: 0.15-0.32). Integrated assessments from both readers resulted in a recall rate of 15.1% (168/1111, CI: 13.1%-17.4%). CONCLUSION: Supplemental or standalone ABUS screening detected cancers not seen on DBT, but substantially increased noncancer recall rates.
RESUMEN
OBJECTIVE: The purpose of this study was to compare in a retrospective observer study the diagnostic performance of full-field digital mammography (FFDM) with that of digital breast tomosynthesis. MATERIALS AND METHODS: Eight experienced radiologists interpreted images from 125 selected examinations, 35 with verified findings of cancer and 90 with no finding of cancer. The four display conditions included FFDM alone, 11 low-dose projections, reconstructed digital breast tomosynthesis images, and a combined display mode of FFDM and digital breast tomosynthesis images. Observers rated examinations using the screening BI-RADS rating scale and the free-response receiver operating characteristic paradigm. Observer performance levels were measured as the proportion of examinations prompting recall of patients for further diagnostic evaluation. The results were presented in terms of true-positive fraction and false-positive fraction. Performance levels were compared among the acquisitions and reading modes. Time to view and interpret an examination also was evaluated. RESULTS: Use of the combination of digital breast tomosynthesis and FFDM was associated with 30% reduction in recall rate for cancer-free examinations that would have led to recall if FFDM had been used alone (p < 0.0001 for the participating radiologists, p = 0.047 in the context of a generalized population of radiologists). Use of digital breast tomosynthesis alone also tended to reduce recall rates, an average of 10%, although the observed decrease was not statistically significant (p = 0.09 for the participating radiologists). There was no convincing evidence that use of digital breast tomosynthesis alone or in combination with FFDM results in a substantial improvement in sensitivity. CONCLUSION: Use of digital breast tomosynthesis for breast imaging may result in a substantial decrease in recall rate.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Presentación de Datos , Imagenología Tridimensional , Mamografía/métodos , Intensificación de Imagen Radiográfica , Interpretación de Imagen Radiográfica Asistida por Computador/métodos , Reacciones Falso Positivas , Femenino , Humanos , Variaciones Dependientes del Observador , Valor Predictivo de las Pruebas , Curva ROC , Reproducibilidad de los Resultados , Estudios Retrospectivos , Sensibilidad y Especificidad , Análisis y Desempeño de Tareas , Tomografía Computarizada por Rayos XRESUMEN
Electrical impedance spectroscopy has been investigated with but limited success as an adjunct procedure to mammography and as a possible pre-screening tool to stratify risk for having or developing breast cancer in younger women. In this study, the authors explored a new resonance frequency based [resonance electrical impedance spectroscopy (REIS)] approach to identify breasts that may have highly suspicious abnormalities that had been recommended for biopsies. The authors assembled a prototype REIS system generating multifrequency electrical sweeps ranging from 100 to 4100 kHz every 12 s. Using only two probes, one in contact with the nipple and the other with the outer breast skin surface 60 mm away, a paired transmission signal detection system is generated. The authors recruited 150 women between 30 and 50 years old to participate in this study. REIS measurements were performed on both breasts. Of these women 58 had been scheduled for a breast biopsy and 13 had been recalled for additional imaging procedures due to suspicious findings. The remaining 79 women had negative screening examinations. Eight REIS output signals at and around the resonance frequency were computed for each breast and the subtracted signals between the left and right breasts were used in a simple jackknifing method to select an optimal feature set to be inputted into a multi-feature based artificial neural network (ANN) that aims to predict whether a woman's breast had been determined as abnormal (warranting a biopsy) or not. The classification performance was evaluated using a leave-one-case-out method and receiver operating characteristics (ROC) analysis. The study shows that REIS examination is easy to perform, short in duration, and acceptable to all participants in terms of comfort level and there is no indication of sensation of an electrical current during the measurements. Six REIS difference features were selected as input signals to the ANN. The area under the ROC curve (A(z)) was 0.707 +/- 0.033 for classifying between biopsy cases and non-biopsy (including recalled and screening negative) and the performance (A(z)) increased to 0.746 +/- 0.033 after excluding recalled but negative cases. At 95% specificity, the sensitivity levels were approximately 20.5% and 30.4% in the two data sets tested. The results suggest that differences in REIS signals between two breasts measured in and around the tissue resonance frequency can be used to identify at least some of the women with suspicious abnormalities warranting biopsy with high specificity.
Asunto(s)
Neoplasias de la Mama/diagnóstico , Neoplasias de la Mama/patología , Mama/patología , Impedancia Eléctrica , Análisis Espectral/métodos , Adulto , Biopsia , Femenino , Humanos , Mamografía/métodos , Oncología Médica/métodos , Persona de Mediana Edad , Modelos Estadísticos , Redes Neurales de la Computación , Curva ROC , RiesgoRESUMEN
OBJECTIVE: The objective of our study was to assess ergonomic and diagnostic performance-related issues associated with the interpretation of digital breast tomosynthesis-generated examinations. MATERIALS AND METHODS: Thirty selected cases were read under three different display conditions by nine experienced radiologists in a fully crossed, mode-balanced observer performance study. The reading modes included full-field digital mammography (FFDM) alone, the 11 low-dose projections acquired for the reconstruction of tomosynthesis images, and the reconstructed digital breast tomosynthesis examination. Observers rated cases under the free-response receiver operating characteristic, as well as a screening paradigm, and provided subjective assessments of the relative diagnostic value of the two digital breast tomosynthesis-based image sets as compared with FFDM. The time to review and diagnose each case was also evaluated. RESULTS: Observer performance measures were not statistically significant (p > 0.05) primarily because of the small sample size in this pilot study, suggesting that showing significant improvements in diagnosis, if any, will require a larger study. Several radiologists did perceive the digital breast tomosynthesis image set and the projection series to be better than FFDM (p < 0.05) for diagnosing this specific case set. The time to review, interpret, and rate the examinations was significantly different for the techniques in question (p < 0.05). CONCLUSION: Tomosynthesis-based breast imaging may have great potential, but much work is needed before its optimal role in the clinical environment is known.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Mamografía/métodos , Intensificación de Imagen Radiográfica , Interpretación de Imagen Radiográfica Asistida por Computador , Competencia Clínica , Ergonomía , Humanos , Variaciones Dependientes del Observador , Proyectos PilotoRESUMEN
RATIONALE AND OBJECTIVES: The clinical utility of interactive computer-aided diagnosis (ICAD) systems depends on clinical relevance and visual similarity between the queried breast lesions and the ICAD-selected reference regions. The objective of this study is to develop and test a new ICAD scheme that aims improve visual similarity of ICAD-selected reference regions. MATERIALS AND METHODS: A large and diverse reference library involving 3,000 regions of interests was established. For each queried breast mass lesion by the observer, the ICAD scheme segments the lesion, classifies its boundary spiculation level, and computes 14 image features representing the segmented lesion and its surrounding tissue background. A conditioned k-nearest neighbor algorithm is applied to select a set of the 25 most "similar" lesions from the reference library. After computing the mutual information between the queried lesion and each of these initially selected 25 lesions, the scheme displays the six reference lesions with the highest mutual information scores. To evaluate the automated selection process of the six "visually similar" lesions to the queried lesion, we conducted a two-alternative forced-choice observer preference study using 85 queried mass lesions. Two sets of reference lesions selected by one new automated ICAD scheme and the other previously reported scheme using a subjective rating method were randomly displayed on the left and right side of the queried lesion. Nine observers were asked to decide for each of the 85 queried lesions which one of the two reference sets was "more visually similar" to the queried lesion. RESULTS: In classification of mass boundary spiculation levels, the overall agreement rate between the automated scheme and an observer is 58.8% (Kappa = 0.31). In observer preference study, the nine observers preferred on average the reference lesion sets selected by the automated scheme as being more visually similar than the set selected by the subjective rating approach in 53.2% of the queried lesions. The results were not significantly different for the two methods (P = .128). CONCLUSIONS: This study suggests that using the new automated ICAD scheme, the interobserver variability related issues can thus be avoided. Furthermore, the new scheme maintains the similar performance level as the previous scheme using the subjective rating method that can select reference sets that are significantly more visually similar (P < .05) than when using traditional ICAD schemes in which the mass boundary spiculation levels are not accurately detected and quantified.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Mamografía , Interpretación de Imagen Radiográfica Asistida por Computador , Femenino , Humanos , Variaciones Dependientes del ObservadorRESUMEN
RATIONALE AND OBJECTIVES: Assess results of a prospective, single-site clinical study evaluating digital breast tomosynthesis (DBT) during baseline screening mammography. MATERIALS AND METHODS: Under an institutional review board-approved Health Insurance Portability and Accountability Act (HIPAA)-compliant protocol, consenting women between ages 34 and 56 years scheduled for their initial and/or baseline screening mammogram underwent both full field digital mammography (FFDM) and DBT. The FFDM and the FFDM plus DBT images were interpreted independently in a reader by mode balanced approach by two of 14 participating radiologists. A woman was recalled for a diagnostic work-up if either radiologist recommended a recall. We report overall recall rates and related diagnostic outcome from the 1080 participants. Proportion of recommended recalls (Breast Imaging Reporting and Data System 0) were compared using a generalized linear mixed model (SAS 9.3) with a significance level of P = .0294. RESULTS: The fraction of women without breast cancer recommended for recall using FFDM alone and FFDM plus DBT were 412 of 1074 (38.4%) and 274 of 1074 (25.5%), respectively (P < .001). Large inter-reader variability in terms of recall reduction was observed among the 14 readers; however, 11 of 14 readers recalled fewer women using FFDM plus DBT (5 with P < .015). Six cancers (four ductal carcinomas in situ [DCIS] and two invasive ductal carcinomas [IDC]) were detected. One IDC was detected only on DBT and one DCIS cancer was detected only on FFDM, whereas the remaining cancers were detected on both modalities. CONCLUSIONS: The use of FFDM plus DBT resulted in a significant decrease in recall rates during baseline screening mammography with no reduction in sensitivity.
Asunto(s)
Citas y Horarios , Neoplasias de la Mama/diagnóstico por imagen , Mamografía/métodos , Interpretación de Imagen Radiográfica Asistida por Computador , Adulto , Carcinoma Ductal de Mama/diagnóstico por imagen , Carcinoma Intraductal no Infiltrante/diagnóstico por imagen , Detección Precoz del Cáncer , Femenino , Humanos , Persona de Mediana Edad , Pennsylvania , Estudios Prospectivos , Tomografía Computarizada por Rayos X/métodosRESUMEN
RATIONALE AND OBJECTIVES: The aim of this study was to retrospectively compare the interpretive performance of synthetically reconstructed two-dimensional images in combination with digital breast tomosynthesis (DBT) versus full-field digital mammography (FFDM) plus DBT. MATERIALS AND METHODS: Ten radiologists trained in reading tomosynthesis examinations interpreted retrospectively, under two modes, 114 mammograms. One mode included the directly acquired full-field digital mammograms combined with DBT, and the other included synthetically reconstructed projection images combined with DBT. The reconstructed images do not require additional radiation exposure. The two modes were compared with respect to sensitivity, namely, recommendation to recall a breast with either a pathology-proven cancer (n = 48) or a high-risk lesion (n = 6), and specificity, namely, no recommendation to recall a breast not depicting an abnormality (n = 144) or depicting only benign abnormalities (n = 30). RESULTS: The average sensitivity for FFDM with DBT was 0.826, compared to 0.772 for synthetic FFDM with DBT (difference, 0.054; P = .017 and P = .053 for fixed and random reader effects, respectively). The proportions of breasts with no or benign abnormalities recommended to be recalled were virtually the same: 0.298 and 0.297 for the two modalities, respectively (95% confidence intervals for the difference, -0.028 to 0.036 and -0.070 to 0.066 for fixed and random reader effects, respectively). Sixteen additional clusters of microcalcifications ("positive" breasts) were missed by all readers combined when interpreting the mode with synthesized images versus FFDM. CONCLUSIONS: Lower sensitivity with comparable specificity was observed with the tested version of synthetically generated images compared to FFDM, both combined with DBT. Improved synthesized images with experimentally verified acceptable diagnostic quality will be needed to eliminate double exposure during DBT-based screening.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Dosis de Radiación , Interpretación de Imagen Radiográfica Asistida por Computador/métodos , Tomografía Computarizada por Rayos X/métodos , Adulto , Anciano , Femenino , Humanos , Imagenología Tridimensional , Modelos Lineales , Mamografía , Persona de Mediana Edad , Intensificación de Imagen Radiográfica/métodos , Estudios Retrospectivos , Sensibilidad y EspecificidadRESUMEN
RATIONALE AND OBJECTIVES: The aim of this study was to assess similarities and differences between methods of performance comparisons under binary (yes or no) and receiver-operating characteristic (ROC)-type pseudocontinuous (0-100) rating data ascertained during an observer performance study of interpretation of full-field digital mammography (FFDM) versus FFDM plus digital breast tomosynthesis. MATERIALS AND METHODS: Rating data consisted of ROC-type pseudocontinuous and binary ratings generated by eight radiologists evaluating 77 digital mammographic examinations. Overall performance levels were summarized with a conventionally used probability of correct discrimination or, equivalently, the area under the ROC curve (AUC), which under a binary scale is related to Youden's index. Magnitudes of differences in the reader-averaged empirical AUCs between FFDM alone and FFDM plus digital breast tomosynthesis were compared in the context of fixed-reader and random-reader variability of the estimates. RESULTS: The absolute differences between modes using the empirical AUCs were larger on average for the binary scale (0.12 vs 0.07) and for the majority of individual readers (six of eight). Standardized differences were consistent with this finding (2.32 vs 1.63 on average). Reader-averaged differences in AUCs standardized by fixed-reader and random-reader variances were also smaller under the binary rating paradigm. The discrepancy between AUC differences depended on the location of the reader-specific binary operating points. CONCLUSIONS: The human observer's operating point should be a primary consideration in designing an observer performance study. Although in general, the ROC-type rating paradigm provides more detailed information on the characteristics of different modes, it does not reflect the actual operating point adopted by human observers. There are application-driven scenarios in which analysis based on binary responses may provide statistical advantages.
Asunto(s)
Algoritmos , Neoplasias de la Mama/diagnóstico por imagen , Mamografía/métodos , Intensificación de Imagen Radiográfica/métodos , Interpretación de Imagen Radiográfica Asistida por Computador/métodos , Femenino , Humanos , Variaciones Dependientes del Observador , Curva ROC , Reproducibilidad de los Resultados , Sensibilidad y EspecificidadRESUMEN
RATIONALE AND OBJECTIVES: Computerized determination of optimal search areas on mammograms for matching breast mass regions depicted on two ipsilateral views remains a challenge for developing multiview-based computer-aided detection (CAD) schemes. The purpose of this study was to compare three methods aimed at matching CAD-cued mass regions depicted on two views and the associated impact on CAD performance. MATERIALS AND METHODS: The three search methods used (1) an annular (fan-shaped) band, (2) a straight strip perpendicular to the estimated centerline, and (3) a mixed search area bound on the chest wall side by a straight line and an annular arc on the nipple side, respectively. An image database of 200 examinations with positive results depicting the masses on two views and 200 examinations with negative results was used for testing. Two performance assessment experiments were conducted. The first investigated the maximum matching sensitivity as a function of the search area size, and the second assessed the change in CAD performance using these three search methods. RESULTS: To include all 200 paired mass regions within the search areas, maximum widths were 28 and 68 mm for the use of the straight strip and the annular band search methods, respectively. When applying a single-image-based CAD scheme to this image database, 172 masses (86% sensitivity) and 523 false-positive (FP) regions (0.33 per image) were detected and cued. Among the positive findings, 92 were cued by the CAD system on both views, and 80 were cued on only one view. In an attempt to match as many of the 172 CAD-cued masses (true-positive [TP] regions) on two views by incrementally reducing the CAD threshold inside the different search areas, the CAD scheme generated 158 TP-TP paired matches with 14 TP-FP paired matches, 142 TP-TP paired matches with 30 TP-FP paired matches, and 146 TP-TP paired matches with 26 TP-FP paired matches, using the methods involving the straight strip, the annular band, and the mixed search areas, respectively. Using the straight strip search method, the CAD also eliminated 25% of FP regions initially cued by the single-image-based CAD scheme and generated the lowest case-based FP detection rate, namely, 15% less than that generated by the annular band method. CONCLUSIONS: This study showed that among these three search methods, the straight strip method required a smaller search area and achieved the highest level of CAD performance.
Asunto(s)
Algoritmos , Neoplasias de la Mama/diagnóstico por imagen , Imagenología Tridimensional/métodos , Mamografía/métodos , Reconocimiento de Normas Patrones Automatizadas/métodos , Interpretación de Imagen Radiográfica Asistida por Computador/métodos , Técnica de Sustracción , Femenino , Humanos , Intensificación de Imagen Radiográfica/métodos , Reproducibilidad de los Resultados , Sensibilidad y EspecificidadRESUMEN
OBJECTIVE: This paper describes a high-quality, multisite telemammography system to enable "almost real-time" remote patient management while the patient remains in the clinic. One goal is to reduce the number of women who would physically need to return to the clinic for additional imaging procedures (termed "recall") to supplement "routine" imaging of screening mammography. MATERIALS AND METHODS: Mammography films from current and prior (when available) examinations are digitized at three remote sites and transmitted along with other pertinent information across low-level communication systems to the central site. Images are automatically cropped, wavelet compressed, and encrypted prior to transmission to the central site. At the central site, radiologists review and rate examinations on a high-resolution workstation that displays the images, computer-assisted detection results, and the technologist's communication. Intersite communication is provided instantly via a messaging "chat" window. RESULTS: The technologists recommended additional procedures at 2.7 times the actual clinical recall rate for the same cases. Using the telemammography system during a series of "off-line" clinically simulated studies, radiologists recommended additional procedures at 1.3 times the actual clinical recall rate. Percent agreement and kappa between the study and actual clinical interpretations were 66.1% and 0.315, respectively. For every physical recall potentially avoided using the telemammography system, approximately one presumed "unnecessary" imaging procedure was recommended. CONCLUSION: Remote patient management can reduce the number of women recalled by as much as 50% without performing an unreasonable number of presumed "unnecessary" procedures.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Mamografía , Tamizaje Masivo , Telerradiología , Instituciones de Atención Ambulatoria , Neoplasias de la Mama/epidemiología , Redes de Comunicación de Computadores , Simulación por Computador , Computadores , Sistemas de Administración de Bases de Datos , Femenino , Humanos , Procesamiento de Imagen Asistido por Computador , Variaciones Dependientes del Observador , Interpretación de Imagen Radiográfica Asistida por Computador , Sistemas de Información Radiológica , Proyectos de Investigación , Programas InformáticosRESUMEN
OBJECTIVE: Positive predictive value (PPV1) has been used as one important indicator of the quality of screening mammography programs. We show how the relationship between sensitivity and recall rate may affect the operating point at which optimal (maximum) PPV1 occurs. CONCLUSION: Optimal (maximum) PPV1 can occur at any sensitivity level and should not be used as the sole indicator for practice optimization because it does not take into account the number of cancers that would be missed at that sensitivity.
Asunto(s)
Neoplasias de la Mama/diagnóstico por imagen , Mamografía/normas , Calidad de la Atención de Salud , Neoplasias de la Mama/patología , Errores Diagnósticos , Humanos , Mamografía/estadística & datos numéricos , Tamizaje Masivo , Valor Predictivo de las Pruebas , Sensibilidad y EspecificidadRESUMEN
PURPOSE: To compare performance of two computer-aided detection (CAD) systems and an in-house scheme applied to five groups of sequentially acquired screening mammograms. MATERIALS AND METHODS: Two hundred nineteen film-based mammographic examinations, classified into five groups, were included in this study. Group 1 included 58 examinations in which verified malignant masses were detected during screening; group 2, 39 in which all available latest examinations were performed prior to diagnosis of these malignant masses (subset of 39 women from group 1); group 3, 22 in which findings were interpreted as negative but were verified as cancer within 1 year from the negative interpretation (missed cancers); group 4, 50 in which findings were negative and patients were not recalled for additional procedures; and group 5, 50 in which patients were recalled for additional procedures and findings were negative for cancer. In all examinations, images were processed with two Food and Drug Administration-approved commercially available CAD systems and an in-house scheme. Performance levels in terms of true-positive detection rates and number of false-positive identifications per image and per examination were compared. RESULTS: Mass detection rates in positive examinations (group 1) were 67%-72%. Detection rates among three systems were not significantly different (P > .05). In 50 negative screening examinations (group 4), false-positive rates ranged from 1.08 to 1.68 per four-view examination. Performance level differences among systems were significant for false-positive rates (P = .008). Performance of all systems was at levels lower than publicly suggested in some retrospective studies. False-positive CAD cueing rates were significantly higher for negative examinations in which patients were recalled (group 5) than they were for those in which patients were not recalled (group 4) (P < or = .002). CONCLUSION: Performance of CAD systems for mass detection at mammography varies significantly, depending on examination and system used. Actual performance of all systems in clinical environment can be improved.