ABSTRACT
OBJECTIVE: The purpose of this study was to evaluate the performance and potential contribution of computer-aided detection (CAD) to independent double reading of paired screen-film and full-field digital screening mammograms. MATERIALS AND METHODS: The cases of 3,683 women who underwent both screen-film mammography and full-field digital mammography (FFDM) with independent double reading for each technique were followed for 2 years to include cancers detected in the interval between screening rounds and cancers detected at the next screening round. Fifty-five biopsy-proven cancers were diagnosed. The baseline screening mammograms of the 55 cancers were defined as having positive findings if at least one of two independent readers scored it 2 or higher on a 5-point rating scale. The baseline mammograms of interval (n = 10) or secondround (n = 16) cancers were retrospectively classified as overlooked (n = 2), minimal sign actionable (n = 8), minimal sign nonactionable (n = 5), and normal (n = 11). The baseline mammograms of these cases of cancer were evaluated with a CAD system, and the CAD results were compared (McNemar's test for paired proportions) with the findings at prospective independent double reading of mammograms obtained with each technique. RESULTS: For FFDM, CAD sensitivity was 95% (37/39) compared with 64% (25/39) for double reading (p = 0.006), and for screen-film mammography, CAD sensitivity was 85% (33/39) compared with 77% (30/39) for prospective double reading (p = 0.57) of radiographically visible lesions in baseline mammograms. CAD correctly marked five (13%) of 39 cancers on screen-film mammography and 14 (36%) of 39 cancers on FFDM not detected at prospective independent double reading. CONCLUSION: CAD showed the potential to increase the cancer detection rate for FFDM and for screen-film mammography in breast cancer screening performed with independent double reading.
Subject(s)
Breast Neoplasms/diagnostic imaging , Breast Neoplasms/epidemiology , Mammography/statistics & numerical data , Mass Screening/methods , Mass Screening/statistics & numerical data , Pattern Recognition, Automated/methods , Radiographic Image Interpretation, Computer-Assisted/methods , Artificial Intelligence , Female , Humans , Mammography/methods , Middle Aged , Norway/epidemiology , Observer Variation , Radiographic Image Enhancement , Reproducibility of Results , Sensitivity and Specificity , X-Ray Film , X-Ray Intensifying ScreensABSTRACT
Full-field digital mammography (FFDM) with soft-copy reading is more complex than screen-film mammography (SFM) with hard-copy reading. The aim of this study was to compare inter- and intraobserver variability in SFM versus FFDM of paired mammograms from a breast cancer screening program. Six radiologists interpreted mammograms of 232 cases obtained with both techniques, including 46 cancers, 88 benign lesions, and 98 normals. Image interpretation included BI-RADS categories. A case consisted of standard two-view mammograms of one breast. Images were scored in two sessions separated by 5 weeks. Observer variability was substantial for SFM as well as for FFDM, but overall there was no significant difference between the observer variability at SFM and FFDM. Mean kappa values were lower, indicating less agreement, for microcalcifications compared with masses. The lower observer agreement for microcalcifications, and especially the low intraobserver concordance between the two imaging techniques for three readers, was noticeable. The level of observer agreement might be an indicator of radiologist performance and could confound studies designed to separate diagnostic differences between the two imaging techniques. The results of our study confirm the need for proper training for radiologists starting FFDM with soft-copy reading in breast cancer screening.
Subject(s)
Breast Neoplasms/diagnostic imaging , Mammography/methods , Radiographic Image Enhancement/methods , Aged , Female , Humans , Middle Aged , Observer Variation , ROC Curve , Statistics, NonparametricABSTRACT
PURPOSE: To retrospectively compare screen-film and full-field digital mammography with soft-copy interpretation for reader performance in detection and classification of breast lesions in women in a screening program. MATERIALS AND METHODS: Regional ethics committee approved the study; signed patient consents were obtained. Two-view mammograms were obtained with digital and screen-film systems at previous screening studies. Six readers interpreted images. Interpretation included Breast Imaging Reporting and Data System (BI-RADS) and five-level probability-of-malignancy scores. A case was one breast, with two standard views acquired with both screen-film mammography and digital mammography. The standard for an examination with normal findings was classification of normal (category 1) assigned by two independent readers; for cases with benign findings, the standard was benign results at diagnostic work-up in patients who were recalled. Cases with normal or benign findings that manifested as neither interval cancer nor as cancer at subsequent screening were considered the standard. All cancers were confirmed histologically. Images were interpreted by readers in two sessions 5 weeks apart; the same case was not seen twice in any session. Receiver operating characteristic (ROC) analysis and, for a given true-positive fraction, 2 x 2 table analysis and the McNemar test were used. For binary outcome, classification of BI-RADS category 3 or higher was defined as positive for cancer. RESULTS: Cases with proved findings (n = 232) were displayed: 46 with cancers, 88 with benign findings, and 98 with normal findings. ROC analysis for all readers and all cases revealed a higher area under ROC curve (A(z)) for digital mammography (0.916) than for screen-film mammography (0.887) (P = .22). Five of six readers had a higher performance rating with digital mammography; one of five demonstrated a significant difference in favor of digital mammography with A(z) values; two showed a significant difference in favor of digital mammography with ROC analysis for a given false-positive fraction (P = .01 and .03, respectively). For cases with cancer, digital mammography resulted in correct classification of an average of three additional cancers per reader. For digital versus screen-film mammography, 2 x 2 table analysis for cancers revealed a higher true-positive rate; for benign masses, a higher true-negative rate. Neither of these differences nor any others from analysis of subgroups between the modalities were significant. CONCLUSION: Digital mammography allowed correct classification of more breast cancers than did screen-film mammography. A(z) value was higher for digital mammography; this difference was not significant.
Subject(s)
Breast Diseases/diagnostic imaging , Breast Neoplasms/diagnostic imaging , Mammography/methods , Radiographic Image Enhancement/methods , Aged , Female , Humans , Middle Aged , Professional Competence , ROC Curve , Reference Standards , Retrospective StudiesABSTRACT
PURPOSE: To compare screen-film and full-field digital mammography with soft-copy reading in a population-based screening program. MATERIALS AND METHODS: Full-field digital and screen-film mammography were performed in 3,683 women aged 50-69 years. Two standard views of each breast were acquired with each modality. Images underwent independent double reading with use of a five-point rating scale for probability of cancer. Recall rates and positive predictive values were calculated. Cancer detection rates determined with both modalities were compared by using the McNemar test for paired proportions. Retrospective side-by-side analysis for conspicuity of cancers was performed by an external independent radiologist group with experience in both modalities. RESULTS: In 3,683 cases, 31 cancers were detected. Screen-film mammography depicted 28 (0.76%) malignancies, and full-field digital mammography depicted 23 (0.62%) malignancies. The difference between cancer detection rates was not significant (P =.23). The recall rate for full-field digital mammography (4.6%; 168 of 3,683 cases) was slightly higher than that for screen-film mammography (3.5%; 128 of 3,683 cases). The positive predictive value based on needle biopsy results was 46% for screen-film mammography and 39% for full-field digital mammography. Side-by-side image comparison for cancer conspicuity led to classification of 19 cancers as equal for probability of malignancy, six cancers as slightly better demonstrated at screen-film mammography, and six cancers as slightly better demonstrated at full-field digital mammography. CONCLUSION: There was no statistically significant difference in cancer detection rate between screen-film and full-field digital mammography. Cancer conspicuity was equal with both modalities. Full-field digital mammography with soft-copy reading is comparable to screen-film mammography in population-based screening.
Subject(s)
Mammography/methods , Mass Screening/methods , Radiographic Image Enhancement/methods , Aged , Biopsy, Needle , Breast Neoplasms/diagnostic imaging , Community Health Planning , Female , Humans , Middle Aged , Norway , Predictive Value of Tests , Retrospective StudiesABSTRACT
This study evaluated the results of fine needle aspiration cytology (FNAC) from the first four years of organized mammography screening for breast cancer in Oslo, particularly our policy in differentiating in situ and invasive carcinoma. Lesions were aspirated directly, ultrasound guided, by stereotaxic device or biopsy localization plate. All lesions were aspirated by cytopathologists working with the radiologists at the breast diagnostic centre. Smears were evaluated immediately for assessment of adequacy and a preliminary diagnosis was given to the surgeon. When FNAC revealed malignancy, diagnostic terms were as follows: (1) invasive carcinoma; (2) ductal carcinoma in situ of comedo type (high nuclear grade), cannot evaluate infiltration; (3) ductal carcinoma in situ of low nuclear grade and (4) papillary tumour, cannot evaluate infiltration. There were 953 cases, 70% of which were nonpalpable. Insufficient material was obtained in 5.8%. Absolute and complete sensitivity were 81% and 91%, respectively. Specificity was 85%. There were 448 histologically proven carcinomas. 383 of these were invasive. 362 carcinomas (in situ and invasive) (80.8%) were diagnosed directly on FNAC. Distinction between invasive and in situ carcinoma was possible in 294 of 320 directly diagnosed invasive carcinomas (91.8%). PPV of a diagnosis of invasive carcinoma was 97%. Our data showed that definitive cytological diagnosis of invasive carcinoma was possible in more than 90% of fully diagnostic smears and allowed definitive primary surgery in these women.