Repeatability and reproducibility of deep learning features for lung adenocarcinoma subtypes with nodules less than 10 mm in size: a multicenter thin-slice computed tomography phantom and clinical validation study.

Zhan, Yi; Dai, Renxiang; Li, Fangyun; Cheng, Zenghui; Zhuo, Yaoyao; Shan, Fei; Zhou, Lingxiao

Zhan, Yi; Dai, Renxiang; Li, Fangyun; Cheng, Zenghui; Zhuo, Yaoyao; Shan, Fei; Zhou, Lingxiao.

Affiliation

Zhan Y; Department of Radiology, Shanghai Public Health Clinical Center, Fudan University, Shanghai, China.
Dai R; Institute of Microscale Optoelectronics, Shenzhen University, Shenzhen, China.
Li F; Lianren Digital Health Technology Co., Ltd., Shanghai, China.
Cheng Z; Department of Radiology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
Zhuo Y; Department of Radiology, Zhongshan Hospital, Fudan University School of Medicine, Shanghai, China.
Shan F; Department of Radiology, Shanghai Public Health Clinical Center, Fudan University, Shanghai, China.
Zhou L; Institute of Microscale Optoelectronics, Shenzhen University, Shenzhen, China.

Quant Imaging Med Surg ; 14(8): 5396-5407, 2024 Aug 01.

Article in En | MEDLINE | ID: mdl-39144035

ABSTRACT

ABSTRACT

Background:

Deep learning features (DLFs) derived from radiomics features (RFs) fused with deep learning have shown potential in enhancing diagnostic capability. However, the limited repeatability and reproducibility of DLFs across multiple centers represents a challenge in the clinically validation of these features. This study thus aimed to evaluate the repeatability and reproducibility of DLFs and their potential efficiency in differentiating subtypes of lung adenocarcinoma less than 10 mm in size and manifesting as ground-glass nodules (GGNs).

Methods:

A chest phantom with nodules was scanned repeatedly using different thin-slice computed tomography (TSCT) scanners with varying acquisition and reconstruction parameters. The robustness of the DLFs was measured using the concordance correlation coefficient (CCC) and intraclass correlation coefficient (ICC). A deep learning approach was used for visualizing the DLFs. To assess the clinical effectiveness and generalizability of the stable and informative DLFs, three hospitals were used to source 275 patients, in whom 405 nodules were pathologically differentially diagnosed as GGN lung adenocarcinoma less than 10 mm in size and were retrospectively reviewed for clinical validation.

Results:

A total of 64 DLFs were analyzed, which revealed that the variables of slice thickness and slice interval (ICC, 0.79±0.18) and reconstruction kernel (ICC, 0.82±0.07) were significantly associated with the robustness of DLFs. Feature visualization showed that the DLFs were mainly focused around the nodule areas. In the external validation, a subset of 28 robust DLFs identified as stable under all sources of variability achieved the highest area under curve [AUC =0.65, 95% confidence interval (CI) 0.53-0.76] compared to other DLF models and the radiomics model.

Conclusions:

Although different manufacturers and scanning schemes affect the reproducibility of DLFs, certain DLFs demonstrated excellent stability and effectively improved diagnostic the efficacy for identifying subtypes of lung adenocarcinoma. Therefore, as the first step, screening stable DLFs in multicenter DLFs research may improve diagnostic efficacy and promote the application of these features.

Key words

Deep learning features (DLFs); lung adenocarcinoma; multicenter; phantom; thin-slice computed tomography (TSCT)

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Quant Imaging Med Surg Year: 2024 Document type: Article Affiliation country: China Country of publication: China

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google