A position-enhanced sequential feature encoding model for lung infections and lymphoma classification on CT images.

Zhao, Rui; Li, Wenhao; Chen, Xilai; Li, Yuchong; He, Baochun; Zhang, Yucong; Deng, Yu; Wang, Chunyan; Jia, Fucang

Zhao, Rui; Li, Wenhao; Chen, Xilai; Li, Yuchong; He, Baochun; Zhang, Yucong; Deng, Yu; Wang, Chunyan; Jia, Fucang.

Afiliação

Zhao R; Research Center for Medical AI, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
Li W; Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China.
Chen X; Depatrment of Hematology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
Li Y; Department of Radiology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
He B; Research Center for Medical AI, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
Zhang Y; Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China.
Deng Y; Research Center for Medical AI, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
Wang C; Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China.
Jia F; Department of Radiation Oncology, Shenzhen People's Hospital, Shenzhen, China.

Int J Comput Assist Radiol Surg ; 2024 Jul 14.

Article em En | MEDLINE | ID: mdl-39003438

ABSTRACT

ABSTRACT

PURPOSE:

Differentiating pulmonary lymphoma from lung infections using CT images is challenging. Existing deep neural network-based lung CT classification models rely on 2D slices, lacking comprehensive information and requiring manual selection. 3D models that involve chunking compromise image information and struggle with parameter reduction, limiting performance. These limitations must be addressed to improve accuracy and practicality.

METHODS:

We propose a transformer sequential feature encoding structure to integrate multi-level information from complete CT images, inspired by the clinical practice of using a sequence of cross-sectional slices for diagnosis. We incorporate position encoding and cross-level long-range information fusion modules into the feature extraction CNN network for cross-sectional slices, ensuring high-precision feature extraction.

RESULTS:

We conducted comprehensive experiments on a dataset of 124 patients, with respective sizes of 64, 20 and 40 for training, validation and testing. The results of ablation experiments and comparative experiments demonstrated the effectiveness of our approach. Our method outperforms existing state-of-the-art methods in the 3D CT image classification problem of distinguishing between lung infections and pulmonary lymphoma, achieving an accuracy of 0.875, AUC of 0.953 and F1 score of 0.889.

CONCLUSION:

The experiments verified that our proposed position-enhanced transformer-based sequential feature encoding model is capable of effectively performing high-precision feature extraction and contextual feature fusion in the lungs. It enhances the ability of a standalone CNN network or transformer to extract features, thereby improving the classification performance. The source code is accessible at https//github.com/imchuyu/PTSFE .

Palavras-chave

3D CT images; Classification; Lung infection; Lung lymphoma

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links