Your browser doesn't support javascript.
loading
A position-enhanced sequential feature encoding model for lung infections and lymphoma classification on CT images.
Zhao, Rui; Li, Wenhao; Chen, Xilai; Li, Yuchong; He, Baochun; Zhang, Yucong; Deng, Yu; Wang, Chunyan; Jia, Fucang.
Afiliação
  • Zhao R; Research Center for Medical AI, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
  • Li W; Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China.
  • Chen X; Depatrment of Hematology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
  • Li Y; Department of Radiology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
  • He B; Research Center for Medical AI, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
  • Zhang Y; Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China.
  • Deng Y; Research Center for Medical AI, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
  • Wang C; Shenzhen College of Advanced Technology, University of Chinese Academy of Sciences, Shenzhen, China.
  • Jia F; Department of Radiation Oncology, Shenzhen People's Hospital, Shenzhen, China.
Article em En | MEDLINE | ID: mdl-39003438
ABSTRACT

PURPOSE:

Differentiating pulmonary lymphoma from lung infections using CT images is challenging. Existing deep neural network-based lung CT classification models rely on 2D slices, lacking comprehensive information and requiring manual selection. 3D models that involve chunking compromise image information and struggle with parameter reduction, limiting performance. These limitations must be addressed to improve accuracy and practicality.

METHODS:

We propose a transformer sequential feature encoding structure to integrate multi-level information from complete CT images, inspired by the clinical practice of using a sequence of cross-sectional slices for diagnosis. We incorporate position encoding and cross-level long-range information fusion modules into the feature extraction CNN network for cross-sectional slices, ensuring high-precision feature extraction.

RESULTS:

We conducted comprehensive experiments on a dataset of 124 patients, with respective sizes of 64, 20 and 40 for training, validation and testing. The results of ablation experiments and comparative experiments demonstrated the effectiveness of our approach. Our method outperforms existing state-of-the-art methods in the 3D CT image classification problem of distinguishing between lung infections and pulmonary lymphoma, achieving an accuracy of 0.875, AUC of 0.953 and F1 score of 0.889.

CONCLUSION:

The experiments verified that our proposed position-enhanced transformer-based sequential feature encoding model is capable of effectively performing high-precision feature extraction and contextual feature fusion in the lungs. It enhances the ability of a standalone CNN network or transformer to extract features, thereby improving the classification performance. The source code is accessible at https//github.com/imchuyu/PTSFE .
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article