Dual encoder network with transformer-CNN for multi-organ segmentation.

Hong, Zhifang; Chen, Mingzhi; Hu, Weijie; Yan, Shiyu; Qu, Aiping; Chen, Lingna; Chen, Junxi

Hong, Zhifang; Chen, Mingzhi; Hu, Weijie; Yan, Shiyu; Qu, Aiping; Chen, Lingna; Chen, Junxi.

Afiliação

Hong Z; Computer School, University of South China, Hengyang, 421001, China.
Chen M; College of Mechanical and Vehicle Engineering, Hunan University, Hengyang, 410082, China.
Hu W; School of Economics and Management, Beijing University of Chemical Technology, Beijing, 100029, China.
Yan S; Computer School, University of South China, Hengyang, 421001, China.
Qu A; Computer School, University of South China, Hengyang, 421001, China.
Chen L; Computer School, University of South China, Hengyang, 421001, China. linda_cjx@163.com.
Chen J; Affiliated Nanhua Hospital, University of South China, Hengyang, 421001, China.

Med Biol Eng Comput ; 61(3): 661-671, 2023 Mar.

Article em En | MEDLINE | ID: mdl-36580181

ABSTRACT

ABSTRACT

Medical image segmentation is a critical step in many imaging applications. Automatic segmentation has gained extensive concern using a convolutional neural network (CNN). However, the traditional CNN-based methods fail to extract global and long-range contextual information due to local convolution operation. Transformer overcomes the limitation of CNN-based models. Inspired by the success of transformers in computer vision (CV), many researchers focus on designing the transformer-based U-shaped method in medical image segmentation. The transformer-based approach cannot effectively capture the fine-grained details. This paper proposes a dual encoder network with transformer-CNN for multi-organ segmentation. The new segmentation framework takes full advantage of CNN and transformer to enhance the segmentation accuracy. The Swin-transformer encoder extracts global information, and the CNN encoder captures local information. We introduce fusion modules to fuse convolutional features and the sequence of features from the transformer. Feature fusion is concatenated through the skip connection to smooth the decision boundary effectively. We extensively evaluate our method on the synapse multi-organ CT dataset and the automated cardiac diagnosis challenge (ACDC) dataset. The results demonstrate that the proposed method achieves Dice similarity coefficient (DSC) metrics of 80.68% and 91.12% on the synapse multi-organ CT and ACDC datasets, respectively. We perform the ablation studies on the ACDC dataset, demonstrating the effectiveness of critical components of our method. Our results match the ground-truth boundary more consistently than the existing models. Our approach gains more accurate results on challenging 2D images for multi-organ segmentation. Compared with the state-of-the-art methods, our proposed method achieves superior performance in multi-organ segmentation tasks. Graphical Abstract The key process in medical image segmentation.

Assuntos

Benchmarking; Fontes de Energia Elétrica; Coração; Redes Neurais de Computação; Sinapses; Processamento de Imagem Assistida por Computador

Palavras-chave

Convolutional neural network; Feature fusion; Image segmentation; Swin-transformer

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Fontes de Energia Elétrica / Benchmarking Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google