DSKCA-UNet: Dynamic selective kernel channel attention for medical image segmentation.

Shen, Longfeng; Wang, Qiong; Zhang, Yingjie; Qin, Fenglan; Jin, Hengjun; Zhao, Wei

Affiliation

Shen L; Anhui Engineering Research Center for Intelligent Computing and Application on Cognitive Behavior (ICACB), College of Computer Science and Technology, Huaibei Normal University, Huaibei, China.
Wang Q; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China.
Zhang Y; Anhui Big-Data Research Center on University Management, Huaibei, China.
Qin F; Anhui Engineering Research Center for Intelligent Computing and Application on Cognitive Behavior (ICACB), College of Computer Science and Technology, Huaibei Normal University, Huaibei, China.
Jin H; Anhui Big-Data Research Center on University Management, Huaibei, China.
Zhao W; Anhui Engineering Research Center for Intelligent Computing and Application on Cognitive Behavior (ICACB), College of Computer Science and Technology, Huaibei Normal University, Huaibei, China.

Medicine (Baltimore) ; 102(39): e35328, 2023 Sep 29.

Article in English | MEDLINE | ID: mdl-37773842

ABSTRACT

U-Net has become highly popular owing to its performance in medical image segmentation. However, it cannot explicitly model long-range dependencies. By contrast, the transformer can effectively capture long-range dependencies by leveraging the self-attention (SA) of the encoder. Although SA, an important characteristic of the transformer, can find correlations within the original data, its quadratic computational complexity can slow the processing of high-dimensional data (such as medical images). Furthermore, SA is limited because the correlation between samples is overlooked; thus, there is considerable scope for improvement. To this end, based on Swin-UNet, we introduce a dynamic selective attention mechanism for the convolution kernels. The weight of each convolution kernel is calculated so that the kernel outputs are fused dynamically. This attention mechanism permits each neuron to adaptively adjust its receptive field size in response to multiscale input information. A local cross-channel interaction strategy without dimensionality reduction is introduced, which eliminates the influence of dimensionality reduction on learning channel attention. Through suitable cross-channel interactions, model complexity can be significantly reduced while performance is maintained. Subsequently, the global interaction between the encoder features is used to extract more fine-grained features. In addition, a mixed loss function combining weighted cross-entropy loss and Dice loss is used to alleviate class imbalance and achieve better results when the sample numbers are unbalanced. We evaluated the proposed method on abdominal multiorgan segmentation and cardiac segmentation datasets, achieving a Dice similarity coefficient of 80.30% and a 95% Hausdorff distance of 14.55 on the Synapse dataset, and a Dice similarity coefficient of 90.80% on the ACDC dataset. The experimental results show that the proposed method has good generalization ability and robustness, and that it is a powerful tool for medical image segmentation.
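
The mixed loss mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the mixing coefficient, the class weights, or the exact Dice formulation, so the parameter `alpha`, the `class_weights` argument, the soft Dice form, and all function names below are assumptions made for illustration, not the authors' implementation. Pixels are flattened into a list and handled in plain Python to keep the example self-contained.

```python
import math

def softmax(logits):
    """Numerically stable softmax over one pixel's class logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def weighted_cross_entropy(probs, labels, class_weights):
    """Sum of -w[y] * log p[y] over pixels, normalised by the summed weights."""
    num = 0.0
    den = 0.0
    for p, y in zip(probs, labels):
        w = class_weights[y]
        num += -w * math.log(max(p[y], 1e-12))
        den += w
    return num / den

def dice_loss(probs, labels, num_classes, eps=1e-6):
    """1 minus the soft Dice coefficient averaged over classes."""
    total = 0.0
    for c in range(num_classes):
        inter = sum(p[c] for p, y in zip(probs, labels) if y == c)
        pred_sum = sum(p[c] for p in probs)
        true_sum = sum(1 for y in labels if y == c)
        total += (2.0 * inter + eps) / (pred_sum + true_sum + eps)
    return 1.0 - total / num_classes

def mixed_loss(logits, labels, class_weights, alpha=0.5):
    """alpha * weighted CE + (1 - alpha) * Dice; alpha is an assumed value."""
    probs = [softmax(z) for z in logits]
    ce = weighted_cross_entropy(probs, labels, class_weights)
    dl = dice_loss(probs, labels, len(class_weights))
    return alpha * ce + (1.0 - alpha) * dl

# Toy example: four pixels, two classes, with the rare class 1 up-weighted.
logits = [[2.0, -1.0], [1.5, 0.0], [0.5, 0.3], [-1.0, 2.5]]
labels = [0, 0, 0, 1]
loss = mixed_loss(logits, labels, class_weights=[1.0, 3.0])
```

Up-weighting the rare class in the cross-entropy term and adding an overlap-based Dice term are two complementary ways of keeping a dominant background class from swamping the gradient, which is the imbalance problem the abstract refers to.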

Subjects

Algorithms; Benchmarking; Humans; Electric Power Supplies; Entropy; Heart; Weight Loss; Image Processing, Computer-Assisted

Full text: 1 | Database: MEDLINE | Main subject: Algorithms / Benchmarking | Language: English | Year of publication: 2023 | Document type: Article
