Empowering Vision Transformer by Network Hyper-Parameter Selection for Whole Pelvis Prostate Planning Target Volume Auto-Segmentation.

Cho, Hyeonjeong; Lee, Jae Sung; Kim, Jin Sung; Koom, Woong Sub; Kim, Hojin

Affiliation

Cho H; Department of Radiation Oncology, Yonsei Cancer Center, Heavy Ion Therapy Research Institute, Yonsei University College of Medicine, Seoul 03722, Republic of Korea.
Lee JS; Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul 03080, Republic of Korea.
Kim JS; Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul 03080, Republic of Korea.
Koom WS; Department of Radiation Oncology, Yonsei Cancer Center, Heavy Ion Therapy Research Institute, Yonsei University College of Medicine, Seoul 03722, Republic of Korea.
Kim H; Department of Radiation Oncology, Yonsei Cancer Center, Heavy Ion Therapy Research Institute, Yonsei University College of Medicine, Seoul 03722, Republic of Korea.

Cancers (Basel); 15(23), 2023 Nov 21.

Article in En | MEDLINE | ID: mdl-38067211

ABSTRACT

U-Net, based on a deep convolutional neural network (CNN), has been used clinically to auto-segment normal organs, but it remains limited for planning target volume (PTV) segmentation. This work addresses the problem in two ways: (1) applying one of the newest network architectures, the vision transformer, in place of CNN-based networks, and (2) finding an appropriate combination of network hyper-parameters, with reference to the recently proposed nnU-Net ("no-new-Net"). VT U-Net was adopted for auto-segmenting the whole-pelvis prostate PTV because it consists of a fully transformer-based architecture. Its upgraded version (v.2) applied nnU-Net-like hyper-parameter optimizations, which did not fully cover the transformer-oriented hyper-parameters. We therefore searched for a suitable combination of two key hyper-parameters (patch size and embedding dimension) using 140 CT scans with 4-fold cross-validation. Among seven recently proposed deep learning networks, VT U-Net v.2 with hyper-parameter tuning yielded the highest average Dice similarity coefficient (DSC) of 82.5 and the lowest average 95% Hausdorff distance (HD95) of 3.5. Notably, nnU-Net with hyper-parameter optimization achieved competitive performance even though it is based on convolution layers. These results indicate that network hyper-parameter tuning is necessary even for newly developed vision transformer architectures.
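To make the evaluation setup more concrete, the following is a minimal sketch, not the authors' code, of how a Dice similarity coefficient and a 4-fold split over 140 cases might be computed, together with a small grid over the two hyper-parameters named in the abstract. The function names, the candidate patch sizes and embedding dimensions, the random seed, and the choice to express DSC as a percentage are all assumptions made for illustration; none of them are taken from the paper.

```python
from itertools import product

import numpy as np


def dice_coefficient(pred, truth):
    """Dice similarity coefficient between two binary masks, in percent.

    Reporting in percent is an assumption made for this sketch.
    """
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    denom = pred.sum() + truth.sum()
    if denom == 0:
        # Both masks empty: treat as perfect agreement (a convention, not a rule).
        return 100.0
    overlap = np.logical_and(pred, truth).sum()
    return 100.0 * 2.0 * overlap / denom


def four_fold_splits(n_cases, n_folds=4, seed=0):
    """Yield (train_indices, val_indices) pairs for k-fold cross-validation."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n_cases), n_folds)
    for k in range(n_folds):
        train = np.concatenate([folds[j] for j in range(n_folds) if j != k])
        yield train, folds[k]


# Hypothetical candidate values; the abstract does not list the grid searched.
PATCH_SIZES = [(96, 96, 96), (128, 128, 128)]
EMBED_DIMS = [96, 128]
candidates = list(product(PATCH_SIZES, EMBED_DIMS))

if __name__ == "__main__":
    for patch_size, embed_dim in candidates:
        for fold, (train_idx, val_idx) in enumerate(four_fold_splits(140)):
            # A real study would train a network here and score it with
            # dice_coefficient on the validation cases; this only shows the loop.
            print(patch_size, embed_dim, fold, len(train_idx), len(val_idx))
```

In such a setup, each hyper-parameter combination would be scored by averaging the validation metric over the four folds, and the combination with the best average would be selected.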

Key words

VT U-Net v.2; auto-segmentation; hyper-parameter selection; planning target volume; prostate cancer; transformer

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Cancers (Basel) Year: 2023 Document type: Article Country of publication:
