ABSTRACT
In the treatment and diagnosis of oral health conditions such as oral cancer and oropharyngeal cancer, investigating tongue movements is a major component. Automatic measurement of such movements must begin with tongue segmentation. This paper proposes a tongue segmentation solution based on an encoder-decoder CNN architecture, i.e., U-Net, which can, however, suffer from feature loss in deep layers. We therefore propose a Deep Upscale U-Net (DU-UNET): on top of the original U-Net structure, an additional up-sampling of the feature map from the contracting path is concatenated to an upper layer of the expansive path. The segmentation model is constructed by training DU-UNET on two publicly available datasets and is then transferred to a self-collected dataset of tongue images with five tongue postures, recorded at a far distance from the camera under a real-world scenario. The proposed DU-UNET outperforms the other methods in our literature review, with an accuracy of 99.2%, mean IoU of 97.8%, Dice score of 96.8%, and Jaccard score of 96.8%.
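The skip modification the abstract describes can be sketched as follows. This is an illustrative PyTorch snippet, not the paper's exact implementation: a deeper encoder feature map receives one extra 2x up-sampling and is concatenated to an upper decoder layer, alongside the usual U-Net skip. The class name `DeepUpscaleSkip`, the channel counts, and the 1x1 projection are assumptions for the sketch.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DeepUpscaleSkip(nn.Module):
    """Hypothetical sketch of the DU-UNET skip: up-sample a deeper
    contracting-path feature map once more and concatenate it to an
    upper expansive-path layer."""

    def __init__(self, enc_channels, dec_channels):
        super().__init__()
        # 1x1 conv to match decoder channels after the extra up-sampling
        self.proj = nn.Conv2d(enc_channels, dec_channels, kernel_size=1)

    def forward(self, enc_feat, dec_feat):
        # enc_feat: deeper encoder map (half the spatial size of dec_feat)
        up = F.interpolate(enc_feat, scale_factor=2, mode="bilinear",
                           align_corners=False)
        up = self.proj(up)
        # concatenate with the decoder feature at the upper layer
        return torch.cat([dec_feat, up], dim=1)

enc = torch.randn(1, 128, 16, 16)   # deeper encoder feature map
dec = torch.randn(1, 64, 32, 32)    # upper decoder feature map
fused = DeepUpscaleSkip(128, 64)(enc, dec)
print(fused.shape)  # torch.Size([1, 128, 32, 32])
```

The fused map keeps the decoder's spatial resolution while carrying the extra up-scaled context, which is the mechanism the abstract credits for reducing feature loss in deep layers.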
Subjects
Image Processing, Computer-Assisted , Neural Networks, Computer , Tongue , Tongue/diagnostic imaging , Humans , Image Processing, Computer-Assisted/methods , Algorithms , Deep Learning
ABSTRACT
The tongue and its movements can be used for several medical tasks, such as identifying a disease and tracking rehabilitation. To focus on the tongue region, tongue segmentation is needed to compute a region of interest for further analysis. This paper proposes an encoder-decoder CNN-based architecture for segmenting the tongue in an image. The encoder module mainly extracts tongue features, while the decoder module reconstructs the segmented tongue from the extracted features based on the training images. In addition, residual multi-kernel pooling (RMP) is applied in the proposed network to help encode the features at multiple scales. The proposed method is evaluated on two publicly available datasets under a scenario of a front view and one tongue posture, and is then tested on a newly collected dataset of five tongue postures. The reported performances show that the proposed method outperforms existing methods in the literature. Furthermore, re-training improves the application of the trained model to an unseen dataset, which would be a necessary step toward applying the trained model in a real-world scenario.
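The multi-scale encoding step can be sketched as a residual multi-kernel pooling block. This is a hedged PyTorch illustration, not the paper's code: the input feature map is max-pooled at several kernel sizes, each pooled map is reduced to one channel by a 1x1 convolution, up-sampled back to the input size, and concatenated with the input itself (the residual branch). The kernel sizes (2, 3, 5, 6) follow a common RMP configuration and are an assumption here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RMP(nn.Module):
    """Illustrative residual multi-kernel pooling: pool at several
    kernel sizes, compress each branch to one channel, restore the
    spatial size, and concatenate with the untouched input."""

    def __init__(self, in_channels, kernel_sizes=(2, 3, 5, 6)):
        super().__init__()
        self.kernel_sizes = kernel_sizes
        self.convs = nn.ModuleList(
            nn.Conv2d(in_channels, 1, kernel_size=1) for _ in kernel_sizes
        )

    def forward(self, x):
        h, w = x.shape[2:]
        branches = [x]  # residual branch: keep the original features
        for k, conv in zip(self.kernel_sizes, self.convs):
            p = F.max_pool2d(x, kernel_size=k, stride=k)  # one pooling scale
            p = conv(p)                                   # 1-channel summary
            p = F.interpolate(p, size=(h, w), mode="bilinear",
                              align_corners=False)        # back to input size
            branches.append(p)
        # output channels = in_channels + one per pooling scale
        return torch.cat(branches, dim=1)

x = torch.randn(1, 32, 24, 24)
out = RMP(32)(x)
print(out.shape)  # torch.Size([1, 36, 24, 24])
```

Each extra channel summarizes the feature map at one pooling scale, which is how the block exposes multiple scales of the features to the rest of the network.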