Application of deep learning technology for temporal analysis of videofluoroscopic swallowing studies.

Jeong, Seong Yun; Kim, Jeong Min; Park, Ji Eun; Baek, Seung Jun; Yang, Seung Nam

Jeong, Seong Yun; Kim, Jeong Min; Park, Ji Eun; Baek, Seung Jun; Yang, Seung Nam.

Afiliação

Jeong SY; Department of Computer Science and Engineering, Korea University, 145 Anam-ro Seongbuk-gu, Seoul, 02841, Korea.
Kim JM; Department of Physical Medicine and Rehabilitation, Korea University Guro Hospital, Korea University College of Medicine, 148, Gurodong-ro, Guro-gu, Seoul, 08308, Korea.
Park JE; Department of Physical Medicine and Rehabilitation, Korea University Guro Hospital, Korea University College of Medicine, 148, Gurodong-ro, Guro-gu, Seoul, 08308, Korea.
Baek SJ; Department of Computer Science and Engineering, Korea University, 145 Anam-ro Seongbuk-gu, Seoul, 02841, Korea. sjbaek@korea.ac.kr.
Yang SN; Department of Physical Medicine and Rehabilitation, Korea University Guro Hospital, Korea University College of Medicine, 148, Gurodong-ro, Guro-gu, Seoul, 08308, Korea. snamyang@korea.ac.kr.

Sci Rep ; 13(1): 17522, 2023 10 16.

Article em En | MEDLINE | ID: mdl-37845272

RESUMO

Temporal parameters during swallowing are analyzed for objective and quantitative evaluation of videofluoroscopic swallowing studies (VFSS). Manual analysis by clinicians is time-consuming, complicated and prone to human error during interpretation; therefore, automated analysis using deep learning has been attempted. We aimed to develop a model for the automatic measurement of various temporal parameters of swallowing using deep learning. Overall, 547 VFSS video clips were included. Seven temporal parameters were manually measured by two physiatrists as ground-truth data: oral phase duration, pharyngeal delay time, pharyngeal response time, pharyngeal transit time, laryngeal vestibule closure reaction time, laryngeal vestibule closure duration, and upper esophageal sphincter opening duration. ResNet3D was selected as the base model for the deep learning of temporal parameters. The performances of ResNet3D variants were compared with those of the VGG and I3D models used previously. The average accuracy of the proposed ResNet3D variants was from 0.901 to 0.981. The F1 scores and average precision were 0.794 to 0.941 and 0.714 to 0.899, respectively. Compared to the VGG and I3D models, our model achieved the best results in terms of accuracy, F1 score, and average precision values. Through the clinical application of this automatic model, temporal analysis of VFSS will be easier and more accurate.

Assuntos

Aprendizado Profundo; Transtornos de Deglutição; Humanos; Deglutição/fisiologia; Transtornos de Deglutição/diagnóstico por imagem; Transtornos de Deglutição/etiologia; Fluoroscopia/métodos; Esfíncter Esofágico Superior

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Transtornos de Deglutição / Aprendizado Profundo Limite: Humans Idioma: En Revista: Sci Rep Ano de publicação: 2023 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google