Your browser doesn't support javascript.
loading
Deep Learning for Visual Speech Analysis: A Survey.
IEEE Trans Pattern Anal Mach Intell ; 46(9): 6001-6022, 2024 Sep.
Article en En | MEDLINE | ID: mdl-38478434
ABSTRACT
Visual speech, referring to the visual domain of speech, has attracted increasing attention due to its wide applications, such as public security, medical treatment, military defense, and film entertainment. As a powerful AI strategy, deep learning techniques have extensively promoted the development of visual speech learning. Over the past five years, numerous deep learning based methods have been proposed to address various problems in this area, especially automatic visual speech recognition and generation. To push forward future research on visual speech, this paper will present a comprehensive review of recent progress in deep learning methods on visual speech analysis. We cover different aspects of visual speech, including fundamental problems, challenges, benchmark datasets, a taxonomy of existing methods, and state-of-the-art performance. Besides, we also identify gaps in current research and discuss inspiring future research directions.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Habla / Aprendizaje Profundo Límite: Humans Idioma: En Revista: IEEE Trans Pattern Anal Mach Intell / IEEE transactions on pattern analysis and machine intelligence (Online) Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Habla / Aprendizaje Profundo Límite: Humans Idioma: En Revista: IEEE Trans Pattern Anal Mach Intell / IEEE transactions on pattern analysis and machine intelligence (Online) Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article