Your browser doesn't support javascript.
loading
Microwave Speech Recognizer Empowered by a Programmable Metasurface.
Zhang, Hongrui; Ruan, Hengxin; Zhao, Hanting; Wang, Zhuo; Hu, Shengguo; Cui, Tie Jun; Del Hougne, Philipp; Li, Lianlin.
Afiliação
  • Zhang H; State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.
  • Ruan H; State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.
  • Zhao H; Peng Cheng Laboratory, Shenzhen, Guangdong, 518000, China.
  • Wang Z; State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.
  • Hu S; State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.
  • Cui TJ; State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.
  • Del Hougne P; State Key Laboratory of Millimeter Waves, Southeast University, Nanjing, 210096, China.
  • Li L; Pazhou Laboratory (Huangpu), Guangzhou, Guangdong, 510555, China.
Adv Sci (Weinh) ; 11(17): e2309826, 2024 May.
Article em En | MEDLINE | ID: mdl-38380552
ABSTRACT
Speech recognition becomes increasingly important in the modern society, especially for human-machine interactions, but its deployment is still severely thwarted by the struggle of machines to recognize voiced commands in challenging real-life settings oftentimes, ambient noise drowns the acoustic sound signals, and walls, face masks or other obstacles hide the mouth motion from optical sensors. To address these formidable challenges, an experimental prototype of a microwave speech recognizer empowered by programmable metasurface is presented here that can remotely recognize human voice commands and speaker identities even in noisy environments and if the speaker's mouth is hidden behind a wall or face mask. The programmable metasurface is the pivotal hardware ingredient of the system because its large aperture and huge number of degrees of freedom allows the system to perform a complex sequence of sensing tasks, orchestrated by artificial-intelligence tools. Relying solely on microwave data, the system avoids visual privacy infringements. The developed microwave speech recognizer can enable privacy-respecting voice-commanded human-machine interactions is experimentally demonstrated in many important but to-date inaccessible application scenarios. The presented strategy will unlock new possibilities and have expectations for future smart homes, ambient-assisted health monitoring, as well as intelligent surveillance and security.
Assuntos
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Interface para o Reconhecimento da Fala / Micro-Ondas Limite: Humans Idioma: En Revista: Adv Sci (Weinh) Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Interface para o Reconhecimento da Fala / Micro-Ondas Limite: Humans Idioma: En Revista: Adv Sci (Weinh) Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China