Your browser doesn't support javascript.
loading
MagTrack: A Wearable Tongue Motion Tracking System for Silent Speech Interfaces.
Cao, Beiming; Ravi, Shravan; Sebkhi, Nordine; Bhavsar, Arpan; Inan, Omer T; Xu, Wen; Wang, Jun.
Afiliación
  • Cao B; Department of Electrical and Computer Engineering, The University of Texas at Austin.
  • Ravi S; Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin.
  • Sebkhi N; Department of Computer Science, The University of Texas at Austin.
  • Bhavsar A; School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta.
  • Inan OT; School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta.
  • Xu W; School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta.
  • Wang J; Division of Computer Science, Texas Woman's University, Denton.
J Speech Lang Hear Res ; 66(8S): 3206-3221, 2023 08 17.
Article en En | MEDLINE | ID: mdl-37146629
PURPOSE: Current electromagnetic tongue tracking devices are not amenable for daily use and thus not suitable for silent speech interface and other applications. We have recently developed MagTrack, a novel wearable electromagnetic articulograph tongue tracking device. This study aimed to validate MagTrack for potential silent speech interface applications. METHOD: We conducted two experiments: (a) classification of eight isolated vowels in consonant-vowel-consonant form and (b) continuous silent speech recognition. In these experiments, we used data from healthy adult speakers collected with MagTrack. The performance of vowel classification was measured by accuracies. The continuous silent speech recognition was measured by phoneme error rates. The performance was then compared with results using data collected with commercial electromagnetic articulograph in a prior study. RESULTS: The isolated vowel classification using MagTrack achieved an average accuracy of 89.74% when leveraging all MagTrack signals (x, y, z coordinates; orientation; and magnetic signals), which outperformed the accuracy using commercial electromagnetic articulograph data (only y, z coordinates) in our previous study. The continuous speech recognition from two subjects using MagTrack achieved phoneme error rates of 73.92% and 66.73%, respectively. The commercial electromagnetic articulograph achieved 64.53% from the same subject (66.73% using MagTrack data). CONCLUSIONS: MagTrack showed comparable results with the commercial electromagnetic articulograph when using the same localized information. Adding raw magnetic signals would improve the performance of MagTrack. Our preliminary testing demonstrated the potential for silent speech interface as a lightweight wearable device. This work also lays the foundation to support MagTrack's potential for other applications including visual feedback-based speech therapy and second language learning.
Asunto(s)

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Habla / Percepción del Habla Límite: Adult / Humans Idioma: En Revista: J Speech Lang Hear Res Asunto de la revista: AUDIOLOGIA / PATOLOGIA DA FALA E LINGUAGEM Año: 2023 Tipo del documento: Article

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Habla / Percepción del Habla Límite: Adult / Humans Idioma: En Revista: J Speech Lang Hear Res Asunto de la revista: AUDIOLOGIA / PATOLOGIA DA FALA E LINGUAGEM Año: 2023 Tipo del documento: Article