Interpretation of a deep analysis of speech imagery features extracted by a capsule neural network.

Macías-Macías, José M; Ramírez-Quintana, Juan A; Chacón-Murguía, Mario I; Torres-García, Alejandro A; Corral-Martínez, Luis F

Macías-Macías, José M; Ramírez-Quintana, Juan A; Chacón-Murguía, Mario I; Torres-García, Alejandro A; Corral-Martínez, Luis F.

Afiliación

Macías-Macías JM; Tecnológico Nacional de México/IT Chihuahua, Av. Tecnológico 2909, Chihuahua, 31310, Chihuahua, Mexico. Electronic address: D19060653@chihuahua.tecnm.mx.
Ramírez-Quintana JA; Tecnológico Nacional de México/IT Chihuahua, Av. Tecnológico 2909, Chihuahua, 31310, Chihuahua, Mexico.
Chacón-Murguía MI; Tecnológico Nacional de México/IT Chihuahua, Av. Tecnológico 2909, Chihuahua, 31310, Chihuahua, Mexico.
Torres-García AA; Instituto Nacional de Astrofísica Óptica y Electrónica, Luis Enrique Erro No 1, Tonanzintla, 72840, Puebla, Mexico.
Corral-Martínez LF; Tecnológico Nacional de México/IT Chihuahua, Av. Tecnológico 2909, Chihuahua, 31310, Chihuahua, Mexico.

Comput Biol Med ; 159: 106909, 2023 06.

Article en En | MEDLINE | ID: mdl-37071937

ABSTRACT

ABSTRACT

Speech imagery has been successfully employed in developing Brain-Computer Interfaces because it is a novel mental strategy that generates brain activity more intuitively than evoked potentials or motor imagery. There are many methods to analyze speech imagery signals, but those based on deep neural networks achieve the best results. However, more research is necessary to understand the properties and features that describe imagined phonemes and words. In this paper, we analyze the statistical properties of speech imagery EEG signals from the KaraOne dataset to design a method that classifies imagined phonemes and words. With this analysis, we propose a Capsule Neural Network that categorizes speech imagery patterns into bilabial, nasal, consonant-vocal, and vowels/iy/ and/uw/. The method is called Capsules for Speech Imagery Analysis (CapsK-SI). The input of CapsK-SI is a set of statistical features of EEG speech imagery signals. The architecture of the Capsule Neural Network is composed of a convolution layer, a primary capsule layer, and a class capsule layer. The average accuracy reached is 90.88%±7 for bilabial, 90.15%±8 for nasal, 94.02%±6 for consonant-vowel, 89.70%±8 for word-phoneme, 94.33%± for/iy/ vowel and, 94.21%±3 for/uw/ vowel detection. Finally, with the activity vectors of the CapsK-SI capsules, we generated brain maps to represent brain activity in the production of bilabial, nasal, and consonant-vocal signals.

Asunto(s)

Interfaces Cerebro-Computador; Habla; Habla/fisiología; Cápsulas; Electroencefalografía/métodos; Redes Neurales de la Computación; Encéfalo/fisiología; Imaginación/fisiología; Algoritmos

Palabras clave

BrainComputer Interface; Deep capsule neural network; EEG signal processing; Speech imagery

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Habla / Interfaces Cerebro-Computador Idioma: En Revista: Comput Biol Med Año: 2023 Tipo del documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google