Pesquisa | Portal Regional da BVS

On the similarities of representations in artificial and brain neural networks for speech recognition.

Wingfield, Cai; Zhang, Chao; Devereux, Barry; Fonteneau, Elisabeth; Thwaites, Andrew; Liu, Xunying; Woodland, Phil; Marslen-Wilson, William; Su, Li.

Front Comput Neurosci ; 16: 1057439, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-36618270

RESUMO

Introduction: In recent years, machines powered by deep learning have achieved near-human levels of performance in speech recognition. The fields of artificial intelligence and cognitive neuroscience have finally reached a similar level of performance, despite their huge differences in implementation, and so deep learning models can-in principle-serve as candidates for mechanistic models of the human auditory system. Methods: Utilizing high-performance automatic speech recognition systems, and advanced non-invasive human neuroimaging technology such as magnetoencephalography and multivariate pattern-information analysis, the current study aimed to relate machine-learned representations of speech to recorded human brain representations of the same speech. Results: In one direction, we found a quasi-hierarchical functional organization in human auditory cortex qualitatively matched with the hidden layers of deep artificial neural networks trained as part of an automatic speech recognizer. In the reverse direction, we modified the hidden layer organization of the artificial neural network based on neural activation patterns in human brains. The result was a substantial improvement in word recognition accuracy and learned speech representations. Discussion: We have demonstrated that artificial and brain neural networks can be mutually informative in the domain of speech recognition.

Relating dynamic brain states to dynamic machine states: Human and machine solutions to the speech recognition problem.

Wingfield, Cai; Su, Li; Liu, Xunying; Zhang, Chao; Woodland, Phil; Thwaites, Andrew; Fonteneau, Elisabeth; Marslen-Wilson, William D.

PLoS Comput Biol ; 13(9): e1005617, 2017 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-28945744

RESUMO

There is widespread interest in the relationship between the neurobiological systems supporting human cognition and emerging computational systems capable of emulating these capacities. Human speech comprehension, poorly understood as a neurobiological process, is an important case in point. Automatic Speech Recognition (ASR) systems with near-human levels of performance are now available, which provide a computationally explicit solution for the recognition of words in continuous speech. This research aims to bridge the gap between speech recognition processes in humans and machines, using novel multivariate techniques to compare incremental 'machine states', generated as the ASR analysis progresses over time, to the incremental 'brain states', measured using combined electro- and magneto-encephalography (EMEG), generated as the same inputs are heard by human listeners. This direct comparison of dynamic human and machine internal states, as they respond to the same incrementally delivered sensory input, revealed a significant correspondence between neural response patterns in human superior temporal cortex and the structural properties of ASR-derived phonetic models. Spatially coherent patches in human temporal cortex responded selectively to individual phonetic features defined on the basis of machine-extracted regularities in the speech to lexicon mapping process. These results demonstrate the feasibility of relating human and ASR solutions to the problem of speech recognition, and suggest the potential for further studies relating complex neural computations in human speech comprehension to the rapidly evolving ASR systems that address the same problem domain.

Assuntos

Encéfalo/fisiologia , Modelos Neurológicos , Redes Neurais de Computação , Percepção da Fala/fisiologia , Interface para o Reconhecimento da Fala , Adulto , Eletroencefalografia , Feminino , Humanos , Magnetoencefalografia , Masculino , Adulto Jovem

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA