Your browser doesn't support javascript.
loading
Environmental sound classification using temporal-frequency attention based convolutional neural network.
Mu, Wenjie; Yin, Bo; Huang, Xianqing; Xu, Jiali; Du, Zehua.
Afiliação
  • Mu W; College of Information Science and Engineering, Ocean University of China, Qingdao, China.
  • Yin B; College of Information Science and Engineering, Ocean University of China, Qingdao, China. ybfirst@126.com.
  • Huang X; Pilot National Laboratory for Marine Science and Technology, Qingdao, China. ybfirst@126.com.
  • Xu J; Pilot National Laboratory for Marine Science and Technology, Qingdao, China.
  • Du Z; Pilot National Laboratory for Marine Science and Technology, Qingdao, China.
Sci Rep ; 11(1): 21552, 2021 11 03.
Article em En | MEDLINE | ID: mdl-34732762
ABSTRACT
Environmental sound classification is one of the important issues in the audio recognition field. Compared with structured sounds such as speech and music, the time-frequency structure of environmental sounds is more complicated. In order to learn time and frequency features from Log-Mel spectrogram more effectively, a temporal-frequency attention based convolutional neural network model (TFCNN) is proposed in this paper. Firstly, an experiment that is used as motivation in proposed method is designed to verify the effect of a specific frequency band in the spectrogram on model classification. Secondly, two new attention mechanisms, temporal attention mechanism and frequency attention mechanism, are proposed. These mechanisms can focus on key frequency bands and semantic related time frames on the spectrogram to reduce the influence of background noise and irrelevant frequency bands. Then, a feature information complementarity is formed by combining these mechanisms to more accurately capture the critical time-frequency features. In such a way, the representation ability of the network model can be greatly improved. Finally, experiments on two public data sets, UrbanSound 8 K and ESC-50, demonstrate the effectiveness of the proposed method.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Atenção / Som / Redes Neurais de Computação / Reconhecimento Psicológico Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Atenção / Som / Redes Neurais de Computação / Reconhecimento Psicológico Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article