Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection.

Marchi, Erik; Vesperini, Fabio; Squartini, Stefano; Schuller, Björn

Affiliation

Marchi E; Machine Intelligence & Signal Processing Group, Technische Universität München, Munich, Germany; audEERING GmbH, Gilching, Germany; Chair of Complex & Intelligent Systems, University of Passau, Passau, Germany.
Vesperini F; A3LAB, Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy.
Squartini S; A3LAB, Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy.
Schuller B; audEERING GmbH, Gilching, Germany; Chair of Complex & Intelligent Systems, University of Passau, Passau, Germany; Department of Computing, Imperial College London, London, UK.

Comput Intell Neurosci; 2017: 4694860, 2017.

Article in English | MEDLINE | ID: mdl-28182121

ABSTRACT

In the emerging field of acoustic novelty detection, most research efforts are devoted to probabilistic approaches such as mixture models or state-space models. Only recent studies have introduced (pseudo-)generative models for acoustic novelty detection with recurrent neural networks in the form of an autoencoder. In these approaches, auditory spectral features of the next short-term frame are predicted from the previous frames by means of Long Short-Term Memory recurrent denoising autoencoders. The reconstruction error between the input and the output of the autoencoder is used as an activation signal to detect novel events. There is no evidence of studies that compare previous efforts to automatically recognize novel events from audio signals and provide a broad and in-depth evaluation of recurrent neural network-based autoencoders. The present contribution aims to consistently evaluate our recent novel approaches to fill this gap in the literature and to provide insight through extensive evaluations carried out on three databases: A3Novelty, PASCAL CHiME, and PROMETHEUS. Besides providing an extensive analysis of novel and state-of-the-art methods, the article shows how RNN-based autoencoders outperform statistical approaches, with an absolute improvement of up to 16.4% in average F-measure over the three databases.
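The detection step described in the abstract, where the reconstruction error serves as an activation signal, can be illustrated with a minimal sketch. It does not train an autoencoder: the reconstructions below are hypothetical stand-ins for network outputs, and the function names, the Euclidean error, the median-based threshold, and the value of `beta` are all assumptions made for illustration rather than settings taken from the abstract.

```python
import math
from statistics import median


def reconstruction_errors(frames, reconstructions):
    """Euclidean distance between each input frame and its reconstruction."""
    if len(frames) != len(reconstructions):
        raise ValueError("frames and reconstructions must have equal length")
    return [math.dist(x, y) for x, y in zip(frames, reconstructions)]


def detect_novelty(errors, beta=2.0):
    """Flag frames whose error exceeds beta times the median error.

    The median-based threshold and the default beta are assumptions of
    this sketch, not values stated in the abstract.
    """
    threshold = beta * median(errors)
    return [e > threshold for e in errors]


# Toy data: four 2-dimensional feature frames and hypothetical autoencoder
# outputs. The third frame is reconstructed poorly on purpose.
frames = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0], [1.0, 0.0]]
reconstructions = [[0.0, 0.1], [1.0, 0.9], [2.0, 1.0], [1.0, 0.1]]

errors = reconstruction_errors(frames, reconstructions)
flags = detect_novelty(errors)  # True marks a frame treated as novel
```

In this toy run only the third frame is flagged, because its error is far above the median of the other frames. In a real system the reconstructions would come from the trained recurrent autoencoder, and the threshold would be tuned on held-out data.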

Subjects

Acoustics; Data Compression; Neural Networks, Computer; Databases, Factual; Humans; Models, Statistical

Full text: 1 | Collections: 01-international | Database: MEDLINE | Main subject: Acoustics / Neural Networks, Computer / Data Compression | Study type: Diagnostic_studies / Prognostic_studies / Risk_factors_studies | Limit: Humans | Language: En | Journal: Comput Intell Neurosci | Publication year: 2017 | Document type: Article
