Pesquisa | Biblioteca Virtual em Saúde

General audio tagging with ensembling convolutional neural networks and statistical features.

Xu, Kele; Zhu, Boqing; Kong, Qiuqiang; Mi, Haibo; Ding, Bo; Wang, Dezhi; Wang, Huaimin.

J Acoust Soc Am ; 145(6): EL521, 2019 06.

Artigo em Inglês | MEDLINE | ID: mdl-31255155

RESUMO

Audio tagging aims to infer descriptive labels from audio clips and it is challenging due to the limited size of data and noisy labels. The solution to the tagging task is described in this paper. The main contributions include the following: an ensemble learning framework is applied to ensemble statistical features and the outputs from the deep classifiers, with the goal to utilize complementary information. Moreover, a sample re-weight strategy is employed to address the noisy label problem within the framework. The approach achieves a mean average precision of 0.958, outperforming the baseline system with a large margin.

Assuntos

Aprendizado Profundo , Rede Nervosa/fisiologia , Redes Neurais de Computação , Neoplasias Cutâneas/fisiopatologia , Biometria/métodos , Humanos

Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification.

Koike, Tomoya; Qian, Kun; Kong, Qiuqiang; Plumbley, Mark D; Schuller, Bjorn W; Yamamoto, Yoshiharu.

Annu Int Conf IEEE Eng Med Biol Soc ; 2020: 74-77, 2020 07.

Artigo em Inglês | MEDLINE | ID: mdl-33017934

RESUMO

Cardiovascular disease is one of the leading factors for death cause of human beings. In the past decade, heart sound classification has been increasingly studied for its feasibility to develop a non-invasive approach to monitor a subject's health status. Particularly, relevant studies have benefited from the fast development of wearable devices and machine learning techniques. Nevertheless, finding and designing efficient acoustic properties from heart sounds is an expensive and time-consuming task. It is known that transfer learning methods can help extract higher representations automatically from the heart sounds without any human domain knowledge. However, most existing studies are based on models pre-trained on images, which may not fully represent the characteristics inherited from audio. To this end, we propose a novel transfer learning model pre-trained on large scale audio data for a heart sound classification task. In this study, the PhysioNet CinC Challenge Dataset is used for evaluation. Experimental results demonstrate that, our proposed pre-trained audio models can outperform other popular models pre-trained by images by achieving the highest unweighted average recall at 89.7 %.

Assuntos

Meios de Comunicação , Ruídos Cardíacos , Dispositivos Eletrônicos Vestíveis , Acústica , Humanos , Aprendizado de Máquina

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA