Your browser doesn't support javascript.
loading
Active learning for bird sound classification via a kernel-based extreme learning machine.
Qian, Kun; Zhang, Zixing; Baird, Alice; Schuller, Björn.
Affiliation
  • Qian K; Machine Intelligence and Signal Processing Group, Chair of Human-Machine Communication, Technische Universität München, Arcisstr. 21, Munich 80333, Germany.
  • Zhang Z; Chair of Complex and Intelligent Systems, University of Passau, Innstr. 43, Passau 94032, Germany.
  • Baird A; Chair of Complex and Intelligent Systems, University of Passau, Innstr. 43, Passau 94032, Germany.
  • Schuller B; GLAM-Group on Language, Audio and Music, Department of Computing, Imperial College London, 180 Queens' Gate, Huxley Building, London SW7 2AZ, United Kingdom.
J Acoust Soc Am ; 142(4): 1796, 2017 10.
Article in En | MEDLINE | ID: mdl-29092546
ABSTRACT
In recent years, research fields, including ecology, bioacoustics, signal processing, and machine learning, have made bird sound recognition a part of their focus. This has led to significant advancements within the field of ornithology, such as improved understanding of evolution, local biodiversity, mating rituals, and even the implications and realities associated to climate change. The volume of unlabeled bird sound data is now overwhelming, and comparatively little exploration is being made into methods for how best to handle them. In this study, two active learning (AL) methods are proposed, sparse-instance-based active learning (SI-AL), and least-confidence-score-based active learning (LCS-AL), both effectively reducing the need for expert human annotation. To both of these AL paradigms, a kernel-based extreme learning machine (KELM) is then integrated, and a comparison is made to the conventional support vector machine (SVM). Experimental results demonstrate that, when the classifier capacity is improved from an unweighted average recall of 60%-80%, KELM can outperform SVM even when a limited proportion of human annotations are used from the pool of data in both cases of SI-AL (minimum 34.5% vs minimum 59.0%) and LCS-AL (minimum 17.3% vs minimum 28.4%).
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Vocalization, Animal / Acoustics / Birds / Signal Processing, Computer-Assisted / Pattern Recognition, Automated / Supervised Machine Learning Limits: Animals Language: En Journal: J Acoust Soc Am Year: 2017 Document type: Article Affiliation country: Germany

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Vocalization, Animal / Acoustics / Birds / Signal Processing, Computer-Assisted / Pattern Recognition, Automated / Supervised Machine Learning Limits: Animals Language: En Journal: J Acoust Soc Am Year: 2017 Document type: Article Affiliation country: Germany
...