Active learning for bird sound classification via a kernel-based extreme learning machine.

Qian, Kun; Zhang, Zixing; Baird, Alice; Schuller, Björn

Qian, Kun; Zhang, Zixing; Baird, Alice; Schuller, Björn.

Affiliation

Qian K; Machine Intelligence and Signal Processing Group, Chair of Human-Machine Communication, Technische Universität München, Arcisstr. 21, Munich 80333, Germany.
Zhang Z; Chair of Complex and Intelligent Systems, University of Passau, Innstr. 43, Passau 94032, Germany.
Baird A; Chair of Complex and Intelligent Systems, University of Passau, Innstr. 43, Passau 94032, Germany.
Schuller B; GLAM-Group on Language, Audio and Music, Department of Computing, Imperial College London, 180 Queens' Gate, Huxley Building, London SW7 2AZ, United Kingdom.

J Acoust Soc Am ; 142(4): 1796, 2017 10.

Article in En | MEDLINE | ID: mdl-29092546

ABSTRACT

ABSTRACT

In recent years, research fields, including ecology, bioacoustics, signal processing, and machine learning, have made bird sound recognition a part of their focus. This has led to significant advancements within the field of ornithology, such as improved understanding of evolution, local biodiversity, mating rituals, and even the implications and realities associated to climate change. The volume of unlabeled bird sound data is now overwhelming, and comparatively little exploration is being made into methods for how best to handle them. In this study, two active learning (AL) methods are proposed, sparse-instance-based active learning (SI-AL), and least-confidence-score-based active learning (LCS-AL), both effectively reducing the need for expert human annotation. To both of these AL paradigms, a kernel-based extreme learning machine (KELM) is then integrated, and a comparison is made to the conventional support vector machine (SVM). Experimental results demonstrate that, when the classifier capacity is improved from an unweighted average recall of 60%-80%, KELM can outperform SVM even when a limited proportion of human annotations are used from the pool of data in both cases of SI-AL (minimum 34.5% vs minimum 59.0%) and LCS-AL (minimum 17.3% vs minimum 28.4%).

Subject(s)

Acoustics; Birds/classification; Birds/physiology; Pattern Recognition, Automated/methods; Signal Processing, Computer-Assisted; Supervised Machine Learning; Vocalization, Animal/classification; Animals; Databases, Factual; Support Vector Machine

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Vocalization, Animal / Acoustics / Birds / Signal Processing, Computer-Assisted / Pattern Recognition, Automated / Supervised Machine Learning Limits: Animals Language: En Journal: J Acoust Soc Am Year: 2017 Document type: Article Affiliation country: Germany

Fulltext

Add to My VHL

XML

PubMed Links

Search on Google