Propagating variational model uncertainty for bioacoustic call label smoothing.

Rizos, Georgios; Lawson, Jenna; Mitchell, Simon; Shah, Pranay; Wen, Xin; Banks-Leite, Cristina; Ewers, Robert; Schuller, Björn W.

Affiliation

Rizos G; GLAM - Group on Language, Audio, & Music, Department of Computing, Imperial College London, London SW7 2RH, UK.
Lawson J; Department of Life Sciences, Imperial College London, Ascot SL5 7PY, UK.
Mitchell S; DICE - Durrell Institute of Conservation and Ecology, University of Kent, Canterbury CT2 7NR, UK.
Shah P; GLAM - Group on Language, Audio, & Music, Department of Computing, Imperial College London, London SW7 2RH, UK.
Wen X; GLAM - Group on Language, Audio, & Music, Department of Computing, Imperial College London, London SW7 2RH, UK.
Banks-Leite C; Department of Life Sciences, Imperial College London, Ascot SL5 7PY, UK.
Ewers R; Department of Life Sciences, Imperial College London, Ascot SL5 7PY, UK.
Schuller BW; GLAM - Group on Language, Audio, & Music, Department of Computing, Imperial College London, London SW7 2RH, UK.

Patterns (N Y) ; 5(3): 100932, 2024 Mar 08.

Article in En | MEDLINE | ID: mdl-38487806

ABSTRACT

Along with propagating the input toward making a prediction, Bayesian neural networks also propagate uncertainty. This has the potential to guide the training process by rejecting predictions of low confidence, and recent variational Bayesian methods can do so without Monte Carlo sampling of weights. Here, we apply sample-free methods for wildlife call detection on recordings made via passive acoustic monitoring equipment in the animals' natural habitats. We further propose uncertainty-aware label smoothing, where the smoothing probability is dependent on sample-free predictive uncertainty, in order to downweigh data samples that should contribute less to the loss value. We introduce a bioacoustic dataset recorded in Malaysian Borneo, containing overlapping calls from 30 species. On that dataset, our proposed method achieves an absolute percentage improvement of around 1.5 points on area under the receiver operating characteristic (AU-ROC), 13 points in F1, and 19.5 points in expected calibration error (ECE) compared to the point-estimate network baseline averaged across all target classes.
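The abstract states only that the smoothing probability depends on sample-free predictive uncertainty; it does not give the mapping. As a minimal sketch, assuming a per-label uncertainty score already scaled to [0, 1] and a simple linear mapping to the smoothing strength, the idea might look like the following. The function names, the `max_smoothing` cap, and the linear mapping are illustrative assumptions, not the authors' formulation.

```python
import numpy as np


def smooth_targets(targets, uncertainty, max_smoothing=0.2):
    """Pull binary targets toward 0.5 in proportion to uncertainty.

    Assumption: `uncertainty` is already normalised to [0, 1], and the
    smoothing strength grows linearly with it up to `max_smoothing`.
    """
    targets = np.asarray(targets, dtype=float)
    uncertainty = np.clip(np.asarray(uncertainty, dtype=float), 0.0, 1.0)
    alpha = max_smoothing * uncertainty  # per-label smoothing strength
    # Standard binary label smoothing, with a per-label alpha.
    return targets * (1.0 - alpha) + 0.5 * alpha


def binary_cross_entropy(probs, soft_targets, eps=1e-7):
    """Mean binary cross-entropy against (possibly smoothed) targets."""
    probs = np.clip(np.asarray(probs, dtype=float), eps, 1.0 - eps)
    soft_targets = np.asarray(soft_targets, dtype=float)
    losses = -(soft_targets * np.log(probs)
               + (1.0 - soft_targets) * np.log(1.0 - probs))
    return float(losses.mean())


# Three call labels: confident, moderately uncertain, highly uncertain.
hard = [1, 0, 1]
unc = [0.0, 0.5, 1.0]
soft = smooth_targets(hard, unc)
loss = binary_cross_entropy([0.9, 0.2, 0.6], soft)
```

Under this assumed mapping, a label with zero uncertainty keeps its hard target, while more uncertain labels receive softer targets and so pull the prediction less strongly toward 0 or 1.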

Key words

adaptive label smoothing; bioacoustics; calibrated deep learning; epistemic uncertainty; machine audition; passive acoustic monitoring; uncertainty propagation; variational Bayesian deep learning; wildlife call detection

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Patterns (N Y) Year: 2024 Document type: Article Affiliation country: United Kingdom
