Similarity-based birdcall retrieval from environmental audio


Autoria(s): Dong, Xueyan; Towsey, Michael; Truskinger, Anthony; Cottman-Fields, Mark; Zhang, Jinglan; Roe, Paul
Data(s)

01/09/2015

Resumo

Automated digital recordings are useful for large-scale temporal and spatial environmental monitoring. An important research effort has been the automated classification of calling bird species. In this paper we examine a related task, retrieval of birdcalls from a database of audio recordings, similar to a user supplied query call. Such a retrieval task can sometimes be more useful than an automated classifier. We compare three approaches to similarity-based birdcall retrieval using spectral ridge features and two kinds of gradient features, structure tensor and the histogram of oriented gradients. The retrieval accuracy of our spectral ridge method is 94% compared to 82% for the structure tensor method and 90% for the histogram of gradients method. Additionally, this approach potentially offers a more compact representation and is more computationally efficient.

Identificador

http://eprints.qut.edu.au/86087/

Publicador

Elsevier BV

Relação

DOI:10.1016/j.ecoinf.2015.07.007

Dong, Xueyan, Towsey, Michael, Truskinger, Anthony, Cottman-Fields, Mark, Zhang, Jinglan, & Roe, Paul (2015) Similarity-based birdcall retrieval from environmental audio. Ecological Informatics, 29(Part 1), pp. 66-76.

Direitos

Copyright 2015 Elsevier B.V.

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #080104 Computer Vision #080109 Pattern Recognition and Data Mining #birdcall retrieval #environmental audio #ridge detection #spectral peak tracks
Tipo

Journal Article