Generalised features for bird vocalisation retrieval in acoustic recordings


Autoria(s): Dong, Xueyan; Xie, Jie; Towsey, Michael; Zhang, Jinglan; Roe, Paul
Data(s)

01/10/2015

Resumo

Bioacoustic monitoring has become a significant research topic for species diversity conservation. Due to the development of sensing techniques, acoustic sensors are widely deployed in the field to record animal sounds over a large spatial and temporal scale. With large volumes of collected audio data, it is essential to develop semi-automatic or automatic techniques to analyse the data. This can help ecologists make decisions on how to protect and promote the species diversity. This paper presents generic features to characterize a range of bird species for vocalisation retrieval. In the implementation, audio recordings are first converted to spectrograms using short-time Fourier transform, then a ridge detection method is applied to the spectrogram for detecting points of interest. Based on the detected points, a new region representation are explored for describing various bird vocalisations and a local descriptor including temporal entropy, frequency bin entropy and histogram of counts of four ridge directions is calculated for each sub-region. To speed up the retrieval process, indexing is carried out and the retrieved results are ranked according to similarity scores. The experiment results show that our proposed feature set can achieve 0.71 in term of retrieval success rate which outperforms spectral ridge features alone (0.55) and Mel frequency cepstral coefficients (0.36).

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/86528/

Publicador

Institute of Electrical and Electronics Engineers Inc. (IEEE)

Relação

http://eprints.qut.edu.au/86528/1/Generalised%20Features%20for%20Bird%20Vocalisation%20Retrieval%20in%20Acoustic%20Sensor%20Recordings-new.pdf

http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7340813

DOI:10.1109/MMSP.2015.7340813

Dong, Xueyan, Xie, Jie, Towsey, Michael, Zhang, Jinglan, & Roe, Paul (2015) Generalised features for bird vocalisation retrieval in acoustic recordings. In 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP), Institute of Electrical and Electronics Engineers Inc. (IEEE), Xiamen, China, pp. 1-6.

Direitos

Copyright 2015 IEEE

Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Fonte

School of Electrical Engineering & Computer Science; Science & Engineering Faculty

Palavras-Chave #bird vocalisation retrieval #spectral ridge feature #ridge detection #region representation #environmental audio
Tipo

Conference Paper