Acoustic feature extraction using perceptual wavelet packet decomposition for frog call classification


Autoria(s): Xie, Jie; Towsey, Michael; Eichinski, Philip; Zhang, Jinglan; Roe, Paul
Data(s)

01/08/2015

Resumo

Frog protection has become increasingly essential due to the rapid decline of its biodiversity. Therefore, it is valuable to develop new methods for studying this biodiversity. In this paper, a novel feature extraction method is proposed based on perceptual wavelet packet decomposition for classifying frog calls in noisy environments. Pre-processing and syllable segmentation are first applied to the frog call. Then, a spectral peak track is extracted from each syllable if possible. Track duration, dominant frequency and oscillation rate are directly extracted from the track. With k-means clustering algorithm, the calculated dominant frequency of all frog species is clustered into k parts, which produce a frequency scale for wavelet packet decomposition. Based on the adaptive frequency scale, wavelet packet decomposition is applied to the frog calls. Using the wavelet packet decomposition coefficients, a new feature set named perceptual wavelet packet decomposition sub-band cepstral coefficients is extracted. Finally, a k-nearest neighbour (k-NN) classifier is used for the classification. The experiment results show that the proposed features can achieve an average classification accuracy of 97.45% which outperforms syllable features (86.87%) and Mel-frequency cepstral coefficients (MFCCs) feature (90.80%).

Formato

application/pdf

Identificador

http://eprints.qut.edu.au/89674/

Publicador

IEEE

Relação

http://eprints.qut.edu.au/89674/1/acoustic%20feature%20extraction%20using%20perceptual%20wavelet%20packet%20decomposition%20for%20frog%20call%20classification.pdf

DOI:10.1109/eScience.2015.47

Xie, Jie, Towsey, Michael, Eichinski, Philip, Zhang, Jinglan, & Roe, Paul (2015) Acoustic feature extraction using perceptual wavelet packet decomposition for frog call classification. In 2015 IEEE 11th International Conference on e-Science (e-Science), IEEE, Munich, Germany, pp. 237-242.

Direitos

Copyright 2015 IEEE

Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Fonte

Science & Engineering Faculty

Palavras-Chave #080105 Expert Systems #080109 Pattern Recognition and Data Mining #089999 Information and Computing Sciences not elsewhere classified #frog call classification #k-means clustering #spectral peak track #wavelet packet decomposition
Tipo

Conference Paper