Video genre categorization using audio wavelet coefficients


Autoria(s): Dinh, Phung Quoc; Dorai, Chitra; Venkatesh, Svetha
Contribuinte(s)

Suter, D.

Bab-Hadiashar, A.

Data(s)

01/01/2002

Resumo

In this paper, we investigate the use of a wavelet transform-based analysis of audio tracks accompanying videos for the problem of automatic program genre detection. We compare the classification performance based on wavelet-based audio features to that using conventional features derived from Fourier and time analysis for the task of discriminating TV programs such as news, commercials, music shows, concerts, motor racing games, and animated cartoons. Three different classifiers namely the Decision Trees, SVMs, and k-Nearest Neighbours are studied to analyse the reliability of the performance of our wavelet features based approach. Further, we investigate the issue of an appropriate duration of an audio clip to be analyzed for this automatic genre determination. Our experimental results show that features derived from the wavelet transform of the audio signal can very well separate the six video genres studied. It is also found that there is no significant difference in performance with varying audio clip durations across the classifiers.<br />

Identificador

http://hdl.handle.net/10536/DRO/DU:30044836

Idioma(s)

eng

Publicador

Asian Federation of Computer Vision Societies

Relação

http://dro.deakin.edu.au/eserv/DU:30044836/phung-videogenre-2002.pdf

http://www.aprs.org.au/accv2002/accv2002_proceedings/Dinh69.pdf

Direitos

2002, Springer

Palavras-Chave #wavelet #wavelet-based audio features #fourier #audio signal #automatic program genre detection
Tipo

Conference Paper