Comparative study of filter-bank mean-energy distance for automated segmentation of speech signals


Autoria(s): Ananthakrishnan, G; Ranjani, HG; Ramakrishnan, AG
Data(s)

2006

Resumo

This paper describes a method of automated segmentation of speech assuming the signal is continuously time varying rather than the traditional short time stationary model. It has been shown that this representation gives comparable if not marginally better results than the other techniques for automated segmentation. A formulation of the 'Bach' (music semitonal) frequency scale filter-bank is proposed. A comparative study has been made of the performances using Mel, Bark and Bach scale filter banks considering this model. The preliminary results show up to 80 % matches within 20 ms of the manually segmented data, without any information of the content of the text and without any language dependence. 'Bach' filters are seen to marginally outperform the other filters.

Formato

application/pdf

Identificador

http://eprints.iisc.ernet.in/30530/1/04156573.pdf

Ananthakrishnan, G and Ranjani, HG and Ramakrishnan, AG (2006) Comparative study of filter-bank mean-energy distance for automated segmentation of speech signals. In: International Conference on Signal Processing, Communications and Networks,, Feb 22-24, 2007, Chennai, India, pp. 6-10.

Publicador

Institute of Electrical and Electronics Engineers

Relação

http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4156573

http://eprints.iisc.ernet.in/30530/

Palavras-Chave #Electrical Engineering
Tipo

Conference Paper

PeerReviewed