Demodulation of Narrowband Speech Spectrograms Using the Riesz Transform


Author(s): Aragonda, Haricharan; Seelamantula, Chandra Sekhar
Date(s)

2015

Abstract

We propose a two-dimensional (2-D) multicomponent amplitude-modulation, frequency-modulation (AM-FM) model for a spectrogram patch corresponding to voiced speech, and develop a new demodulation algorithm that separates the AM, which is related to the vocal tract response, from the carrier, which is related to the excitation. The demodulation algorithm is based on the Riesz transform and is developed along the lines of Hilbert-transform-based demodulation for 1-D AM-FM signals. We compare the performance of the Riesz transform technique with that of the sinusoidal demodulation technique on real speech data. Experimental results show that the Riesz-transform-based demodulation technique represents spectrogram patches accurately. The spectrograms reconstructed from the demodulated AM and carrier are inverted, and the corresponding speech signal is synthesized. The signal-to-noise ratio (SNR) of the reconstructed speech signal, measured with respect to clean speech, was found to be 2 to 4 dB higher for the Riesz transform technique than for the sinusoidal demodulation technique.
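
To make the idea of Riesz-transform-based demodulation concrete, the following is a minimal sketch of the generic monogenic-signal construction for a single-component 2-D AM-FM patch. It is an illustration under stated assumptions, not the authors' algorithm: the function name `riesz_demodulate`, the FFT-based implementation, the zero-mean single-component input, and the periodic boundary handling are all assumptions made here, and the multicomponent model described in the abstract is not reproduced.

```python
import numpy as np


def riesz_demodulate(patch):
    """Split a zero-mean 2-D patch into an amplitude (AM) and a carrier.

    Hypothetical helper: generic monogenic-signal demodulation, assuming a
    single AM-FM component and periodic boundaries (FFT-based filtering).
    """
    rows, cols = patch.shape
    # Frequency grids in cycles per sample along each axis.
    fy = np.fft.fftfreq(rows)[:, None]
    fx = np.fft.fftfreq(cols)[None, :]
    radius = np.sqrt(fx**2 + fy**2)
    radius[0, 0] = 1.0  # avoid 0/0 at DC; the numerators are zero there

    spectrum = np.fft.fft2(patch)
    # Riesz transform frequency responses: -j*fx/|f| and -j*fy/|f|.
    r1 = np.real(np.fft.ifft2(-1j * fx / radius * spectrum))
    r2 = np.real(np.fft.ifft2(-1j * fy / radius * spectrum))

    # Amplitude is the norm of the monogenic signal (patch, r1, r2),
    # the 2-D counterpart of the Hilbert (analytic-signal) envelope.
    amplitude = np.sqrt(patch**2 + r1**2 + r2**2)
    # Carrier is the patch with the amplitude divided out.
    carrier = patch / np.maximum(amplitude, 1e-12)
    return amplitude, carrier


if __name__ == "__main__":
    # Synthetic test patch: constant amplitude times a 2-D cosine carrier.
    y, x = np.mgrid[0:64, 0:64]
    patch = 2.0 * np.cos(2 * np.pi * (4 * x / 64 + 8 * y / 64))
    amplitude, carrier = riesz_demodulate(patch)
    print("max |amplitude - 2|:", np.max(np.abs(amplitude - 2.0)))
```

For a pure 2-D cosine the two Riesz components are scaled sines, so the recovered amplitude is constant and the carrier is the unit-amplitude cosine. On real spectrogram patches the separation is only approximate and depends on how slowly the AM varies relative to the carrier.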

Format

application/pdf

Identifier

http://eprints.iisc.ernet.in/52499/1/IEEE-ACM_Tra_on_Aud_Spe_and_Lan_Pro_23-11_1824_2015.pdf

Aragonda, Haricharan and Seelamantula, Chandra Sekhar (2015) Demodulation of Narrowband Speech Spectrograms Using the Riesz Transform. In: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 23 (11). pp. 1824-1834.

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Relation

http://dx.doi.org/10.1109/TASLP.2015.2449088

http://eprints.iisc.ernet.in/52499/

Keywords #Electrical Engineering

Type

Journal Article

PeerReviewed