An Alternating l(p) - l(2) Projections Algorithm (ALPA) for Speech Modeling using Sparsity Constraints


Author(s): Adiga, Aniruddha; Seelamantula, Chandra Sekhar
Date(s)

2014

Abstract

We address the problem of separating a speech signal into its excitation and vocal-tract filter components, which falls within the framework of blind deconvolution. For voiced speech, the excitation is typically assumed to be sparse and the vocal-tract filter stable. We develop an alternating l(p) - l(2) projections algorithm (ALPA) that performs the deconvolution subject to these constraints. The algorithm is iterative and alternates between two solution spaces. It is initialized with the standard linear-prediction decomposition of a speech signal into an autoregressive filter and a prediction residue. In every iteration, a sparse excitation is estimated by optimizing an l(p)-norm-based cost, and the vocal-tract filter is obtained as the solution to a standard least-squares minimization problem. We validate the algorithm on voiced segments of natural speech signals and show applications to epoch estimation. We also present comparisons with state-of-the-art techniques and show that ALPA gives a sparser, impulse-like excitation in which the impulses directly mark the epochs, or instants of significant excitation.
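The alternation described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not give the exact cost or update rules, so the sparse step below is assumed to be a reweighted soft threshold acting as a surrogate for an l(p) penalty, and the names `lagged_matrix`, `alternating_lp_l2`, `lam`, and `eps`, as well as the synthetic test signal, are hypothetical choices made for illustration only.

```python
import numpy as np


def lagged_matrix(s, order):
    # Row for sample n (n = order .. len(s)-1) holds s[n-1], ..., s[n-order].
    return np.column_stack(
        [s[order - k: len(s) - k] for k in range(1, order + 1)]
    )


def alternating_lp_l2(s, order=10, p=0.5, lam=0.1, n_iter=20, eps=1e-8):
    """Alternate between a sparse excitation estimate and a least-squares
    filter estimate. Illustrative surrogate only, not the published ALPA."""
    s = np.asarray(s, dtype=float)
    S = lagged_matrix(s, order)
    target = s[order:]

    # Initialization: standard least-squares linear prediction.
    a, *_ = np.linalg.lstsq(S, target, rcond=None)
    e = target - S @ a  # prediction residue

    for _ in range(n_iter):
        r = target - S @ a
        # Sparse step (assumption): soft threshold whose level grows as |r|
        # shrinks, a common reweighted surrogate for an l(p) penalty, p < 1.
        thresh = lam * p * (np.abs(r) + eps) ** (p - 1)
        e = np.sign(r) * np.maximum(np.abs(r) - thresh, 0.0)
        # l(2) step: refit the autoregressive coefficients by least squares
        # after removing the current excitation estimate.
        a, *_ = np.linalg.lstsq(S, target - e, rcond=None)
    return a, e


# Synthetic "voiced" signal (assumption): impulse train through a stable AR(2).
a_true = np.array([1.3, -0.8])
x = np.zeros(400)
x[20::80] = 1.0
s_demo = np.zeros(400)
for n in range(400):
    past = sum(a_true[k] * s_demo[n - 1 - k] for k in range(2) if n - 1 - k >= 0)
    s_demo[n] = x[n] + past

a_hat, e_hat = alternating_lp_l2(s_demo, order=2, p=0.5, lam=0.1)
epochs = np.flatnonzero(e_hat) + 2  # shift by the model order to signal indices
```

In this toy setting the nonzero entries of `e_hat` play the role of epoch candidates. On natural speech, the order, `p`, and `lam` would need tuning, and the stability of the estimated filter would have to be checked separately, since this sketch does not enforce it.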

Format

application/pdf

Identifier

http://eprints.iisc.ernet.in/52525/1/Pro_of%20_the_19th_Int_Con_on_Dig_Sig_Pro_291_2014.pdf

Adiga, Aniruddha and Seelamantula, Chandra Sekhar (2014) An Alternating l(p) - l(2) Projections Algorithm (ALPA) for Speech Modeling using Sparsity Constraints. In: 19th International Conference on Digital Signal Processing (DSP), August 20-23, 2014, Hong Kong, People's Republic of China, pp. 291-296.

Publisher

IEEE

Relation

http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6900673

http://eprints.iisc.ernet.in/52525/

Keywords

#Electrical Engineering

Type

Conference Proceedings

NonPeerReviewed