Digital Library

MODELING THE UNVOICED COMPONENT IN THE CANONICAL REPRESENTATION OF SPEECH

**Author(s):** RAMIREZ, Miguel Arjona
Contributor(s)	UNIVERSIDADE DE SÃO PAULO
Date(s)	18/10/2012 18/10/2012 2009
Abstract	The canonical representation of speech constitutes a perfect reconstruction (PR) analysis-synthesis system. Its parameters are the autoregressive (AR) model coefficients, the pitch period, and the voiced and unvoiced components of the excitation represented as transform coefficients. Each set of parameters may be operated on independently. A time-frequency unvoiced excitation (TFUNEX) model is proposed that has high time resolution and selective frequency resolution. An improved time-frequency fit is obtained by using the clustering of pitch-synchronous transform tracks, defined in the modulation transform domain, for antialiasing cancellation. The TFUNEX model delivers high-quality speech while compressing the unvoiced excitation representation by a factor of about 13 relative to its raw transform coefficient representation for wideband speech.
Identifier	2009 16TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, VOLS 1 AND 2, p.423-427, 2009 978-1-4244-3297-4 http://producao.usp.br/handle/BDPI/18638 http://apps.isiknowledge.com/InboundService.do?Func=Frame&product=WOS&action=retrieve&SrcApp=EndNote&UT=000276494500071&Init=Yes&SrcAuth=ResearchSoft&mode=FullRecord
Language(s)	eng
Publisher	IEEE
Relation	2009 16th International Conference on Digital Signal Processing, Vols 1 and 2
Rights	restrictedAccess Copyright IEEE
Keywords	#speech analysis #speech coding #scalable coding #modulation transform #time-frequency analysis #Computer Science, Hardware & Architecture #Engineering, Electrical & Electronic #Telecommunications
Type	article original article publishedVersion
