Biblioteca Digital

**Autoria(s):** Ramakrishnan, AG; Abhiram, B; Prasanna, Mahadeva SR
Data(s)	2015
Resumo	A characterization of the voice source (VS) signal by the pitch synchronous (PS) discrete cosine transform (DCT) is proposed. With the integrated linear prediction residual (ILPR) as the VS estimate, the PS DCT of the ILPR is evaluated as a feature vector for speaker identification (SID). On TIMIT and YOHO databases, using a Gaussian mixture model (GMM)-based classifier, it performs on par with existing VS-based features. On the NIST 2003 database, fusion with a GMM-based classifier using MFCC features improves the identification accuracy by 12% in absolute terms, proving that the proposed characterization has good promise as a feature for SID studies. (C) 2015 Acoustical Society of America
Formato	application/pdf
Identificador	http://eprints.iisc.ernet.in/51959/1/Jou_of_Aco_Sco_of_Ame_137-6_EL469_2015.pdf Ramakrishnan, AG and Abhiram, B and Prasanna, Mahadeva SR (2015) Voice source characterization using pitch synchronous discrete cosine transform for speaker identification. In: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 137 (6). EL469-EL475.
Publicador	ACOUSTICAL SOC AMER AMER INST PHYSICS
Relação	http://dx.doi.org/10.1121/1.4921679 http://eprints.iisc.ernet.in/51959/
Palavras-Chave	#Electrical Engineering
Tipo	Journal Article PeerReviewed

Acesso ao item digital