Robust speaker verification via fusion of speech and lip modalities


Author(s): Wark, T.; Sridharan, S.; Chandran, V.

Date(s)

1999

Abstract

This paper investigates the use of lip information, in conjunction with speech information, for robust speaker verification in the presence of background noise. Our own previous work, and the work of others, has shown that features extracted from a speaker's moving lips carry speaker-dependent information that is complementary to speech features. We demonstrate that fusing lip and speech information yields a highly robust speaker verification system that outperforms either sub-system alone. We present a new technique for determining the weighting to be applied to each modality so as to optimize the performance of the fused system. Given a correct weighting, lip information is shown to be highly effective for reducing the false acceptance and false rejection error rates in the presence of background noise.
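
The abstract does not describe how the weighting is computed, so the following is only a minimal sketch of generic weighted score fusion, not the authors' technique. It assumes each modality produces a scalar match score, that the two scores are combined linearly with a weight `alpha` on speech and `1 - alpha` on lip, and that a claim is accepted when the fused score reaches a threshold. All function names, parameters, and scores are hypothetical and exist purely for illustration.

```python
def fuse_scores(speech_score, lip_score, alpha):
    """Linearly fuse two match scores (assumed fusion rule, not from the paper).

    alpha weights the speech score; (1 - alpha) weights the lip score.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return alpha * speech_score + (1.0 - alpha) * lip_score


def verify(speech_score, lip_score, alpha, threshold):
    """Accept the identity claim if the fused score reaches the threshold."""
    return fuse_scores(speech_score, lip_score, alpha) >= threshold


def error_rates(genuine, impostor, alpha, threshold):
    """Return (false_acceptance_rate, false_rejection_rate).

    genuine and impostor are lists of (speech_score, lip_score) pairs
    for true-claimant and impostor trials respectively.
    """
    false_rejects = sum(not verify(s, l, alpha, threshold) for s, l in genuine)
    false_accepts = sum(verify(s, l, alpha, threshold) for s, l in impostor)
    return false_accepts / len(impostor), false_rejects / len(genuine)


# Made-up scores: the speech score of the first genuine trial and of the
# first impostor trial are assumed to be degraded by background noise.
genuine_trials = [(0.2, 0.9), (0.8, 0.7)]
impostor_trials = [(0.6, 0.1), (0.1, 0.2)]

speech_only = error_rates(genuine_trials, impostor_trials, alpha=1.0, threshold=0.5)
fused = error_rates(genuine_trials, impostor_trials, alpha=0.5, threshold=0.5)
```

With these toy scores, relying on speech alone misclassifies the noise-affected trials, while an equal weighting classifies all four correctly. In practice the weight would have to be chosen from held-out data or from an estimate of the noise conditions; how the paper selects it is not stated in this record.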

Format

application/pdf

Identifier

http://eprints.qut.edu.au/45590/

Publisher

IEEE

Relation

http://eprints.qut.edu.au/45590/1/c45590P.pdf

DOI:10.1109/ICASSP.1999.757487

Wark, T., Sridharan, S., & Chandran, V. (1999) Robust speaker verification via fusion of speech and lip modalities. In Proceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Phoenix, Arizona, pp. 3061-3064.

Rights

Copyright 1999 IEEE

Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Source

Faculty of Built Environment and Engineering; School of Engineering Systems

Keywords

#acoustic noise #audio-visual systems #feature extraction #gesture recognition #sensor fusion #speaker recognition #background noise #error rates #false acceptance #false rejection #features extraction #fusion #moving lips #performance #robust speaker verification #speech features #speech information #weighting

Type

Conference Paper