Likelihood-maximising frameworks for enhanced in-car speech recognition
Data(s) |
25/06/2009
|
---|---|
Resumo |
Speech recognition in car environments has been identified as a valuable means for reducing driver distraction when operating non-critical in-car systems. Likelihood-maximising (LIMA) frameworks optimise speech enhancement algorithms based on recognised state sequences rather than traditional signal-level criteria such as maximising signal-to-noise ratio. Previously presented LIMA frameworks require calibration utterances to generate optimised enhancement parameters which are used for all subsequent utterances. Sub-optimal recognition performance occurs in noise conditions which are significantly different from that present during the calibration session - a serious problem in rapidly changing noise environments. We propose a dialog-based design which allows regular optimisation iterations in order to track the changing noise conditions. Experiments using Mel-filterbank spectral subtraction are performed to determine the optimisation requirements for vehicular environments and show that minimal optimisation assists real-time operation with improved speech recognition accuracy. It is also shown that the proposed design is able to provide improved recognition performance over frameworks incorporating a calibration session. |
Formato |
application/pdf |
Identificador | |
Relação |
http://eprints.qut.edu.au/26037/2/26037.pdf Kleinschmidt, Tristan, Sridharan, Sridha, & Mason, Michael (2009) Likelihood-maximising frameworks for enhanced in-car speech recognition. In 4th Biennial Workshop on DSP for In-Vehicle Systems and Safety, 25-27 June, 2009, Dallas, TX, USA. |
Fonte |
Faculty of Built Environment and Engineering; Information Security Institute; School of Engineering Systems |
Palavras-Chave | #090609 Signal Processing #In-vehicle speech technology #Robust speech recognition #Speech enhancement #Optimisation #Dialog systems |
Tipo |
Conference Paper |