On-line Emotion Recognition in a 3-D Activation-Valence-Time Continuum using Acoustic and Linguistic Cues
Date(s) | 01/03/2010 |
---|---|
Abstract | For many applications of emotion recognition, such as virtual agents, the system must select responses while the user is speaking. This requires reliable on-line recognition of the user’s affect. However, most emotion recognition systems are based on turnwise processing. We present a novel approach to on-line emotion recognition from speech using Long Short-Term Memory Recurrent Neural Networks. Emotion is recognised frame-wise in a two-dimensional valence-activation continuum. In contrast to current state-of-the-art approaches, recognition is performed on low-level signal frames, similar to those used for speech recognition. No statistical functionals are applied to low-level feature contours. Framing at a higher level is therefore unnecessary and regression outputs can be produced in real-time for every low-level input frame. We also investigate the benefits of including linguistic features on the signal frame level obtained by a keyword spotter. |
Identifier | http://dx.doi.org/10.1007/s12193-009-0032-6 http://www.scopus.com/inward/record.url?scp=77949304464&partnerID=8YFLogxK |
Language(s) | eng |
Rights | info:eu-repo/semantics/restrictedAccess |
Source | Eyben, F, Wollmer, M, Graves, A, Schuller, B, Douglas-Cowie, E & Cowie, R 2010, 'On-line Emotion Recognition in a 3-D Activation-Valence-Time Continuum using Acoustic and Linguistic Cues', Journal on Multimodal User Interfaces, vol 3, no. 1-2, pp. 7-19. DOI: 10.1007/s12193-009-0032-6 |
Keywords | #emotion recognition #databases #acoustic and linguistic cues #/dk/atira/pure/subjectarea/asjc/1700/1709 #Human-Computer Interaction #/dk/atira/pure/subjectarea/asjc/1700/1711 #Signal Processing |
Type | article |