Biblioteca Digital

ASR emotional speech: Clarifying the issues and enhancing performance

**Autoria(s):** Athanaselis, T.; Bakamidis, S.; Dologlou, I.; Cowie, Roddy; Douglas-Cowie, Ellen; Cox, C.
Data(s)	01/05/2005
Resumo	There are multiple reasons to expect that recognising the verbal content of emotional speech will be a difficult problem, and recognition rates reported in the literature are in fact low. Including information about prosody improves recognition rate for emotions simulated by actors, but its relevance to the freer patterns of spontaneous speech is unproven. This paper shows that recognition rate for spontaneous emotionally coloured speech can be improved by using a language model based on increased representation of emotional utterances. The models are derived by adapting an already existing corpus, the British National Corpus (BNC). An emotional lexicon is used to identify emotionally coloured words, and sentences containing these words are recombined with the BNC to form a corpus with a raised proportion of emotional material. Using a language model based on that technique improves recognition rate by about 20%. (c) 2005 Elsevier Ltd. All rights reserved.
Identificador	http://pure.qub.ac.uk/portal/en/publications/asr-emotional-speech-clarifying-the-issues-and-enhancing-performance(47691c76-7cb2-4e9e-a8b2-bd4cd8627fed).html http://dx.doi.org/10.1016/j.neunet.2005.03.008
Idioma(s)	eng
Direitos	info:eu-repo/semantics/restrictedAccess
Fonte	Athanaselis , T , Bakamidis , S , Dologlou , I , Cowie , R , Douglas-Cowie , E & Cox , C 2005 , ' ASR emotional speech: Clarifying the issues and enhancing performance ' Neural Networks , vol 18 , no. 4 , pp. 437-444 . DOI: 10.1016/j.neunet.2005.03.008
Palavras-Chave	#/dk/atira/pure/subjectarea/asjc/1700/1702 #Artificial Intelligence #/dk/atira/pure/subjectarea/asjc/2800 #Neuroscience(all)
Tipo	article

Acesso ao item digital