Digital Library

Creating a spontaneous conversational speech corpus

**Author(s):** Husin, Maria; Stewart, Darryl; Ji, Ming; Smith, F. J.
Date(s)	15/01/2012
Abstract	Speech recognition and language analysis of spontaneous speech arising in naturally spoken conversations are becoming the subject of much research. However, there is a shortage of spontaneous speech corpora that are freely available for academics. We therefore undertook the building of a natural conversation speech database, recording over 200 hours of conversations in English by over 600 local university students. With few exceptions, the students used their own cell phones from their own rooms or homes to speak to one another, and they were permitted to speak on any topic they chose. Although they knew that they were being recorded and that they would receive a small payment, their conversations in the corpus are probably very close to being natural and spontaneous. This paper describes a detailed case study of the problems we faced and the methods we used to make the recordings and control the collection of these social science data on a limited budget.
Format	application/pdf
Identifier	http://pure.qub.ac.uk/portal/en/publications/creating-a-spontaneous-conversational-speech-corpus(3eff6d89-6d66-4d9d-8b79-7f84f7f50977).html http://dx.doi.org/10.2481/dsj.10-011 http://pure.qub.ac.uk/ws/files/948806/Creating%20a%20Spontaneous%20Conversational%20Speech%20Corpus.pdf
Language(s)	eng
Rights	info:eu-repo/semantics/restrictedAccess
Source	Husin, M, Stewart, D, Ji, M & Smith, F J 2012, 'Creating a spontaneous conversational speech corpus', Data Science Journal, vol 10, pp. 42-51. DOI: 10.2481/dsj.10-011
Keywords	#/dk/atira/pure/subjectarea/asjc/1700/1701 #Computer Science (miscellaneous) #/dk/atira/pure/subjectarea/asjc/1700/1706 #Computer Science Applications
Type	article
