ClInt: A bilingual Spanish-Catalan spoken corpus of clinical interviews


Autoria(s): Vila Rigat, Marta; González Fuente, Santiago; Martí Antonin, M. Antònia; Llisterri, Joaquim; Machuca Ayuso, María Jesús
Contribuinte(s)

Universitat de Barcelona

Resumo

In this paper we present ClInt (Clinical Interview), a bilingual Spanish-Catalan spoken corpus that contains 15 hours of clinical interviews. It consists of audio files aligned with multiple-level transcriptions comprising orthographic, phonetic and morphological information, as well as linguistic and extralinguistic encoding. This is a previously non-existent resource for these languages and it offers a wide-ranging exploitation potential in a broad variety of disciplines such as Linguistics, Natural Language Processing and related fields.

Identificador

http://hdl.handle.net/2445/48252

Idioma(s)

eng

Publicador

Sociedad Española para el Procesamiento del Lenguaje Natural (SEPLN)

Direitos

(c) Vila Rigat, Marta et al., 2010

info:eu-repo/semantics/openAccess

Palavras-Chave #Tipologia (Lingüística) #Corpus (Lingüística) #Tractament del llenguatge natural (Informàtica) #Lingüística computacional #Català #Castellà (Llengua) #Typology (Linguistics) #Corpora (Linguistics) #Natural language processing (Computer science) #Computational linguistics #Catalan language #Spanish language
Tipo

info:eu-repo/semantics/article

info:eu-repo/semantics/publishedVersion