Patrol team language identification system for DARPA RATS P1 evaluation


Author(s): Matějka, Pavel; Plchot, Oldřich; Soufifar, Mehdi; Glembek, Ondřej; D'haro Enríquez, Luis Fernando; Veselý, Karel; Grézl, František; Ma, Jeff; Matsoukas, Spyros; Dehak, Najim
Date(s)

2012

Abstract

This paper describes the language identification (LID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state-of-the-art detection capabilities on audio from highly degraded communication channels. We show that techniques originally developed for LID on telephone speech (e.g., for the NIST language recognition evaluations) remain effective on the noisy RATS data, provided that the training and development sets are designed carefully. In addition, we show significant improvements from the use of Wiener filtering, neural-network-based and language-dependent i-vector modeling, and fusion.

Format

application/pdf

Identifier

http://oa.upm.es/20384/

Language(s)

eng

Publisher

E.T.S.I. Telecomunicación (UPM)

Relation

http://oa.upm.es/20384/1/INVE_MEM_2012_134238.pdf

Rights

http://creativecommons.org/licenses/by-nc-nd/3.0/es/

info:eu-repo/semantics/openAccess

Source

InterSpeech 2012 - 13th Annual Conference of the International Speech Communication Association | 09/09/2012 - 13/09/2012 | Portland, Oregon

Keywords #Telecommunications
Type

info:eu-repo/semantics/conferenceObject

Conference or workshop paper

PeerReviewed