A simple but efficient voice activity detection algorithm through Hilbert transform and dynamic threshold for speech pathologies


Autoria(s): Ortiz P., D.; Villa, Luisa F.; Salazar, Carlos; Quintero, O.L.
Contribuinte(s)

Universidad EAFIT. Escuela de Ciencias. Grupo de Investigación Modelado Matemático

dpuerta1@eafit.edu.co

oquinte1@eafit.edu.co

Mathematical Modeling Research Group, GRIMMAT, School of Sciences, Universidad EAFIT, Medellín, Colombia

Data(s)

2016

11/05/2016

2016

11/05/2016

Resumo

A simple but efficient voice activity detector based on the Hilbert transform and a dynamic threshold is presented to be used on the pre-processing of audio signals -- The algorithm to define the dynamic threshold is a modification of a convex combination found in literature -- This scheme allows the detection of prosodic and silence segments on a speech in presence of non-ideal conditions like a spectral overlapped noise -- The present work shows preliminary results over a database built with some political speech -- The tests were performed adding artificial noise to natural noises over the audio signals, and some algorithms are compared -- Results will be extrapolated to the field of adaptive filtering on monophonic signals and the analysis of speech pathologies on futures works

20th Argentinean Bioengineering Society Congress, SABI 2015 (XX Congreso Argentino de Bioingeniería y IX Jornadas de Ingeniería Clínica)28–30 October 2015, San Nicolás de los Arroyos, Argentina

Formato

application/pdf

Identificador

1742-6596

http://dx.doi.org/10.1088/1742-6596/705/1/012037

http://hdl.handle.net/10784/8373

10.1088/1742-6596/705/1/012037

Idioma(s)

eng

Publicador

IOP Publishing

Relação

Journal of Physics: Conference Series; Vol. 705, Núm. 1 (2016); pp.9

http://dx.doi.org/10.1088/1742-6596/705/1/012037

Direitos

info:eu-repo/semantics/openAccess

openAccess

Libre acceso

Creative Commons Attribution 3.0 licence (CC BY 3.0)

Fonte

Journal of Physics: Conference Series; Vol. 705, Núm. 1 (2016); pp.9

Palavras-Chave #Transformada de Hilbert #Cancelación de ruidos #Señal monofónica #PROCESAMIENTO DE SEÑALES #PROCESAMIENTO DE SEÑALES - TÉCNICAS DIGITALES #MEDICIÓN DEL RUIDO #FILTROS ADAPTIVOS #ANÁLISIS DE FOURIER #TEORÍA ESPECTRAL (MATEMÁTICAS) #ANÁLISIS ESPECTRAL #PROCESOS DE GAUSS #UMBRAL AUDITIVO #Signal processing #Signal processing - Digital techniques #Noise - Measurement #Adaptive filters #Fourier analysis #Spectral theory (mathematics) #Spectrum analysis #Gaussian processes #Auditory threshold
Tipo

info:eu-repo/semantics/article

info:eu-repo/semantics/publishedVersion

article

Artículo