PERFORMANCE OF DIFFERENT CLASSIFIERS IN SPEECH RECOGNITION


Autoria(s): Poulose Jacob,K; Sonia, Sunny; David, Peter S
Data(s)

13/06/2014

13/06/2014

01/04/2013

Resumo

Speech is the most natural means of communication among human beings and speech processing and recognition are intensive areas of research for the last five decades. Since speech recognition is a pattern recognition problem, classification is an important part of any speech recognition system. In this work, a speech recognition system is developed for recognizing speaker independent spoken digits in Malayalam. Voice signals are sampled directly from the microphone. The proposed method is implemented for 1000 speakers uttering 10 digits each. Since the speech signals are affected by background noise, the signals are tuned by removing the noise from it using wavelet denoising method based on Soft Thresholding. Here, the features from the signals are extracted using Discrete Wavelet Transforms (DWT) because they are well suitable for processing non-stationary signals like speech. This is due to their multi- resolutional, multi-scale analysis characteristics. Speech recognition is a multiclass classification problem. So, the feature vector set obtained are classified using three classifiers namely, Artificial Neural Networks (ANN), Support Vector Machines (SVM) and Naive Bayes classifiers which are capable of handling multiclasses. During classification stage, the input feature vector data is trained using information relating to known patterns and then they are tested using the test data set. The performances of all these classifiers are evaluated based on recognition accuracy. All the three methods produced good recognition accuracy. DWT and ANN produced a recognition accuracy of 89%, SVM and DWT combination produced an accuracy of 86.6% and Naive Bayes and DWT combination produced an accuracy of 83.5%. ANN is found to be better among the three methods.

IJRET | APR 2013 Volume: 2 Issue: 4,590 - 597

Cochin University of Science & Technology

Identificador

2319 - 1163

http://dyuthi.cusat.ac.in/purl/3914

Idioma(s)

en

Publicador

IJRET

Palavras-Chave #Speech Recognition #Soft Thresholding #Discrete Wavelet Transforms #Artificial Neural Networks #Support Vector Machines and Naive Bayes Classifier
Tipo

Article