Performance Analysis of Different Wavelet Families in Recognizing Speech

International Journal of Engineering Trends and Technology (IJETT)
  
© 2013 by IJETT Journal
Volume-4 Issue-4                       
Year of Publication : 2013
Authors: Sonia Sunny, David Peter S, K Poulose Jacob

Citation 

Sonia Sunny, David Peter S, K Poulose Jacob. "Performance Analysis of Different Wavelet Families in Recognizing Speech". International Journal of Engineering Trends and Technology (IJETT). V4(4):512-517 Apr 2013. ISSN:2231-5381. www.ijettjournal.org. Published by Seventh Sense Research Group.

Abstract

Automatic Speech Recognition (ASR) is a challenging area of research in digital signal processing and engineering, and it has a wide range of applications. In this paper, a speech recognition system is developed for recognizing speaker-independent, isolated spoken words in Malayalam. Voice signals are sampled directly from the microphone, and features are extracted using the Discrete Wavelet Transform (DWT). Many wavelet families are available for speech processing and mathematical analysis. Because the DWT depends on the choice of wavelet, the main issue is to identify the wavelets best suited to speech recognition. This paper investigates the performance of different wavelet families, including Haar, Daubechies, Symlets, and Coiflets. A multilayer neural network trained with the backpropagation algorithm is used for classification. The proposed method is implemented for 1000 speakers uttering 10 isolated words each. The experimental results show that recognition accuracy varies with the wavelet family; the best result, 90.2%, is obtained using the Daubechies wavelet of order 4.
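
As an illustration of the DWT feature-extraction step described in the abstract, the following is a minimal Python sketch of a multi-level decomposition using the Haar wavelet, the simplest of the families compared. It is not the authors' implementation: the function names (haar_dwt_step, haar_wavedec, subband_energies) are hypothetical, the odd-length padding rule is an assumption, and the use of per-subband energies as a feature vector is an assumed choice for illustration only. The paper's best result used a Daubechies wavelet, which would need a longer filter than the two-tap Haar filter shown here.

```python
import math


def haar_dwt_step(signal):
    """One level of the Haar DWT: returns (approximation, detail) coefficients."""
    samples = list(signal)
    if len(samples) % 2:
        # Assumption: pad odd-length input by repeating the last sample.
        samples.append(samples[-1])
    scale = 1.0 / math.sqrt(2.0)
    approx = [(samples[i] + samples[i + 1]) * scale for i in range(0, len(samples), 2)]
    detail = [(samples[i] - samples[i + 1]) * scale for i in range(0, len(samples), 2)]
    return approx, detail


def haar_wavedec(signal, levels):
    """Multi-level decomposition, ordered as [cA_n, cD_n, ..., cD_1]."""
    coeffs = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_dwt_step(approx)
        coeffs.insert(0, detail)  # coarser detail bands go in front
    coeffs.insert(0, approx)      # final approximation band comes first
    return coeffs


def subband_energies(coeffs):
    """Hypothetical feature vector: sum of squared coefficients per subband."""
    return [sum(c * c for c in band) for band in coeffs]


if __name__ == "__main__":
    frame = [4, 6, 10, 12, 8, 6, 5, 5]  # toy stand-in for a speech frame
    bands = haar_wavedec(frame, levels=3)
    print(subband_energies(bands))
```

Because the Haar transform is orthonormal, the subband energies sum to the energy of the input frame when its length is a multiple of 2 raised to the number of levels, which gives a quick sanity check on any implementation. A vector of this kind could then be passed to a classifier such as the multilayer network mentioned in the abstract.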

Keywords
Speech Recognition, Feature Extraction, Wavelet Families, Discrete Wavelet Transforms, Classification, Artificial Neural Networks.