Speech and Language Recognition using MFCC and DELTA-MFCC

  IJETT-book-cover  International Journal of Engineering Trends and Technology (IJETT)          
  
© 2014 by IJETT Journal
Volume-12 Number-9
Year of Publication : 2014
Authors : Samiksha Sharma , Anupam Shukla , Pankaj Mishra
  10.14445/22315381/IJETT-V12P286

MLA 

Samiksha Sharma , Anupam Shukla , Pankaj Mishra. "Speech and Language Recognition using MFCC and DELTA-MFCC", International Journal of Engineering Trends and Technology (IJETT), V12(9),449-452 June 2014. ISSN:2231-5381. www.ijettjournal.org. published by seventh sense research group

Abstract

In this paper, a model is proposed to recognize speech and language by speech signal for countries like India where many languages are spoken. We have used MFCC and delta-MFCCs as acoustic features. We used a supervised learning technique to train ANN (Artificial neural network) as recognizer. To train this model resilient back propagation algorithm and radial basis function neural network used and results are compared. The ANN model tries to classify the input with respect to a set of words and languages. In this work four Indian languages Hindi, English, Sanskrit and Telugu are used. A multi speaker Speech recognition and language recognizer proposed for these four Indian languages

References

[1] Wiqas Ghai and Navdeep Singh. Article: Literature Review on Automatic Speech Recognition. International Journal of Computer Applications 41(8):42-50, March 2012. Published by Foundation of Computer Science, New York, USA.
[2] Pruthi T, Saksena, S and Das, P K Swaranjali “Isolated Word Recognition for Hind Language using VQ and HMM” International Conference on Multimedia Processing and Systems (ICMPS), IIT Madras.
[3] Mohamad Adnan Al-Alaoui, Lina Al-Kanj, Jimmy Azar, Elias Yaacoub,”Speech Recognition using Artifical Neural Network and Hidden arkov Modal” IEEE Multidisciplinary Engineering Education ol3, No. 3, 2008 .
[4] Roopa A.Thorat , Ruchira.A.Jadhav , “Speech Recognition System ”, International Conference on Advances in Computing, Communication and Control, 2009.
[5] Sandipan Chakroborty and Goutam Saha, “mproved Text-Independent Speaker Identification using Fused MFCC & IMFCC Feature Sets based on Gaussian Filter”, International Journal of Signal Processing 5, Winter 2009.
[6] Akram M. Othman, May H. Riadh , “Speech Recognition Using Scaly Neural Networks”, proceeding of world academy of science, engineering & technology, 2008.
[7] Masmoudi, S.; Chtourou, M.; Ben Hamida, A., "Isolated word recognition system using MLP neural network constructive training algorithm," Systems, Signals and Devices, 2009. SSD '09. 6th International Multi-Conference on , vol., no., pp.1,6, 23-26 March 2009
[8] Hui Lin and Jui-Ting Huang and Francoise Beaufays and Brian Strope and Yun-hsuan Sung,”Recognition of Multilingual Speech in Mobile Applications,” ICASSP 2012.
[9] Zissman, M.A.; Singer, E., "Automatic language identification of telephone speech messages using phoneme recognition and N-gram modeling," Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, vol.i, no., pp.I/305,I/308 vol.1, 19-22 Apr 1994.
[10] Leena, M.; Srinivasa Rao, K.; Yegnanarayana, B., "Neural network classifiers for language identification using phonotactic and prosodic features," Intelligent Sensing and Information Processing, 2005. Proceedings of 2005 International Conference on , vol., no., pp.404,408, 4-7 Jan. 2005
[11] Syama, R “ Speech Recognition System for Malayalam” Department of Computer Science .Cochin University of Science & Technology, Cochin 2008.
[12] Deivapalan, P G and Murthy, H A, ” A syllable-based isolated word recognizer for Tamil handling OOV words,” The National Conference on Communications, pp. 267-271 2008.
[13] Saurabh Karsoliya, “Approximating Number of Hidden layer neurons in Multiple Hidden Layer BPNN Architecture,” International Journal of Engineering Trends and Technology, Volume 3, Issue6, 2012
[14] Rahul Kala, Harsh Vazirani, Anupam Shukl, Ritu Tiwari, “Fusion of Speech and Face by Enhanced Modular Neural Network,” ICISTM 2010.
[15] Akram M. Othman, and May H. Riadh, “Speech Recognition Using Scaly Neural Networks”, World Academy of Science, Engineering and Technology, vol. 38, 2008.
[16] Chee Peng Lim, Siew Chan Woo, Aun Sim Loh, Rohaizan Osman, "Speech Recognition Using Artificial Neural Networks," wise, vol. 1, pp.0419, First International Conference on Web Information Systems Engineering (WISE'00)-Volume 1, 2000.
[17] J.H. McClellan, R.W. Schafer, M.A. Yoder, "Signal Processing First", Prentice Hall, 2003, pp. 415-426.
[18] S. Haykin, "Neural Networks: a comprehensive foundation", 2nd Edition, Prentice Hall, 1999.

References
ANN, Speech Recognition, language Recognition, back propagation algorithm, Radial basis function.