Enhancing Automatic Recognition of Isolated Arabic Speech Using Artificial Intelligence Techniques: A Systematic Review

Enhancing Automatic Recognition of Isolated Arabic Speech Using Artificial Intelligence Techniques: A Systematic Review

  IJETT-book-cover           
  
© 2025 by IJETT Journal
Volume-73 Issue-3
Year of Publication : 2025
Author : Mithal Khaleel Ismael, Goh Chin Hock, Hazem Noori Abdulrazzak
DOI : 10.14445/22315381/IJETT-V73I3P116

How to Cite?
Mithal Khaleel Ismael, Goh Chin Hock, Hazem Noori Abdulrazzak, "Enhancing Automatic Recognition of Isolated Arabic Speech Using Artificial Intelligence Techniques: A Systematic Review," International Journal of Engineering Trends and Technology, vol. 73, no. 3, pp. 212-229, 2025. Crossref, https://doi.org/10.14445/22315381/IJETT-V73I3P116

Abstract
Automatic Speech Recognition (ASR) and spoken language systems have indeed made remarkable strides, fueled by advances in artificial intelligence, deep learning, and computational power. Modern ASR systems are now more accurate, robust, and capable of handling a variety of applications, from voice assistants to real-time transcription services. This review discusses the developments in isolated Arabic speech recognition using different AI methodologies. It emphasizes fundamental techniques, such as deep learning and machine learning algorithms, and then evaluates their effectiveness in enhancing recognition accuracy. This paper highlights the inherent obstacles of the Arabic language, including dialectal differences and phonetic complexities. It also examines the importance of feature extraction and model training in improving performance. The methodologies used in speech detection and processing, identifying gaps and correlations between known patterns, and presenting recent patterns are illustrated in this paper. A systematic review of the selected studies was conducted to identify and select relevant papers. The evaluation indicates that despite significant progress, additional research is needed to overcome current limitations and enhance practical application.

Keywords
Dialect recognition, MSA, Speech recognition, Artificial Intelligence, Machine Learning, ASR.

References
[1] Vlado Delić et al., “Speech Technology Progress Based on New Machine Learning Paradigm,” Computational Intelligence and Neuroscience, vol. 2019, no. 1, pp. 1-19, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Siddique Latif et al., “Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends,” Arxiv, pp. 1-25, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Yogesh Kumar, and Navdeep Singh, “A Comprehensive View of Automatic Speech Recognition System: A Systematic Literature Review,” International Conference on Automation, Computational and Technology Management, London, UK, pp. 168-173, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Anjuli Kannan et al., “Large-Scale Multilingual Speech Recognition with a Streaming End-To-End Model,” arXiv Preprint, pp. 2130-2134, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Chung-Cheng Chiu et al., “State-Of-The-Art Speech Recognition with Sequence-To-Sequence Models,” IEEE International Conference on Acoustics, Speech and Signal Processing, Calgary, AB, Canada, pp. 1-5, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Jing-Xuan Zhang et al., “Sequence-To-Sequence Acoustic Modeling for Voice Conversion,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27, no. 3, pp. 631-644, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Garima Sharma, Kartikeyan Umapathy, and Sridhar Krishnan, “Trends in Audio Signal Feature Extraction Methods,” Applied Acoustics, vol. 158, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Parashar Dhakal et al., “A Near Real-Time Automatic Speaker Recognition Architecture for Voice-Based User Interface,” Machine Learning and Knowledge Extraction, vol. 1, no. 1, pp. 504-520, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Motaz Hamza, Touraj Khodadadi, and Sellappan Palaniappan, “A Novel Automatic Voice Recognition System Based on Text-Independent in A Noisy Environment,” International Journal of Electrical and Computer Engineering, vol. 10, no. 4, pp. 3643-3650, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Suvarnsing Bhable, Ashish Lahase, and Santosh Maher, “Automatic Speech Recognition (ASR) of Isolated Words in Hindi Low-Resource Language,” International Journal for Research in Applied Science & Engineering Technology, vol. 9, no. 2, pp. 260-265, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Ankita Wadhawan, and Parteek Kumar, “Deep Learning-Based Sign Language Recognition System for Static Signs,” Neural Computing and Applications, vol. 33, no. 12, pp. 7957-7968, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Imane Guellil et al., “Arabic Natural Language Processing: An Overview,” Journal of King Saud University-Computer and Information Sciences, vol. 33, no. 5, pp. 497-507, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Samira Hazmoune et al., “A New Hybrid Framework Based on Hidden Markov Models and K-Nearest Neighbors for Speech Recognition,” International Journal of Speech Technology, vol. 21, no. 3, pp. 689-704, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Abdelmadjid Benmachiche, and Amina Makhlouf, “Optimization of Hidden Markov Model with Gaussian Mixture Densities for Arabic Speech Recognition,” WSEAS Transactions on Signal Processing, vol. 15, pp. 85-94, 2019.
[Google Scholar] [Publisher Link]
[15] Vishal Passricha, and Rajesh Kumar Aggarwal, “End-To-End Acoustic Modeling Using Convolutional Neural Networks,” Intelligent Speech Signal Processing, pp. 5-37, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Marvin Coto-Jiménez, “Improving Post-Filtering of Artificial Speech Using Pre-Trained LSTM Neural Networks,” Biomimetics, vol. 4, no. 2, pp. 1-17, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Brahim Fares Zaidi et al., “Deep Neural Network Architectures for Dysarthric Speech Analysis and Recognition,” Neural Computing and Applications, vol. 33, no. 15, pp. 9089-9108, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Kateřina Žmolíková et al., “Speakerbeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures,” IEEE Journal of Selected Topics in Signal Processing, vol. 13, no. 4, pp. 800-814, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Mutian Zhuang, “New Features for Speech Processing Standard Pronunciation Classification,” Master Thesis, Northern Illinois University, 2021.
[Google Scholar]
[20] Jinchuan Tian et al., “Improving Mandarin End-To-End Speech Recognition with Word N-Gram Language Model,” IEEE Signal Processing Letters, vol. 29, no. 8, pp. 812-816, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[21] T. Serrenho, and P. Bertoldi, “Smart Home and Appliances: State of The Art,” European Commision, Technical Report, pp. 1-59, 2019.
[Google Scholar]
[22] Chao-Han Huck Yang et al., “Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition,” IEEE International Conference on Acoustics, Speech and Signal Processing, Toronto, ON, Canada, pp. 6523-6527, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Roshna Muhamad M. Amin, and Miran H.M. Baban, “Implementation of Controlling Home Appliances Via Secure Short Message Service Technology,” Tikrit Journal of Engineering Science, vol. 28, no. 1, pp. 21-30, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Pete Warden, “Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition,” Arxiv, pp. 1-11, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Dragos Mocrii, Yuxiang Chen, and Petr Musilek, “Iot-Based Smart Homes: A Review of System Architecture, Software, Communications, Privacy and Security,” Internet of Things, vol. 1-2, pp. 81-98, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Marcin Woźniak, and Dawid Połap, “Intelligent Home Systems for Ubiquitous User Support by Using Neural Networks and Rule-Based Approach,” IEEE Transactions on Industrial Informatics, vol. 16, no. 4, pp. 2651-2658, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[27] Yu Xiao, and Maria Watson, “Guidance on Conducting a Systematic Literature Review,” Journal of Planning Education and Research, vol. 39, no. 1, pp. 93-112, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[28] Debajyoti Pal et al., “Analyzing the Elderly Users’ Adoption of Smart-Home Services,” IEEE Access, vol. 6, pp. 51238-51252, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Berrak Sisman et al., “An Overview of Voice Conversion and Its Challenges: From Statistical Modeling to Deep Learning,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 132-157, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[30] Mahmood S. Mahmood, and Najla B. Al Dabagh, “Improving IOT Security Using Lightweight Based Deep Learning Protection Model,” Tikrit Journal of Engineering Science, vol. 30, no. 1, pp. 119-129, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[31] Ali Bou Nassif et al., “Speech Recognition Using Deep Neural Networks: A Systematic Review,” IEEE Access, vol. 7, pp. 19143-19165, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[32] Mahmoud A. Elawi, Noori H. Noori, and Auday T.S. Al-Bayati, “Comparison Between the Relations of HpGe Detector Efficiency Curve and Background ‘Spectrum Shape’,” Tikrit Journal of Pure Science, vol. 26, no. 1, pp. 96-100, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[33] Haoye Lu, Haolong Zhang, and Amit Nayak, “A Deep Neural Network for Audio Classification with A Classifier Attention Mechanism,” Arxiv, pp. 1-15, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[34] Yishuang Ning et al., “Review of Deep Learning Based Speech Synthesis,” Applied Sciences, vol. 9, no. 19, pp. 1-16, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[35] Salima Harrat, Karima Meftouh, and Kamel Smaïli, “Maghrebi Arabic Dialect Processing: An Overview,” Journal of International Science and General Applications, vol. 1, no. 1, pp. 1-8, 2018.
[Google Scholar] [Publisher Link]
[36] Zhaojuan Song, “English Speech Recognition Based on Deep Learning with Multiple Features,” Computing, vol. 102, no. 3, pp. 663-682, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[37] Mine Büşra Gelen, and Alparslan Serhat Demir, “Selection of Information Technology Personnel for an Enterprise in the Process of Industry 4.0 with the MultiMoora Method,” Sakarya University Journal of Science, vol. 23, no. 43328, pp. 663-675, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[38] Apeksha Shewalkar, Deepika Nyavanandi, and Simone A. Ludwig, “Performance Evaluation of Deep Neural Networks Applied to Speech Recognition: RNN, LSTM and GRU,” Journal of Artificial Intelligence and Soft Computing Research, vol. 9, no. 4, pp. 235-245, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[39] Mohammad Ayache et al., “Speech Command Recognition Using Deep Learning,” 2021 Sixth International Conference on Advances in Biomedical Engineering, Werdanyeh, Lebanon, pp. 24-29, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[40] Anand Umashankar, “Large Scale Speech Recognition with Deep Learning,” Master Thesis, Aalto University, pp. 1-83, 2021.
[Google Scholar] [Publisher Link]
[41] Samer Alsammarraie, and Nazar K. Hussein, “A New Hybrid Grasshopper Optimization-Backpropagation for Feedforward Neural Network Training,” Tikrit Journal of Pure Science, vol. 25, no. 1, pp. 118-127, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[42] Tanvira Ismail, “A Survey of Language and Dialect Identification Systems,” Adalya Journal, vol. 9, no. 1, pp. 682-690, 2020.
[Google Scholar] [Publisher Link]
[43] Aalaa Ahmed Mohammed, and Yusra Faisal Al-Irhayim, “Speaker Age and Gender Estimation Based on Deep Learning Bidirectional Long-Short Term Memory (BiLSTM),” Tikrit Journal of Pure Science, vol. 26, no. 4, pp. 76-84, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[44] Dwi Sari Widyowaty, Andi Sunyoto, and Hanif Al Fatta, “Accent Recognition Using Mel-Frequency Cepstral Coefficients and Convolutional Neural Network,” Proceedings of the International Conference on Innovation in Science and Technology, pp. 43-46, 2021.
[Google Scholar] [Publisher Link]
[45] Mohammed Jawad Al Dujaili, Abbas Ebrahimi-Moghadam, and Ahmed Fatlawi, “Speech Emotion Recognition Based on SVM and KNN Classifications Fusion,” International Journal of Electrical and Computer Engineering, vol. 11, no. 2, pp. 1259-1264, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[46] Thibault Viglino, Petr Motlicek, and Milos Cernak, “End-To-End Accented Speech Recognition,” Interspeech, pp. 2140-2144, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[47] Shreyas Ramoji, and Sriram Ganapathy, “Supervised I-Vector Modeling for Language and Accent Recognition,” Computer Speech & Language, vol. 60, pp. 1-39, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[48] Samantha Wray, “Classification of Closely Related Sub-Dialects of Arabic Using Support-Vector Machines,” Proceedings of the 11th International Conference on Language Resources and Evaluation, pp. 3671-3674, 2018.
[Google Scholar] [Publisher Link]
[49] Radek Martinek et al., “Voice Communication in Noisy Environments in A Smart House Using Hybrid LMS+ICA Algorithm,” Sensors, vol. 20, no. 21, pp. 1-24, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[50] Hannah Snyder, “Literature Review as A Research Methodology: An Overview and Guidelines,” Journal of Business Research, vol. 104, pp. 333-339, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[51] Matthew J. Page et al., “Updating Guidance for Reporting Systematic Reviews: Development of the PRISMA 2020 Statement,” Journal of Clinical Epidemiology, vol. 134, pp. 103-112, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[52] Catrin Sohrabi et al., “PRISMA 2020 Statement: What’s New and the Importance of Reporting Guidelines,” International Journal of Surgery, vol. 88, pp. 39-42, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[53] Xin Huang et al., “Synthesizing Qualitative Research in Software Engineering: A Critical Review,” Proceedings of the 40th International Conference on Software Engineering, Gothenburg Sweden, pp. 1207-1218, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[54] Hayrol Azril Mohamed Shaffril, Samsul Farid Samsuddin, and Asnarulkhadi Abu Samah, “The ABC of Systematic Literature Review: The Basic Methodological Guidance for Beginners,” Quality and Quantity, vol. 55, no. 4, pp. 1319-1346, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[55] Quan Nha Hong, and Pierre Pluye, “A Conceptual Framework for Critical Appraisal in Systematic Mixed Studies Reviews,” Journal of Mixed Methods Research, vol. 13, no. 4, pp. 446-460, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[56] Bezoui Mouaz, Beni Hssane Abderrahim, and Elmoutaouakkil Abdelmajid, “Speech Recognition of Moroccan Dialect using Hidden Markov Models,” Procedia Computer Science, vol. 151, pp. 985-991, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[57] Naima Zerari et al., “Bidirectional Deep Architecture for Arabic Speech Recognition,” Open Computer Science, vol. 9, no. 1, pp. 92-102, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[58] Bilal Dendani, Halima Bahi, and Toufik Sari, “Speech Enhancement Based on Deep AutoEncoder for Remote Arabic Speech Recognition,” Image and Signal Processing: 9th International Conference, Marrakesh, Morocco, pp. 221-229, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[59] Fawaz S. Al-Anzi, “Improved Noise-Resilient Isolated Words Speech Recognition Using Piecewise Differentiation,” Fractals, vol. 30, no. 8, pp. 1-12, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[60] Lubna Eljawad et al., “Arabic Voice Recognition Using Fuzzy Logic and Neural Network,” International Journal of Applied Engineering Research, vol. 14, no. 3, pp. 651-662, 2019.
[Google Scholar] [Publisher Link]
[61] Yousef A. Alotaibi et al., “A Canonicalization of Distinctive Phonetic Features to Improve Arabic Speech Recognition,” Acta Acustica United with Acustica, vol. 105, no. 6, pp. 1269-1277, 2019.
[Google Scholar]
[62] Lotfi Boussaid, and Mohamed Hassine, “Arabic Isolated Word Recognition System Using Hybrid Feature Extraction Techniques and Neural Network,” International Journal of Speech Technology, vol. 21, pp. 29-37, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[63] Lina Tarek Benamer, and Osama A.S. Alkishriwo, “Database for Arabic Speech Commands Recognition,” 3rd Conference for Engineering Sciences and Technology, 2020.
[Google Scholar] [Publisher Link]
[64] Wafa Helali, Zied Hajaiej, and Adnane Cherif, “Arabic Corpus Implementation: Application to Speech Recognition,” 2018 International Conference on Advanced Systems and Electric Technologies, Hammamet, Tunisia, pp. 50-53, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[65] Naima Zerari et al., “Bi-Directional Recurrent End-To-End Neural Network Classifier for Spoken Arab Digit Recognition,” 2018 2nd International Conference on Natural Language and Speech Processing, Algiers, Algeria, pp. 1-6, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[66] Abdelkader Guerid, and Amrane Houacine, “Recognition of Isolated Digits Using DNN-HMM and Harmonic Noise Model,” IET Signal Processing, vol. 13, no. 2, pp. 207-214, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[67] Nourredine Oukas et al., “ArabAlg: A New Dataset for Arabic Speech Commands Recognition for Machine Learning Purposes,” International Journal of Computing and Digital Systems, vol. 15, no. 1, pp. 989-1005, 2024.
[Google Scholar]
[68] Ahmed Boumehdi, and Abdellah Yousfi, “Construction of a Database for Speech Recognition of Isolated Arabic Words,” Proceedings of the 13th International Conference on Intelligent Systems: Theories and Applications, New York, United States, pp. 1-4, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[69] Hani S. Hassan, S. Jammila Harbi, and Maisa'a Abid Ali Kodher, “RETRACTED: Arabic Command Based Human Computer Interaction,” International Conference for Modern Applications of Information and Communication Technology, Baghdad, IRAQ, vol. 1530, no. 1, pp. 1-12, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[70] Shaker K. Ali, and Zahraa M. Mahdi, “Arabic Voice System to Help Illiterate or Blind for Using Computer,” Journal of Physics: Conference Series: International Conference of Modern Applications on Information and Communication Technology, Babylon-Hilla City, Iraq, vol. 1804, pp. 1-12, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[71] Amjad Rehman Khan, “Automatic Gender Authentication from Arabic Speech Using Hybrid Learning,” Journal of Advances in Information Technology, vol. 15, no. 4, pp. 532-543, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[72] Abdulaziz Saleh Mahfoudh Ba Wazir, and Joon Huang Chuah, “Spoken Arabic Digits Recognition Using Deep Learning,” 2019 IEEE International Conference on Automatic Control and Intelligent Systems, Selangor, Malaysia, pp. 339-344, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[73] Abdelkbir Ouisaadane, and Said Safi, “A Comparative Study for Arabic Speech Recognition System in Noisy Environments,” International Journal of Speech Technology, vol. 24, pp. 761-770, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[74] Engy Ragaei Abdelmaksoud et al., “Convolutional Neural Network for Arabic Speech Recognition,” Egyptian Journal of Language Engineering, vol. 8, no. 1, pp. 27-38, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[75] Rafik Amari et al., “Deep Convolutional Neural Network for Arabic Speech Recognition,” 14th International Conference on Computational Collective Intelligence, Hammamet, Tunisia, pp. 120-134, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[76] Hiba Qasim Jaber, and Huda Abdulaali Abdulbaqi, “Real Time Arabic Speech Recognition Based on Convolution Neural Network,” Journal of Information and Optimization Sciences, vol. 42, no. 7, pp. 1657-1663, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[77] Asroni Asroni et al., “Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network,” Baghdad Science Journal, vol. 18, no. 2, pp. 925-936, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[78] Mohamed Hassine, Lotfi Boussaid, and Hassani Massaoud, “Tunisian Dialect Recognition Based on Hybrid Techniques,” International Arab Journal of Information Technology, vol. 15, no. 1, pp. 58-65, 2018.
[Google Scholar] [Publisher Link]
[79] Moner N.M. Arafa et al., “A Dataset for Speech Recognition to Support Arabic Phoneme Pronunciation,” International Journal of Image, Graphics and Signal Processing, vol. 10, no. 4, pp. 31-38, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[80] Abdulmalik A. Alasadi et al., “Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System,” Engineering, Technology & Applied Science Research, vol. 10, no. 2, pp. 5547-5553, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[81] Bilal Dendani, Halima Bahi, and Toufik Sari, “Self-Supervised Speech Enhancement for Arabic Speech Recognition in Real-World Environments,” Signal Processing, vol. 38, no. 2, pp. 349-358, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[82] Haneen Bahjat Khalafallah, Mohamed Abdel Fattah, and Ruqayya Abdulrahman, “Speech Corpus for Medina Dialect,” Journal of King Saud University-Computer and Information Sciences, vol. 36, no. 2, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[83] Zoubir Talai, Nada Kherici, and Halima Bahi, “Comparative Study of CNN Structures for Arabic Speech Recognition,” Information Systems Engineering, vol. 28, no. 2, pp. 327- 333, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[84] Mohamed Walid et al., “Real-Time Implementation of Isolated-Word Speech Recognition System on Raspberry Pi 3 Using WAT-MFCC,” International Journal of Computer Science and Network Security, vol. 19, no. 3, pp. 42-49, 2019.
[Google Scholar] [Publisher Link]
[85] Jihad Anwar Qadir, Abdulbasit K. Al-Talabani, and Hiwa A. Aziz, “Isolated Spoken Word Recognition Using One-Dimensional Convolutional Neural Network,” International Journal of Fuzzy Logic and Intelligent Systems, vol. 20, no. 4, pp. 272-277, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[86] Abdelkbir Ouisaadane, Said Safi, and Miloud Frikel, “Arabic Digits Speech Recognition and Speaker Identification in Noisy Environment Using a Hybrid Model of VQ and GMM,” TELKOMNIKA (Telecommunication Computing Electronics and Control), vol. 18, no. 4, pp. 2193-2204, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[87] Shipra J. Arora, “Dialectal Variations of Isolated Word Recognition,” INFOCOMP Journal of Computer Science, vol. 17, no. 1, pp. 38-44, 2018.
[Google Scholar] [Publisher Link]