A Literature Review of Machine Learning Techniques for Cyberbullying Detection in Arabic Social Media Text

A Literature Review of Machine Learning Techniques for Cyberbullying Detection in Arabic Social Media Text

  IJETT-book-cover           
  
© 2025 by IJETT Journal
Volume-73 Issue-3
Year of Publication : 2025
Author : Bader Azi Alanazi, Chin-Teng Lin
DOI : 10.14445/22315381/IJETT-V73I3P106

How to Cite?
Bader Azi Alanazi, Chin-Teng Lin, "A Literature Review of Machine Learning Techniques for Cyberbullying Detection in Arabic Social Media Text," International Journal of Engineering Trends and Technology, vol. 73, no. 3, pp. 73-94, 2025. Crossref, https://doi.org/10.14445/22315381/IJETT-V73I3P106

Abstract
Social media, including platforms such as X (known as Twitter), have become an integral part of our daily lives, serving as a primary means of communication for people globally. Social media are widely used to share thoughts, viewpoints, and critiques. However, the accessibility of these platforms can sometimes lead to misuse, paving the way for cyberbullying–a damaging form of online harassment. Although much research has been conducted on cyberbullying detection in the English language, there is a noticeable research gap regarding the Arabic language. Spotting cyberbullying in Arabic posts on X (Twitter) can help make the platform safer and friendlier for Arabic users while highlighting the harm it inflicts. Cyberbullying instances can be identified and categorized using tools such as Natural Language Processing (NLP) and machine learning algorithms. This paper reviews studies that leverage Machine Learning to identify instances of cyberbullying in Arabic.

Keywords
Machine Learning, Cyberbullying, Arabic text, Twitter, Natural language processing.

References
[1] Saudi Arabia Social Media Statistics, Global Media Insight, 2023. [Online]. Available: https://www.globalmediainsight.com/blog/saudi-arabia-social-media-statistics/#KSA_Social_Media_Statistics_2023_Top_Picks
[2] Ditch the Label, Cyberbullying Statistics: What They Tell Us, 2017. [Online]. Available: https://mx.ditchthelabel.org/cyber-bullying-statistics-what-they-tell-us
[3] Raju Kumar, and Aruna Bhat, “A Study of Machine Learning-Based Models for Detection, Control, and Mitigation of Cyberbullying in Online Social Media,” International Journal of Information Security, vol. 21, no. 6, pp. 1409-1431, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[4] A.K. Jaithunbi et al., “Detecting Twitter Cyberbullying Using Machine Learning,” Annals of the Romanian Society for Cell Biology, vol. 25, no. 4, pp. 16307-16315, 2021.
[Google Scholar] [Publisher Link]
[5] Monirah Abdullah Al-Ajlan, and Mourad Ykhlef, “Deep Learning Algorithm for Cyberbullying Detection,” International Journal of Advanced Computer Science and Applications, vol. 9, no. 9, pp. 199-205, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Asif Ahmad Khan, and Aruna Bhat, “A Study on Automatic Detection of Cyberbullying Using Machine Learning,” 6th International Conference on Intelligent Computing and Control Systems, Madurai, India, pp. 1167-1174, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Vimala Balakrishnan, Shahzaib Khan, and Hamid R. Arabnia, “Improving Cyberbullying Detection Using Twitter Users’ Psychological Features and Machine Learning,” Computers and Security, vol. 90, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Alanoud Mohammed Alduailaj, and Aymen Belghith, “Detecting Arabic Cyberbullying Tweets Using Machine Learning,” Machine Learning and Knowledge Extraction, vol. 5, no. 1, pp. 29-42, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Batoul Haidar, Maroun Chamoun, and Ahmed Serhrouchni, “Multilingual Cyberbullying Detection System: Detecting Cyberbullying in Arabic Content,” 2017 1st Cyber Security in Networking Conference, Rio de Janeiro, Brazil, pp. 1-8, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Azalden Alakrot, Liam Murray, and Nikola S. Nikolov, “Towards Accurate Detection of Offensive Language in Online Communication in Arabic,” Procedia Computer Science, vol. 142, pp. 315-320, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Djedjiga Mouheb et al., “Detection of Arabic Cyberbullying on Social Networks using Machine Learning,” 16th International Conference on Computer Systems and Applications, Abu Dhabi, United Arab Emirates, pp. 1-5, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Deema Alghamdi et al., “Automatic Detection of Cyberbullying and Threatening in Saudi Tweets using Machine Learning,” International Journal of Advanced and Applied Sciences, vol. 8, no. 10, pp. 17-25, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Pariwat Ongsulee, “Artificial Intelligence, Machine Learning and Deep Learning,” 15th International Conference on ICT and Knowledge Engineering, Bangkok, Thailand, pp. 1-6, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Kamran Kowsari et al., “Text Classification Algorithms: A Survey,” Information, vol. 10, no. 4, pp. 1-68, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[15] M. Ikonomakis, S. Kotsiantis, and V. Tampakas, “Text Classification using Machine Learning Techniques,” WSEAS Transactions on Computers, vol. 4, no. 8, pp. 966-974, 2005
[Google Scholar] [Publisher Link]
[16] Charu C. Aggarwal, and ChengXiang Zhai, “A Survey of Text Classification Algorithms,” Mining Text Data, Springer, Boston, USA, pp. 163-222, 2012.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Cynthia Van Hee et al., “Automatic Detection of Cyberbullying in Social Media Text,” PLOS One, vol. 13, no. 10, pp. 1-22, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[18] John Hani et al., “Social Media Cyberbullying Detection Using Machine Learning,” International Journal of Advanced Computer Science and Applications, vol. 10, no. 5, pp. 703-707, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Taeho Jo, Machine Learning Foundations Supervised, Unsupervised, and Advanced Learning, Springer International Publishing, pp. 1-391, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[20] M.A. Al-Garadi, K.D. Varathan, and S.D. Ravana, “Cybercrime Detection in Online Communications: The Experimental Case of Cyberbullying Detection in the Twitter Network,” Computers in Human Behavior, vol. 63, pp. 433-443, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Michele Di Capua, Emanuel Di Nardo, and Alfredo Petrosino, “Unsupervised Cyber Bullying Detection in Social Networks,” IEEE 23rd International Conference on Pattern Recognition, Cancun, Mexico, pp. 432-437, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Xiaowei Gu, A Self-Training Hierarchical Prototype-Based Approach for Semi-Supervised Classification, Information Sciences, vol. 535, pp. 204-224, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Vinita Nahar et al., “Semi-Supervised Learning for Cyberbullying Detection in Social Networks,” Proceedings Databases Theory and Applications 25th Australasian Database Conference, Australia, pp. 160-171, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Yann Le Cun, Yoshua Bengio, and Geoffrey Hinton, “Deep Learning,” Nature, vol. 521, pp. 436-444, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Shervin Minaee et al., “Deep Learning-Based Text Classification: A Comprehensive Review,” ACM Computing Surveys (CSUR), vol. 54, no. 3, pp. 1-40, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Celestine Iwendi et al., “Cyberbullying Detection Solutions Based on Deep Learning Architectures,” Multimedia Systems, vol. 29, pp. 1839-1852, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[27] Monirah A. Al-Ajlan, and Mourad Ykhlef, “Optimized Twitter Cyberbullying Detection Based on Deep Learning,” IEEE 21st Saudi Computer Society National Computer Conference, Riyadh, Saudi Arabia, pp. 1-5, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[28] K.G. Apoorva, and D. Uma, “Detection of Cyberbullying Using Machine Learning and Deep Learning Algorithms,” IEEE 2nd Asian Conference on Innovation in Technology, Ravet, India, pp. 1-7, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Jalal Omer Atoum, “Cyberbullying Detection Neural Networks using Sentiment Analysis,” IEEE International Conference on Computational Science and Computational Intelligence, Las Vegas, NV, USA, pp. 158-164, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[30] Roman Egger, and Enes Gokce, Natural Language Processing (NLP): An Introduction, Applied Data Science in Tourism Interdisciplinary Approaches, Methodologies, and Applications, Springer, Cham, pp. 307-334, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[31] K.R. Chowdhary, Natural Language Processing, Fundamentals of Artificial Intelligence, Springer, New Delhi, pp. 603-649, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[32] Dipanjan Sarkar, Text Analytics with Python, A Practitioner's Guide to Natural Language Processing, Apress Berkeley, CA, pp. 1-674, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[33] Elizabeth D. Liddy, Natural Language Processing, In Encyclopedia of Library and Information Science, 2nd ed., Marcel Decker, NY, pp. 1-16, 2001.
[Publisher Link]
[34] Yue Kang et al., “Natural Language Processing (NLP) in Management Research: A Literature Review,” Journal of Management Analytics, vol. 7, no. 2, pp. 139-172, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[35] Muhammad Abdul-Mageed, and Mona Diab, “AWATIF: A Multi-Genre Corpus for Modern Standard Arabic Subjectivity and Sentiment Analysis,” Proceedings of the Eighth International Conference on Language Resources and Evaluation, Istanbul, Turkey, pp. 3907-3914, 2012.
[Google Scholar] [Publisher Link]
[36] Hossam S. Ibrahim, Sherif M. Abdou, and Mervat Gheith, “Sentiment Analysis for Modern Standard Arabic and Colloquial,” arXiv, pp. 95-109, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[37] Kenneth R. Beesley, “Finite-State Morphological Analysis and Generation of Arabic at Xerox Research: Status and Plans in 2001,” ACL Workshop on Arabic Language Processing: Status and Perspective, vol. 1, pp. 1-8, 2001.
[Google Scholar]
[38] Tim Buckwalter, “Issues in Arabic Orthography and Morphology Analysis,” Proceedings of the Workshop on Computational Approaches to Arabic Script-Based Languages, Geneva, Switzerland, pp. 31-34, 2004.
[Google Scholar] [Publisher Link]
[39] Ali Farghaly, and Khaled Shaalan, “Arabic Natural Language Processing: Challenges and Solutions,” ACM Transactions on Asian Language Information Processing, vol. 8, no. 4, pp. 1-22, 2009.
[CrossRef] [Google Scholar] [Publisher Link]
[40] Khaled Shaalan, “Rule-Based Approach in Arabic Natural Language Processing,” International Journal on Information and Communication Technologies, vol. 3, no. 3, pp. 11-19, 2010.
[Google Scholar] [Publisher Link]
[41] Mohamed Elmahdy et al., “Survey on Common Arabic Language Forms from a Speech Recognition Point of View,” Proceeding of International conference on Acoustics (NAG-DAGA), pp. 63-66, 2009.
[Google Scholar] [Publisher Link]
[42] Khaled Shaalam, and Nizar Y. Habash, “Introduction to Arabic Natural Language Processing (Synthesis Lectures on Human Language Technologies),” Machine Translation, vol. 24, no. 3-4, pp. 285-289, 2010.
[CrossRef] [Google Scholar] [Publisher Link]
[43] Kareem Darwish et al., “A Panoramic Survey of Natural Language Processing in the Arab World,” Communications of the ACM, vol. 64, no. 4, pp. 72-81, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[44] Mohamed Abd Elaziz et al., Recent Advances in NLP: The Case of Arabic Language, Springer Cham, pp. 1-209, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[45] Sourabh Parime, and Vaibhav Suri, “Cyberbullying Detection and Prevention: Data Mining and Psychological Perspective,” IEEE International Conference on Circuits, Power and Computing Technologies, Nagercoil, India, pp. 1541-1547, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[46] Vikas S. Chavan, and S.S. Shylaja, “Machine Learning Approach for Detection of Cyber-Aggressive Comments by Peers on Social Media Network,” IEEE International Conference on Advances in Computing, Communications and Informatics, Kochi, India, pp. 2354-2358, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[47] Marilyn Campbell, and Sheri Bauman, Cyberbullying: Definition, Consequences, Prevalence, Reducing Cyberbullying in Schools International Evidence-Based Best Practices, pp. 3-16, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[48] Jaana Juvonen, and Elisheva F. Gross, “Extending the School Grounds?-Bullying Experiences in Cyberspace,” Journal of School Health, vol. 78, no. 9, pp. 496-505, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[49] Andreas König, Mario Gollwitzer, and Georges Steffgen, “Cyberbullying as An Act of Revenge?,” Journal of Psychologists and Counsellors in Schools, vol. 20, no. 2, pp. 210-224, 2010.
[CrossRef] [Google Scholar] [Publisher Link]
[50] Peter K. Smith et al., “Cyberbullying: Its Nature and Impact in Secondary School Pupils,” The Journal of Child Psychology and Psychiatry, vol. 49, no. 4, pp. 376-385, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[51] Amrita Mangaonkar, Allenoush Hayrapetian, and Rajeev Raje, “Collaborative Detection of Cyberbullying Behavior in Twitter Data,” IEEE International Conference on Electro Information Technology, Dekalb, IL, USA, pp. 611-616, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[52] Batoul Haidar, Maroun Chamoun, and Fadi Yamout, “Cyberbullying Detection: A Survey on Multilingual Techniques,” IEEE European Modelling Symposium, Pisa, Italy, pp. 165-171, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[53] Samaneh Nadali et al., “A Review of Cyberbullying Detection: An Overview,” IEEE 13th International Conference on Intelligent Systems Design and Applications, Salangor, Malaysia, pp. 325-330, 2013.
[CrossRef] [Google Scholar] [Publisher Link]
[54] Walisa Romsaiyud et al., “Automated Cyberbullying Detection using Clustering Appearance Patterns,” IEEE 9th International Conference on Knowledge and Smart Technology, Chonburi, Thailand, pp. 242-247, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[55] Norulzahrah Mohd Zainudin et al., “A Review on Cyberbullying in Malaysia from Digital Forensic Perspective,” IEEE International Conference on Information and Communication Technology, Kuala Lumpur, Malaysia, pp. 246-250, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[56] Deborah Goebert et al., “The Impact of Cyberbullying on Substance Use and Mental Health in a Multiethnic Sample,” Maternal and Child Health Journal, vol. 15, pp. 1282-1286, 2011.
[CrossRef] [Google Scholar] [Publisher Link]
[57] Tanya Beran, and Qing Li, “The Relationship Between Cyberbullying and School Bullying,” The Journal of Student Wellbeing, vol. 1, no. 2, pp. 16-33, 2007.
[CrossRef] [Google Scholar] [Publisher Link]
[58] Justin W. Patchin, and Sameer Hinduja, Summary of Our Cyberbullying Research (2007-2023), 2024. [Online]. Available: https://cyberbullying.org/summary-of-our-cyberbullying-research
[59] Ana M. Giménez Gualdo et al., “The Emotional Impact of Cyberbullying: Differences in Perceptions and Experiences as A Function of Role,” Computers and Education, vol. 82, pp. 228-235, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[60] Victoria Brown, Elizabeth Clery, and Christopher Ferguson, Estimating the Prevalence of Young People Absent from School Due to Bullying, National Centre for Social Research, 2011.
[Google Scholar] [Publisher Link]
[61] Zachary Munn et al., “Systematic Review or Scoping Review? Guidance for Authors When Choosing Between a Systematic or Scoping Review Approach,” BMC Medical Research Methodology, vol. 18, no. 1, pp. 1-7, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[62] Rebecca Dredge, John F.M. Gleeson, and Xochitl de la Piedad Garcia, “Risk Factors Associated with Impact Severity of Cyberbullying Victimization: A Qualitative Study of Adolescent Online Social Networking,” Cyberpsychology, Behavior, and Social Networking, vol. 17, no. 5, pp. 287-291, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[63] Robin M. Kowalski et al., “Bullying in the Digital Age: A Critical Review and Meta-Analysis of Cyberbullying Research among Youth,” Psychological Bulletin, vol. 140, no. 4, pp. 1073-1137, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[64] Sameer Hinduja, and Justin W. Patchin, “Bullying, Cyberbullying, and Suicide,” Archives of Suicide Research, vol. 14, no. 3, pp. 206-221, 2010.
[CrossRef] [Google Scholar] [Publisher Link]
[65] Justin W. Patchin, and Sameer Hinduja, “Measuring Cyberbullying: Implications for Research,” Aggression and Violent Behavior, vol. 23, pp. 69-74, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[66] Brian Dean, X (Twitter) Statistics: How Many People Use X?, 2025. [Online]. Available: https://backlinko.com/twitter-users#twitter-daily-active-users
[67] Homa Hosseinmard et al., “Detection of Cyberbullying Incidents on the Instagram Social Network,” arXiv, pp. 1-9, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[68] Rahul Ramesh Dalvi, Sudhanshu Baliram Chavan, and Aparna Halbe, “Detecting a Twitter Cyberbullying using Machine Learning,” IEEE 4th International Conference on Intelligent Computing and Control Systems, Madurai, India, pp. 297-301, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[69] Amgad Muneer, and Suliman Mohamed Fati, “A Comparative Analysis of Machine Learning Techniques for Cyberbullying Detection on Twitter,” Future Internet, vol. 12, no. 11, pp. 1-20, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[70] Jalal Omer Atoum, “Cyberbullying Detection through Sentiment Analysis,” IEEE International Conference on Computational Science and Computational Intelligence, Las Vegas, USA, pp. 292-297, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[71] Varun Jain et al., “Detection of Cyberbullying on Social Media Using Machine Learning,” IEEE 5th International Conference on Computing Methodologies and Communication, Erode, India, pp. 1091-1096, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[72] Munif Alotaibi, Bandar Alotaibi, and Abdul Razaque, “A Multichannel Deep Learning Framework for Cyberbullying Detection on Social Media,” Electronics, vol. 10, no. 21, pp. 1-14, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[73] Umair Khan et al., “Aggression Detection in Social Media from Textual Data Using Deep Learning Models,” Applied Sciences, vol. 12, no. 10, pp. 1-16, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[74] Batoul Haidar, Maroun Chamoun, and Ahmed Serhrouchni, “Arabic Cyberbullying Detection: Using Deep Learning,” IEEE 7th International Conference on Computer and Communication Engineering, Kuala Lumpur, Malaysia, pp. 284-289, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[75] Benaissa Azzeddine Rachid, Harbaoui Azza, and Hajjami Henda Ben Ghezala, “Classification of Cyberbullying Text in Arabic,” IEEE International Joint Conference on Neural Networks, Glasgow, UK, pp. 1-7, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[76] Reem Albayari, Sherief Abdallah, and Khaled Shaalan, “Cyberbullying Detection Model for Arabic Text Using Deep Learning,” Journal of Information and Knowledge Management, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[77] Selma Ayşe Özel et al., “Detection of Cyberbullying on Social Media Messages in Turkish,” IEEE International Conference on Computer Science and Engineering, Antalya, Turkey, pp. 366-370, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[78] Rohit Pawar, and Rajeev R. Raje, “Multilingual Cyberbullying Detection System,” IEEE International Conference on Electro Information Technology, Brookings, SD, USA, pp. 40-44, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[79] Alican Bozyiğit, Semih Utku, and Efendi Nasibov, “Cyberbullying Detection: Utilizing Social Media Features,” Expert Systems with Applications, vol. 179, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[80] Sara Khan, and Amna Qureshi, “Cyberbullying Detection in Urdu Language Using Machine Learning,” IEEE International Conference on Emerging Trends in Electrical, Control, and Telecommunication Engineering, Lahore, Pakistan, pp. 1-6, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[81] Srabon Bhowmik Shanto, Mohammed Jahirul Islam, and Md. Abdus Samad, “Cyberbullying Detection using Deep Learning Techniques on Bangla Facebook Comments,” IEEE International Conference on Intelligent Systems, Advanced Computing and Communication, Silchar, India, pp. 1-7, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[82] Nizar Y. Habash, Introduction to Arabic Natural Language Processing, Springer Cham, pp. 1-170, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[83] Abdulrahman Ahmed Alzand, and Rosziati Ibrahim, “Diacritics of Arabic Natural Language Processing (ANLP) and Its Quality Assessment,” IEEE International Conference on Industrial Engineering and Operations Management, Dubai, United Arab Emirates, pp. 1-5, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[84] Arafat Awajan, “Arabic Text Preprocessing for the Natural Language Processing Applications,” Arab Gulf Journal of Scientific Research, vol. 25, no. 4, pp. 179-189, 2007.
[Google Scholar] [Publisher Link]
[85] Bandeh Ali Talpur, and Declan O’Sullivan, “Cyberbullying Severity Detection: A Machine Learning Approach,” PLOS One, vol. 15, no. 10, pp. 1-19, 2020.
[CrossRef] [Google Scholar] [Publisher Link]