Aspect-Based Sentiment Analysis Advancements and Applications in Code-Mixed Text and Gujarati Language Processing
Aspect-Based Sentiment Analysis Advancements and Applications in Code-Mixed Text and Gujarati Language Processing |
||
![]() |
![]() |
|
© 2025 by IJETT Journal | ||
Volume-73 Issue-9 |
||
Year of Publication : 2025 | ||
Author : Shyam Viththalani, Krunalkumar Patel | ||
DOI : 10.14445/22315381/IJETT-V73I9P130 |
How to Cite?
Shyam Viththalani, Krunalkumar Patel,"Aspect-Based Sentiment Analysis Advancements and Applications in Code-Mixed Text and Gujarati Language Processing", International Journal of Engineering Trends and Technology, vol. 73, no. 9, pp.356-368, 2025. Crossref, https://doi.org/10.14445/22315381/IJETT-V73I9P130
Abstract
Aspect-Based Sentiment Analysis (ABSA), on the other hand, gives more granular results regarding the sentiments being expressed specifically for the aspects, but this has only been limited work done for code-mixed languages or regional languages like Gujarati. This paper presents a novel approach to deal with the limitations, such as syntax mixing of code-mixed text, considering the morphological features of Gujarati. By using specific preprocessing, aspect extraction methods and classifiers for multilingual and low-resource scenarios, the proposed approach outperforms the basic solutions. Its usefulness is further demonstrated with real-world datasets, thus its applicability to social media surveillance and regional sentiment analysis has great potential for incorporating culturally sensitive natural language processing. Aspect-Based Sentiment Analysis (ABSA) is a critical area within Natural Language Processing (NLP) that enables detailed sentiment interpretation by associating opinions with specific aspects mentioned in text. While ABSA has seen substantial advancements in high-resource languages, its application to code-mixed texts and low-resource languages such as Gujarati remains relatively limited. This paper explores recent progress in ABSA, with a particular focus on two linguistically challenging domains: Hindi-English code-mixed texts and monolingual Gujarati content. This paper highlights key obstacles, including language mixing, orthographic inconsistencies, and the scarcity of annotated datasets. To tackle these challenges, the study investigates hybrid strategies that combine deep learning models (e.g., LSTM, BERT) with sentiment lexicons, along with emerging techniques such as contrastive learning and multilingual transformer architectures. Additionally, a newly developed Gujarati sentiment corpus is presented and assessed using various machine learning and lexicon-based methods for aspect-level sentiment classification. The experimental results underscore the importance of customized feature extraction, language-aware pre-processing, and ensemble approaches in enhancing ABSA performance for multilingual and low-resource settings. The study aims to broaden the scope of sentiment analysis by offering methodologies and resources tailored to underrepresented languages and code-mixed communication.
Keywords
Aspect-Based Sentiment Analysis, Sentiment Analysis, Code-Mixed Language, Gujarati Language, Multilingual NLP, Low-Resource Languages, Sentiment Classification, Aspect Extraction, Social Media Analysis.
References
[1] Neeraj Anand Sharma, A.B.M. Shawkat Ali, and Muhammad Ashad Kabir, “A Review of Sentiment Analysis: Tasks, Applications, and Deep Learning Techniques,” International Journal of Data Science and Analytics, vol. 19, no. 3, pp. 351-388, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Roshan Poudel, “Exploring Transformers for Aspect Based Sentiment Analysis,” MS Thesis, Universidade de Aveiro (Portugal), pp. 1-17, 2022.
[Google Scholar]
[3] Deena Nath, and Sanjay K. Dwivedi, “Aspect-Based Sentiment Analysis: Approaches, Applications, Challenges and Trends,” Knowledge and Information Systems, vol. 66, no. 12, pp. 7261-7303, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Gazi Imtiyaz Ahmad, and Jimmy Singla, “Sentiment Analysis of Code-Mixed Social Media Text (SA-CMSMT) in Indian-Languages,” 2021 International Conference on Computing Sciences (ICCS), Phagwara, India, pp. 25-33, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Ruhina Tabasshum Prome, Tarikul Islam Tamiti, and Anomadarshi Barua, “Leveraging the Potential of Prompt Engineering for Hate Speech Detection in Low-Resource Languages,” arXiv Preprint, pp. 1-12, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Asara Senaratne, “Anomaly Detection in Graphs for Knowledge Discovery and Data Quality Enhancement,” Thesis (PhD), The Australian National University, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Meerabahen M. Shah, and Hiren R. Kavathiya, “Development of A Model to Analyze & Interpret Vernacular Voice Recognition Of Gujarati Dialects,” Ph.D. Thesis Computer Applications, Department of Computer Science, Faculty of Science, Atmiya University, 2024.
[Google Scholar] [Publisher Link]
[8] Rasmus Kær Jørgensen, “Multilingual Natural Language Processing for Applications in the Financial Domain,” Ph.D. Thesis, University of Copenhagen, pp. 1-163, 2023.
[Publisher Link]
[9] Ranit Kumar Dey, and Asit Kumar Das, “Modified Binary Particle Swarm Optimization Based Deep Hybrid Framework for Sentiment Analysis,” International Journal of Information Technology & Decision Making, pp. 1-28, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Jinshi Wang, “Cross-lingual Transfer Learning for Low-Resource Natural Language Processing Tasks,” Master Thesis, Karlsruhe Institute of Technology, 2021.
[Google Scholar]
[11] C.S Anoop, and A.G. Ramakrishnan, “CTC-Based End-to-End ASR for the Low Resource Sanskrit Language with Spectrogram Augmentation,” 2021 National Conference on Communications (NCC), Kanpur, India, vol. 1, pp. 1-6, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Pablo Duboue, The Art of Feature Engineering: Essentials for Machine Learning, Cambridge University Press, pp. 1-284, 2020.
[CrossRef] [Google Scholakr] [Publisher Link]
[13] Ni Made Satvika Iswari, and Nunik Afriliana, “Enhancing Aspect-Based Sentiment Analysis in Tourism Reviews Through Hybrid Data Augmentation,” Journal of Applied Data Sciences, vol. 6, no. 3, pp. 2192-2206, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Sunil D. Kale et al., “A Comprehensive Review of Sentiment Analysis on Indian Regional Languages: Techniques, Challenges, and Trends,” International Journal on Recent and Innovation Trends in Computing and Communication, vol. 11, no. 9s, pp. 93-110, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Bing Liu, Introduction: Sentiment Analysis: Mining Opinions, Sentiments, and Emotions, Cambridge University Press, pp. 1-15, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Jatinderkumar R. Saini, and Saikat Roy, “Preparation of Rich Lists of Research Gaps in the Specific Sentiment Analysis Tasks of Code-Mixed Indian Languages,” SN Computer Science, vol. 5, no. 1, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Yusuf Aliyu et al., “Sentiment Analysis in Low-Resource Settings: A Comprehensive Review of Approaches, Languages, and Data Sources,” IEEE Access, vol. 12, pp. 66883-66909, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Manuel Romeo Flores III, “Assessing the Functional Significance of Wood Nutrient Resorption Along a Soil Fertility Gradient,” University of Illinois at Urbana-Champaign, pp. 1-48, 2022.
[Google Scholar] [Publisher Link]
[19] Zeena Al-Tekreeti et al., “AI-Based Visual Early Warning System,” Informatics, vol. 11, no. 3, pp. 1-26, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Prakhar Tandon et al., “Evaluating SEG for Gujarati News Clustering,” International Conference on Hybrid Intelligent Systems, pp. 324-335, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Tianyang Zhong et al., “Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research,” arXiv Preprints, pp. 1-41, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Shumaila Mughal, Arfan Jaffar, and M. Waleed Arif, “Sentiment Analysis of Social Media Data: Understanding Public Perception,” Journal of Computing & Biomedical Informatics, vol. 7, no. 02, pp. 1-12, 2024.
[Google Scholar] [Publisher Link]
[23] Ringki Das, and Thoudam Doren Singh, “Multimodal Sentiment Analysis: A Survey of Methods, Trends, and Challenges,” ACM Computing Surveys, vol. 55, no. 13s, pp. 1-38, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[24] James T. Frith, Matthew J. Lacey, and Ulderico Ulissi, “A Non-Academic Perspective on the Future of Lithium-Based Batteries,” Nature Communications, vol. 14, no. 1, pp. 1-17, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Monali Bordoloi, and Saroj Kumar Biswas, “Sentiment Analysis: A Survey on Design Framework, Applications and Future Scopes,” Artificial Intelligence Review, vol. 56, no. 11, pp. 12505-12560, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Yan Cathy Hua et al., “A Systematic Review of Aspect-Based Sentiment Analysis: Domains, Methods, and Trends,” Artificial Intelligence Review, vol. 57, no. 11, pp. 1-51, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[27] Vasileios Ballas et al., “Automating Mobile App Review User Feedback with Aspect-Based Sentiment Analysis,” International Conference on Human-Computer Interaction, pp. 179-193, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[28] Nozi Sidali, “An Assessment of the Business Innovation Model Strategy used to Enhance Information Technology by ABSA,” University of the Witswatersrand.
[Google Scholar]
[29] Clark Aldrich, Simulations and the Future of Learning: An Innovative (and Perhaps Revolutionary) Approach to E-Learning, John Wiley & Sons, pp. 1-304, 2003.
[Google Scholar]
[30] Suman Dowlagar, “A Code Mixed Dialog System in Medical Domain,” International Institute of Information Technology Hyderabad, pp. 1-133, 2023.
[Google Scholar] [Publisher Link]
[31] Myagmarsuren Orosoo et al., “Analysing Code-Mixed Text in Programming Instruction Through Machine Learning for Feature Extraction,” International Journal of Advanced Computer Science and Applications, vol. 15, no. 7, pp. 890-900, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[32] Deanna Kuhn, “Thinking as Argument,” Harvard Educational Review, vol. 62, no. 2, pp. 155-179, 1992.
[CrossRef] [Google Scholar] [Publisher Link]
[33] Monil Gokani, “Exploring Sentiment Analysis in Low-resource Languages,” Master of Science, International Institute of Information Technology, Hyderabad, 2023.
[Google Scholar]
[34] Rishabh Agrawal, “Adaptive Few-Shot Learning (AFSL): Tackling Data Scarcity with Stability, Robustness, and Versatility,” arXiv Preprint, pp. 1-8, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[35] Brijeshkumar Y. Panchal, and Apurva Shah, “NLP Research: A Historical Survey and Current Trends in Global, Indic, and Gujarati Languages,” 2024 4th International Conference on Ubiquitous Computing and Intelligent Information Systems (ICUIS), Gobichettipalayam, India, pp. 1263-1272, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[36] Albert Greenberg et al., “Towards a Next Generation Data Center Architecture: Scalability and Commoditization,” PRESTO '08: Proceedings of the ACM Workshop on Programmable Routers for Extensible Services of Tomorrow, pp. 57-62, 2008.
[CrossRef] [Google Scholar] [Publisher Link]
[37] Stanislaw P. Stawicki et al., “The 2019-2020 Novel Coronavirus (Severe Acute Respiratory Syndrome Coronavirus 2) Pandemic A Joint American College of Academic International Medicine-World Academic Council of Emergency Medicine Multidisciplinary COVID-19 Working Group Consensus Paper,” Journal of Global Infectious Diseases, vol. 12, no. 2, pp. 47-93, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[38] Josina W. Geringer et al., “Analysis of the ASME Code Rules for Subsection III-5-HHB (Composite Materials) for Current HTR Design Requirements,” Technical Report, Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States), pp. 1-37, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[39] Jun Li et al., “A Review of Remote Sensing for Environmental Monitoring in China,” Remote Sensing, vol. 12, no. 7, pp. 1-25, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[40] Raju Anitha, and K.S. Anil Kumar, “Sentiment Analysis in Low-Resource Language: Exploring BERT, mBERT, XLM-R, and RNN Architectures to Underpin Deep Language Understanding,” Journal of Nonlinear Analysis and Optimization, vol. 15, no. 2, pp. 180-189, 2024.
[Google Scholar] [Publisher Link]
[41] Jing Zhang et al., “Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective,” ACM Computing Surveys (CSUR), vol. 52, no. 1, pp. 1-38, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[42] Roland T. Rust, “The Future of Marketing,” International Journal of Research in Marketing, vol. 37, no. 1, pp. 15-26, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[43] Mohit Dua et al., “A Review on Gujarati Language based Automatic Speech Recognition (ASR) Systems,” International Journal of Speech Technology, vol. 27, no. 1, pp. 133-156, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[44] Jukka Räty, Kai-Erik Peiponen, and Toshimitsu Asakura, UV-Visible Reflection Spectroscopy of Liquids, 1st ed., Springer Science & Business Media, vol. 92, 2004.
[CrossRef] [Google Scholar] [Publisher Link]
[45] Damini Dey et al., “Artificial Intelligence in Cardiovascular Imaging: JACC State-of-the-Art Review,” Journal of the American College of Cardiology, vol. 73, no. 11, pp. 1317-1335, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[46] Sanele Gerald Khakhu, “Liberating Technologies: Automatic Speech Recognition - AI Virtual Assistant and the Future of Language in South Africa,” Master of Arts (MA), University of Johannesburg, pp. 1-82, 2023.
[Google Scholar] [Publisher Link]
[47] Soni Sweta, “Application of Sentiment Analysis in Diverse Domains,” Sentiment Analysis and its Application in Educational Data Mining, pp. 19-46, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[48] Payal Joshi, and Dhaval Joshi, “Code Mixed Information Retrieval for Gujarati Script News Articles,” International Conference on Advances in Computing and Data Sciences, Kolkata, India, pp. 265-276, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[49] Qiang Wan, Fanming Wang, and Sanhong Deng, “Optimizing Social Media Public Opinion Analysis with ABSA: A Case Study on Weibo,” International Conference on Information Management, Kolkata, India, pp. 332-343, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[50] Pavani Rangachari, A Holistic Framework of Strategies and Best Practices for Telehealth Service Design and Implementation, Service Design Practices for Healthcare Innovation, Springer, Cham, pp. 315-335, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[51] Surinder Pal Singh, and Neeraj Mangla, “A Hybrid Framework Combining Dictionary-Based Methods, BERT, and CRF for Language Identification and Normalization in Code-Mixed Hinglish,” 2025 Seventh International Conference on Computational Intelligence and Communication Technologies (CCICT), Sonepat, India, pp. 646-651, 2025.
[CrossRef] [Google Scholar] [Publisher Link]
[52] Chandrakant Patel, and Jayeshkumar Patel, “Dynamic Stop List for the Gujarati Language using Rule Based Approach,” Towards Excellence, vol. 13, no. 1, pp. 594-607, 2021.
[Google Scholar] [Publisher Link]
[53] Shraddha Kalbhor, and Dinesh Goyal, “Survey on ABSA based on Machine Learning, Deep Learning and Transfer Learning Approach,” Recent Advances in Sciences, Engineering, Information Technology & Management, vol. 2782, no. 1, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[54] Farzana Kulsoom et al., “A Review of Machine Learning-Based Human Activity Recognition for Diverse Applications,” Neural Computing and Applications, vol. 34, no. 21, pp. 18289-18324, 2022.
[CrossRef] [Google Scholar] [Publisher Link]