Proposing a Hybrid Machine Learning Technique for Efficient Outlier Detection
Proposing a Hybrid Machine Learning Technique for Efficient Outlier Detection |
||
|
||
© 2024 by IJETT Journal | ||
Volume-72 Issue-11 |
||
Year of Publication : 2024 | ||
Author : Bondu Venkateswarlu, C. Jayaramulu |
||
DOI : 10.14445/22315381/IJETT-V72I11P120 |
How to Cite?
Bondu Venkateswarlu, C. Jayaramulu, "Proposing a Hybrid Machine Learning Technique for Efficient Outlier Detection," International Journal of Engineering Trends and Technology, vol. 72, no. 11, pp. 184-191, 2024. Crossref, https://doi.org/10.14445/22315381/IJETT-V72I11P120
Abstract
Finding the best results when outliers in high-dimensional data is challenging because of data imbalance and dimensionality. Several algorithms were created in an attempt to address this problem. However, they are made using either an unsupervised learning approach or a supervised learning technique, and they can discover outliers from such data. Supervised learning methods utilize training data, whereas unsupervised learning methods provide tools for finding and applying complex patterns. The central claim of this article is that it is possible to “merge two approaches to produce a hybrid and gain from both worlds.” A state-of-the-art ML system that combines both under supervision and without methods for efficiently identifying outliers is provided to assess this claim. Furthermore, the Outlier Detection Multi-Model Approach (MMA-OD) is a method proposed in this paper. This method combines the advantages of both supervised and unsupervised learning models to improve performance. Its strength is increasing the size of the feature space. Many benchmark datasets are used to assess the suggested approach. The empirical results show that MMA-OD outperforms a wide range of alternative approaches.
Keywords
Hybrid Machine Learning, Outlier detection, Framework, ROC comparison, Execution time comparison.
References
[1] Srikanth Thudumu et al., “A Comprehensive Survey of Anomaly Detection Techniques for High Dimensional Big Data,” Journal of Big Data, vol. 7, no. 1, pp. 1-30, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Xiaoliang Chen et al., “Self-Taught Anomaly Detection with Hybrid Unsupervised/Supervised Machine Learning in Optical Networks,” Journal of Lightwave Technology, vol. 37, no. 7, pp. 1742-1749, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Nicholas Jeffrey, Qing Tan, and José R. Villar, “A Hybrid Methodology for Anomaly Detection in Cyber–Physical Systems,” Neurocomputing, vol. 568, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[4] CédricPoutré, Didier Chételat, and Manuel Morales, “Deep Unsupervised Anomaly Detection in High-Frequency Markets,” The Journal of Finance and Data Science, vol. 10, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Anika Terbuch et al., “Detecting Anomalous Multivariate Time-Series via Hybrid Machine Learning,” IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1-11, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Nazmul Kabir Sikder, and Feras A. Batarseh, “Outlier Detection using AI: A Survey,” AI Assurance, pp. 231-291, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Muhammad Ali et al., “A Novel Machine Learning Approach for Detecting Outliers, Rebuilding Well Logs, and Enhancing Reservoir Characterization,” Natural Resources Research, vol. 32, pp. 1047-1066, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Kyung Sung Lee, Seong Beom Kim, and Hee-Woong Kim, “Enhanced Anomaly Detection in Manufacturing Processes Through Hybrid Deep Learning Techniques,” IEEE Access, vol. 11, pp. 93368-93380, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Yusong Cao, Chunqin Hu, and Limin Ma, “Separation of Organic Compound from Water Using Membrane Process: Hybrid Machine Learning-Based Modeling and Validation,” Case Studies in Thermal Engineering, vol. 51, pp. 1-10, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Toya Acharya, Annamalai Annamalai, and Mohamed F Chouikha, “Efficacy of CNN-Bidirectional LSTM Hybrid Model for Network-Based Anomaly Detection,” In Proceedings 2023 IEEE 13th Symposium on Computer Applications & Industrial Electronics (ISCAIE), Penang, Malaysia, pp. 348-353, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Qingyuan Cai, Peng Li, and Ruchuan Wang, “Electricity Theft Detection Based on Hybrid Random Forest and Weighted Support Vector Data Description,” International Journal of Electrical Power & Energy Systems, vol. 153, pp. 1-15, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Morteza Matinkia et al., “Prediction of Permeability from Well Logs Using a New Hybrid Machine Learning Algorithm.,” Petroleum, vol. 9, no. 1, pp. 108-123, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Jos´e Escorcia-Gutierrez et al., “Sea Turtle Foraging Algorithm with Hybrid Deep Learning-Based Intrusion Detection for the Internet of Drones Environment,” Computers and Electrical Engineering, vol. 108, pp. 1-17, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[14] David Velásquez et al., “A Hybrid Machine-Learning Ensemble for Anomaly Detection in Real-Time Industry 4.0 Systems,” IEEE Access, vol. 10, pp. 72024-72036, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Moamen Ibrahim, Khaled M. Badran, and Ahmed Esmat Hussien, “Artificial Intelligence-Based Approach for Univariate Time-Series Anomaly Detection Using Hybrid CNN-BiLSTM Model,” In Proceedings 2022 13th International Conference on Electrical Engineering (ICEENG), Cairo, Egypt, pp. 129-133, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Alona Sakhnenko et al., “Hybrid Classical-Quantum Autoencoder for Anomaly Detection,” Quantum Machine Intelligence, vol. 4, no. 27, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Liwen Zhou, Qingkui Zeng, and Bo Li, “Hybrid Anomaly Detection via Multihead Dynamic Graph Attention Networks for Multivariate Time Series,” IEEE Access, vol. 10, pp. 40967-40978, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Julia Rosenberger et al., “Extended Kernel Density Estimation for Anomaly Detection in Streaming Data,” Procedia CIRP, vol. 112, pp. 156-161, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Muhammad Azmi Umer et al., “Machine Learning for Intrusion Detection in Industrial Control Systems: Applications, Challenges, and Recommendations,” International Journal of Critical Infrastructure Protection, vol. 38, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Zirgham Ilyas et al., “A hybrid Deep Network Based Approach for Crowd Anomaly Detection,” Multimedia Tools and Applications, vol. 80, pp. 24053-24067, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Mohd. Mustaqeem and Mohd. Saqib, “Principal Component Based Support Vector Machine (PC-SVM): A Hybrid Technique for Software Defect Detection,” Cluster Computing, vol. 24, pp. 2581-2595, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[22] K. Muthamil Sudar et al., “Analysis of Intruder Detection in Big Data Analytics,” In Proceedings 2021 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, India, pp. 1-5, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Chao Liu, Zhaojun Gu, and Jialiang Wang, “A Hybrid Intrusion Detection System Based on Scalable K-Means+ Random Forest and Deep Learning,” IEEE Access, vol. 9, pp. 75729-75740, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Mehmet Necip Kurt, Yasin Yılmaz, and Xiaodong Wang, “Real-Time Nonparametric Anomaly Detection in High-Dimensional Settings,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 7, pp. 2463-2479, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Mahdi Rabbani et al., “A Hybrid Machine Learning Approach for Malicious Behaviour Detection and Recognition in Cloud Computing,” Journal of Network and Computer Applications, vol. 151, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[26] Mahsa Alirezaei, Seyed Taghi Akhavan Niaki, and Seyed Armin Akhavan Niaki, “A bi-objective Hybrid Optimization Algorithm to Reduce Noise and Data Dimension in Diabetes Diagnosis Using Support Vector Machines,” Expert Systems with Applications, vol. 127, pp. 47-57, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[27] Guo Pu et al., “A Hybrid Unsupervised Clustering-Based Anomaly Detection Method,” Tsinghua Science and Technology, vol. 26, no. 2, pp. 146-153, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[28] Akanksha Toshniwal, Kavi Maheshand, and R. Jayashree, “Overview of Anomaly Detection Techniques in Machine Learning,” In Proceedings 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Palladam, India, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Xiaokang Zhou et al., “Variational LSTM Enhanced Anomaly Detection for Industrial Big Data,” IEEE Transactions on Industrial Informatics, vol. 17, no. 5, pp. 3469-3477, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[30] Chencheng Ma, Xuehui Du, and Lifeng Cao, “Analysis of Multi-Types of Flow Features Based on Hybrid Neural Network for Improving Network Anomaly Detection,” IEEE Access, vol. 7, pp. 148363-148380, 2019.
[CrossRef] [Google Scholar] [Publisher Link]