A Hybrid Grouped-Artificial Bee Colony Optimization (G-ABC) Technique for Feature Selection and Mean-Variance Optimization for Rule Mining

A Hybrid Grouped-Artificial Bee Colony Optimization (G-ABC) Technique for Feature Selection and Mean-Variance Optimization for Rule Mining

  IJETT-book-cover           
  
© 2023 by IJETT Journal
Volume-71 Issue-4
Year of Publication : 2023
Author : Mrinalini Rana, Omdev Dahiya
DOI : 10.14445/22315381/IJETT-V71I4P202

How to Cite?

Mrinalini Rana, Omdev Dahiya, "A Hybrid Grouped-Artificial Bee Colony Optimization (G-ABC) Technique for Feature Selection and Mean-Variance Optimization for Rule Mining," International Journal of Engineering Trends and Technology, vol. 71, no. 4, pp. 12-20, 2023. Crossref, https://doi.org/10.14445/22315381/IJETT-V71I4P202

Abstract
Data mining is a widely used method for analyzing and discovering knowledge in large data sets. It identifies the pattern hidden in the information by using mathematics and algorithms that can help discover hidden attributes. Data mining techniques are used to mine relevant patterns from large databases. It is broadly categorized into data preprocessing and mining results based on analysis outcomes. Soft computing procedures are used popularly these days for pattern predictions these days. This paper presents rule mining using Grouped - Artificial Bee Colony Optimization(G-ABC) technique for feature selection and mean-variance optimization for further rule mining. Classifiers are used to train and test the model for both feature selection and rule mining. For performing the experimental analysis of the work, Twitter and Baseball datasets were used. The proposed algorithm demonstrated the most optimized for the number of rules generated, the time required for calculation, and getting supplementary normalized information for rule mining. The best performer G-ABC with Neural Network (NN) classifier represents an average of 97.56% accuracy, a precision of 61.11, a recall of 96%, and an f-measure of 75% with G-ABC and mean-variance optimization technique with the Neural Network classifier.

Keywords
Machine learning, Artificial bee colony optimization, Feature selection, Particle swarm optimization, Mean-variance optimization, Rule mining.

References
[1] Rakesh Aggarwal, Tomasz Imielinksi, and Arun Swami, "Mining Association Rules between Sets of Items in Large Databases," ACM SIGMOD International Conference on Management of Data, vol. 22, no. 2, pp. 207-216, 1993.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Krzysztof J. Cios, Witold Pedrycz, and Roman W. Swiniarski, “Data Mining and Knowledge Discovery,” Data Mining Methods for Knowledge Discovery, Springer, vol. 458, pp. 1-26, 1998.
[CrossRef] [Google Scholar] [Publisher Link]
[3] M. V. Fidelis, H. S. Lopes, and A. A. Freitas, "Discovering Comprehensible Classification Rules with a Genetic Algorithm," Proceedings of the 2000 Congress on Evolutionary Computation, vol. 1, pp. 805-810, 2000.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Ujjwal Maulik, and Sanghamitra Bandyopadhyay, "Genetic Algorithm-Based Clustering Technique," Pattern Recognition, vol. 33, no. 9, pp.1455-1465, 2000.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Celia C. Bojarczuk, Heitor S. Lopes, and Alex A. Freitas, "Genetic Programming for Knowledge Discovery in Chest Pain Diagnosis," IEEE Engineering in Medicine and Biology Magazine, vol. 19, no. 4, pp. 38-44, 2000.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Jacinto Mata, Jose Luis Alvarez, and Jose- Cristobal Riquelme, Discovering Numeric Association Rules via Evolutionary Algorithm, Advances in Knowledge Discovery and Data Mining, PAKDD, Lecture Notes in Computer Science, Springer, vol. 2336. pp. 40–51, 2002.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Vahid Beiranvand, Mohamad Mobasher-Kashani, and Azuraliza Abu Bakar, "Multi-Objective PSO Algorithm for Mining Numerical Association Rules without a Priori Discretization," Expert Systems with Applications, vol. 41, no. 9, pp. 4259–4273, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[8] N.Deexit and A.Surya Prakasa Rao, "Solving Economic Load Dispatch Problem using Particle Swarm Optimization and Artificial Bee Colony Optimization Algorithms," SSRG International Journal of Electrical and Electronics Engineering, vol. 5, no. 7, pp. 5-8, 2018.
[CrossRef] [Publisher Link]
[9] Fateme Moslehi, and Abdorrahman Haeri, "A Novel Hybrid Wrapper–Filter Approach Based on Genetic Algorithm, Particle Swarm Optimization for Feature Subset Selection," Journal of Ambient Intelligence and Humanized Computing, vol. 11, no. 3, pp.1105-1127, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Qiumei Pu et al., "An Efficient Hybrid Approach Based on PSO, ABC, and k-means for Cluster Analysis," Multimedia Tools and Applications, vol. 81, pp.19321-19339, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Jitendra Agrawal et al., "SET-PSO-based Approach for Mining Positive and Negative Association Rules," Knowledge and Information Systems, Springer, vol. 45, pp. 453–47, 2015.
[CrossRef] [Google Scholar] [Publisher Link]
[12] M. Karunyalakshmi, and N. Tajunisha, "Classification of Cancer Datasets Using Artificial Bee Colony and Deep Feed-Forward Neural Networks," International Journal of Advanced Research in Computer and Communication Engineering, vol. 6, no. 2, pp. 33–41, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Beatriz de la Lglesia et al., "Data Mining Rules Using Multi-Objective Evolutionary Algorithms," Congress on Evolutionary Computation, pp. 1552-1559, 2003.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Chad Creighton, and Samir Hanash, "Mining Gene Expression Databases for Association Rules," Bioinformatics, vol. 19, no. 1, pp.79-86, 2003.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Ashish Ghosh, and Bhabesh Nath, "Multi-Objective Rule Mining using Genetic Algorithms," Information Sciences, vol.163, no. 1-3, pp.123–133, 2004.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Vanam Satyanarayana, and Vairavasamy Jayasankar, "Advanced Modeling and Optimization of Hybrid Renewable Energy Management Strategy Based on Artificial Bee Colony Algorithm in Micro Grid," International Journal of Engineering Trends and Technology, vol. 70, no. 10, pp. 329-338, 2022.
[CrossRef] [Publisher Link]
[17] Bilal Alatas, and Erhan Akin, "An Efficient Genetic Algorithm for Automated Mining of both Positive and Negative Quantitative Association Rules," Soft Computing, Springer, vol. 10, pp. 230–237, 2006.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Daniela Zaharie, Diana Lungeanu, and Flavia Zamfirache, "Influence of Missing Values Handling on classification Rules Evolved from Medical Data," 8th Industrial Conference on Data Mining ICDM, pp. 86-95, 2008.
[Google Scholar] [Publisher Link]
[19] S. Meera, and C. Sundar, "A Hybrid Metaheuristic Approach for Efficient Feature Selection Methods in Big Data," Journal of Ambient Intelligence and Humanized Computing, vol. 12, pp. 3743-3751, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Satish S. Banait et al., "Literature Review: An Efficient Clustering Approach to Big Data," International Journal of Computer Trends and Technology, vol. 71, no. 2, pp. 25-31, 2023.
[CrossRef] [Publisher Link]
[21] Mrinalini Rana, and Jimmy Singla, "A Framework for Selecting Features using Various Soft Computing Algorithms," 6th International Conference on Trends in Electronics and Informatics (ICOEI), pp. 1743-1748, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Salam Salameh Shreem et al., "Enhanced Binary Genetic Algorithm as a Feature Selection to Predict Student Performance," Soft Computing, vol. 26, no. 4, pp.1811-1823, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Omdev Dahiya, and Kamna Solanki, "An Efficient Requirement-Based Test Case Prioritization Technique Using Optimized TFC-SVM Approach," International Journal of Engineering Trends and Technology, vol. 69, no. 1, pp. 5-16, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[24] C.Ambika, M.Karnan, and R.Sivakumar, "Resolving Dynamic Shortest Path Routing Problems in Mobile Adhoc Networks Using ABCAnd ACO," International Journal of Computer & organization Trends, vol. 3, no. 1, pp. 58-62, 2013.
[Google Scholar] [Publisher Link]
[25] Mrinalini Rana, and Jimmy Singla, "A Comprehensive Review of Data Mining Rules Generation Algorithms," Turkish Online Journal of Qualitative Inquiry, vol. 12, no. 6, pp. 5269-5276, 2021.
[Google Scholar] [Publisher Link]
[26] Mrinalini Rana, and Jimmy Singla, "A Systematic Review on Data Mining Rules Generation Optimizing via Genetic Algorithm," International Conference on Innovative Computing & Communications (ICICC), 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[27] R. J. Kuo, Monalisa Gosumolo, and Ferani E. Zulvia, "Multi-Objective Particle Swarm Optimization Algorithm Using Adaptive Archive Grid for Numerical Association Rule Mining," Neural Computing and Applications, vol. 31, no. 8, pp. 3559-3572, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[28] Shady Shehata, Fakhri Karray, and Mohamed Kamel, "Enhancing Text Clustering using Concept-based Mining Model," Sixth International Conference on Data Mining (ICDM'06), pp. 1043-1048, 2006.
[CrossRef] [Google Scholar] [Publisher Link]
[29] Senthil Murugan Nagarajan et al., "Feature Selection Model for Healthcare Analysis and Classification Using Classifier Ensemble Technique," International Journal of System Assurance Engineering and Management, pp. 1-12, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[30] Omdev Dahiya, and Kamna Solanki, "An Efficient APHT Technique for Requirement-Based Test Case Prioritization," International Journal of Engineering Trends and Technology, vol. 69, no. 4, pp. 215-227, 2021.
[CrossRef] [Google Scholar] [Publisher Link]