Heart Disease Prediction based on Ensemble Classification Model with Tuned Training Weights

Heart Disease Prediction based on Ensemble Classification Model with Tuned Training Weights

© 2022 by IJETT Journal
Volume-70 Issue-4
Year of Publication : 2022
Authors : Parvathaneni Rajendra Kumar, Suban Ravichandran, S. Narayana
DOI :  10.14445/22315381/IJETT-V70I4P206

How to Cite?

Parvathaneni Rajendra Kumar, Suban Ravichandran, S. Narayana, "Heart Disease Prediction based on Ensemble Classification Model with Tuned Training Weights," International Journal of Engineering Trends and Technology, vol. 70, no. 4, pp. 59-81, 2022. Crossref, https://doi.org/10.14445/22315381/IJETT-V70I4P206

Heart disease (HD) is the most serious human disease, causing havoc on people`s health. Heart disease detection must be accurate and timely to prevent and cure heart failure. In many instances, the diagnosis of HD based on standard medical history is seen as unreliable. Therefore, this paper introduces a novel HD prediction system that includes five major phases such as (a) Preprocessing, (b) Imbalance processing, (c)Feature extraction, (d) Feature Selection and (e) Classification. Originally, the input data is given to the preprocessing phase. Subsequently, the imbalance processing phase is carried out, where an improved strategy for the class imbalance process is performed. The features, including raw features, improved mutual information, higher-order statistical features, entropy, correlation, and statistical features, are extracted in the feature extraction phase. Moreover, appropriate features will be selected from the extracted features in the feature selection phase, for which an improved ReliefF process will be carried out. These selected features is then subjected to the classification phase, where the ensemble classifiers include Neural Network (NN), Recurrent Neural Network (RNN), Random Forest (RF), and K-Nearest Neighbour (k-NN) model. Here, the output of NN, RNN, and RF is given as the input of k-NN. To make the system more precise in disease prediction, the weights of NN and RNN are optimally tuned by a Self-improved Shark Smell Optimization with Gaussmap Estimation and Cycle crossover Operation (SISSGECO) model. Then, the final output is obtained effectively in a precise manner. Finally, the outcomes of the adopted scheme are computed to the other extant schemes in terms of various measures like precision, sensitivity, accuracy, specificity, NPV, MCC, FPR, F1-score, and FNR, respectively.

Heart Disease Prediction, Imbalance Processing, Improved ReliefF, Ensemble Classifiers, Optimization.

[1] Farman AliShaker El-SappaghKyung-Sup Kwak, A Smart Healthcare Monitoring System for Heart Disease Prediction Based on Ensemble Deep Learning and Feature Fusion, Information Fusion. 63 (2020) 208-222.
[2] R. ValarmathiT. Sheela, Heart Disease Prediction Using Hyper Parameter Optimization (HPO) Tuning, Biomedical Signal Processing and Control. 70 (2021) 103033.
[3] Rani, P., Kumar, R., Ahmed, NMOS et al., A Decision Support System for Heart Disease Prediction Based Upon Machine Learning. J Reliable Intell Environ. 7 (2021) 263–275. https://doi.org/10.1007/s40860-021-00133-6
[4] Harika N, Swamy S.R., & Nilima, Artificial Intelligence-Based Ensemble Model for Rapid Prediction of Heart Disease, SN COMPUT. SCI. 2 (2021) 431. https://doi.org/10.1007/s42979-021-00829-9
[5] Prakash S, Sangeetha K, & Ramkumar N, An Optimal Criterion Feature Selection Method for Prediction and Effective Analysis of Heart Disease. Cluster Comput. 22 (2019) 11957–11963. https://doi.org/10.1007/s10586-017-1530-z.
[6] Renji P. CherianNoby ThomasSunder Venkitachalam, Weight Optimized Neural Network for Heart Disease Prediction Using Hybrid Lion Plus Particle Swarm Algorithm, Journal of Biomedical Informatics. 110 (2020) 103543.
[7] D. Shiny IreneT. SethukarasiN. Vadivelan, Heart Disease Prediction Using Hybrid Fuzzy K-Medoids Attribute Weighting Method with DBN-KELM Based Regression Model Medical Hypotheses. 143 (2020) 8.
[8] liyaShailendra Kumar ShrivastavaVivek Sharma, An Optimized Xgboost Based Diagnostic System for Effective Prediction of Heart Disease, Journal of King Saud University - Computer and Information Sciences. (2020).
[9] Ibomoiye Domor MienyeYanxia SunZenghui Wan, An Improved Ensemble Learning Approach for the Prediction of Heart Disease Risk, Informatics in Medicine. 20 (2020) 100402.
[10] C. Beulah Christalin LathaS. Carolin Jeeva, Improving the Accuracy of Prediction of Heart Disease Risk Based on Ensemble Classification Techniques, Informatics in Medicine. 16 (2019) 100203
[11] Md Mamun AliBikash Kumar PaulMohammad Ali Moni, Heart Disease Prediction Using Supervised Machine Learning Algorithms: Performance Analysis and Comparison, Computers in Biology and Medicine.136 (2021) 104672.
[12] Ibomoiye Domor MienyeYanxia SunZenghui Wang, Improved Sparse Autoencoder Based Artificial Neural Network Approach for Prediction of Heart Disease, Informatics in Medicine. 18 (2020) 100307.
[13] Zafer Al-MakhadmehAmr Tolba, Utilizing IoT Wearable Medical Device for Heart Disease Prediction Using Higher-Order Boltzmann Model: A Classification Approach, Measurement. 147 (2019) 106815.
[14] Aniruddha DuttaTamal BatabyalScott T. Acton, An Efficient Convolutional Neural Network for Coronary Heart Disease Prediction Expert Systems with Applications. 159 (2020) 113408.
[15] S. Mohan, C. Thirumalai and G. Srivastava, Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques, IEEE Access. 7 (2019) 81542-81554. doi: 10.1109/ACCESS.2019.2923707.
[16] N. L. Fitriyani, M. Syafrudin, G. Alfian and J. Rhee, HDPM: An Effective Heart Disease Prediction Model for a Clinical Decision Support System, IEEE Access. 8 (2020) 133034-133050. doi: 10.1109/ACCESS.2020.3010511.
[17] Y. Pan, M. Fu, B. Cheng, X. Tao and J. Guo, Enhanced Deep Learning Assisted Convolutional Neural Network for Heart Disease Prediction on the Internet of Medical Things Platform, IEEE Access. 8 (2020) 189503-189512. doi: 10.1109/ACCESS.2020.3026214.
[18] S. A. Ali et al., An Optimally Configured and Improved Deep Belief Network (OCI-DBN) Approach for Heart Disease Prediction Based on Ruzzo–Tompa and Stacked Genetic Algorithm, in IEEE Access. 8 (2020) 65947-65958. doi: 10.1109/ACCESS.2020.2985646.
[19] B. Wang et al., A Multi-Task Neural Network Architecture for Renal Dysfunction Prediction in Heart Failure Patients with Electronic Health Records, IEEE Access. 7 (2019) 178392-178400. doi: 10.1109/ACCESS.2019.2956859.
[20] S. S. Sarmah, An Efficient IoT-Based Patient Monitoring and Heart Disease Prediction System Using Deep Learning Modified Neural Network, IEEE Access. 8 (2020) 135784-135797. doi: 10.1109/ACCESS.2020.3007561.
[21] D. Bertsimas, L. Mingardi and B. Stellato, Machine Learning for Real-Time Heart Disease Prediction, IEEE Journal of Biomedical and Health Informatics. 25(9) (2021) 3627-3637. doi: 10.1109/JBHI.2021.3066347.
[22] M. A. Khan, An IoT Framework for Heart Disease Prediction Based on MDCNN Classifier, IEEE Access. 8 (2020) 34717-34727. doi: 10.1109/ACCESS.2020.2974687.
[23] Javeed, S. Zhou, L. Yongjian, I. Qasim, A. Noor and R. Nour, An Intelligent Learning System Based on Random Search Algorithm and Optimized Random Forest Model for Improved Heart Disease Detection, IEEE Access. 7 (2019) 180235-180243. doi: 10.1109/ACCESS.2019.2952107.
[24] Aggrawal, R., Pal, S. Sequential Feature Selection and Machine Learning Algorithm-Based Patient’s Death Events Prediction and Diagnosis in Heart Disease. SN COMPUT. SCI. 1 (2020) 344. https://doi.org/10.1007/s42979-020-00370-1.
[25] Ripan, R.C., Sarker, I.H., Hossain, S.M.M. et al. A Data-Driven Heart Disease Prediction Model Through K-Means Clustering-Based Anomaly Detection. SN COMPUT. SCI. 2 (2021) 112. https://doi.org/10.1007/s42979-021-00518-7
[26] Mohammad-Azari S., Bozorg-Haddad O., Chu X. Shark Smell Optimization (SSO) Algorithm. In: Bozorg-Haddad O. (eds) Advanced Optimization by Nature-Inspired Algorithms. Studies in Computational Intelligence, 720. Springer, Singapore. 720 (2018) https://doi.org/10.1007/978-981-10-5221-7_10.
[27] R. Rajakumar, Impact of Static and Adaptive Mutation Techniques on Genetic Algorithm, International Journal of Hybrid Intelligent Systems. 10(1) (2013) 11-22. doi: 10.3233/HIS-120161.
[28] B. R. Rajakumar, Static and Adaptive Mutation Techniques for Genetic Algorithm: A Systematic Comparative Analysis, International Journal of Computational Science and Engineering. 8(2) (2013) 180-193. doi: 10.1504/IJCSE.2013.053087.
[29] S. M. Swamy, B. R. Rajakumar and I. R. Valarmathi, Design of Hybrid Wind and Photovoltaic Power System using Opposition-based Genetic Algorithm with Cauchy Mutation, IET Chennai Fourth International Conference on Sustainable Energy and Intelligent Systems (SEISCON 2013), Chennai, India. (2013). doi: 10.1049/ic.2013.0361.
[30] Aloysius George and B. R. Rajakumar, APOGA: An Adaptive Population Pool Size based Genetic Algorithm, AASRI Procedia - 2013 AASRI Conference on Intelligent Systems and Control (ISC 2013). 4 (2013) 288-296. doi: https://doi.org/10.1016/j.aasri.2013.10.043.
[31] B. R. Rajakumar and Aloysius George, A New Adaptive Mutation Technique for Genetic Algorithm, In proceedings of IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), Coimbatore, India. 18(20) (2012) 1-7. doi: 10.1109/ICCIC.2012.6510293.
[32] Zerina Masetic, Abdulhamit Subasi, Congestive Heart Failure Detection Using Random Forest Classifier, Computer Methods and Programs in Biomedicine. 130 (2016) 54-64.
[33] Yogeswaran Mohan, Sia Seng Chee, Donica Kan Pei Xin and Lee Poh Foong, Artificial Neural Network for Classification of Depressive and Normal in EEG, 2016 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES). (2016).
[34] M. Akhil Jabbar, B. L. Deekshatulu, Priti Chandra, Classification of Heart Disease Using K- Nearest Neighbor and Genetic Algorithm, Procedia Technology. 10 (2013) 85-94.
[35] Fouad, Ahmed, Social Spider Optimization Algorithm. (2015).
[36] Seyyed Hamid Samareh MoosaviVahid Khatibi Bardsiri, Poor and Rich Optimization Algorithm: A New Human-Based and Multi Populations Algorithm, Engineering Applications of Artificial Intelligence. 86 (2019) 165-181.
[37] Dehghani, Mohammad, Št?pánHubálovský, and Pavel Trojovský, Cat and Mouse Based Optimizer: A New Nature-Inspired Optimization Algorithm. 21(15) (2021) 5214. https://doi.org/10.3390/s21155214.
[38] Rajendra Kumar, Identification of Noteworthy Features and Data Mining Techniques for Heart Disease Prediction, In Communication.
[39] [Online]. Available: https://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm#:~:text=Skewness%20is%20a%20measure%20of,relative%20to%20a%20normal%20distribution.
[40] [Online]. Availanble: https://en.wikipedia.org/wiki/Statistic.
[41] [Online]. Availanble: https://en.wikipedia.org/wiki/Standard_deviation
[42] Mishra, Sidharth & Sarkar, Uttam & Taraphder, Subhash & Datta, Sanjoy & Swain, Devi & Saikhom, Reshma & Panda, Sasmita & Laishram, Menalsh, Principal Component Analysis. International Journal of Livestock Research. 1 (2017).
[43] Ling-Jing Kao, Chih Chou Chiu, Application of Integrated Recurrent Neural Network with Multivariate Adaptive Regression Splines on SPC-EPC Process, Journal of Manufacturing Systems. 57 (2020) 109–118.
[44] H.Z. Wang, G.B. Wang, G.Q. Li, J.C. Peng, and Y.T. Liu, Deep Belief Network Based Deterministic and Probabilistic Wind Speed Forecasting Approach, Applied Energy. 182 (2016) 80–93.
[45] Arora, Sankalap & Singh, Satvir, Butterfly Optimization Algorithm: A Novel Approach for Global Optimization. Soft Computing. (2019).
[46] E. Avci, A New Intelligent Diagnosis System for the Heart Valve Diseases by Using Genetic-SVM Classifier, Expert Systems with Applications. 36(7) (2009) 10618-10626.
[47] Paraskevas TsangaratosIoanna Ilia, Comparison of a Logistic Regression and Naïve Bayes Classifier in Landslide Susceptibility Assessments: The Influence of Models Complexity and Training Dataset Size, CATENA. 145 (2016) 164-179.
[48] Y. Lecun, K. Kavukvuoglu, and C. Farabet, Convolutional Networks and Applications in Vision, In Circuits and Systems, International Symposium. (2010) 253–256.
[49] Ping DengHongjun WangXinwen Zhu, Linear Discriminant Analysis Guided by Unsupervised Ensemble Learning, Information Sciences. 480 (2018) 211-221.
[50] Malige Gangappa, Kiran Mai C, Sammulal P, Enhanced Crow Search Optimization Algorithm and Hybrid NN-CNN Classifiers for Classification of Land Cover Images, Multimedia Research. 2(3) (2019) 12-22.
[51] G.Gokulkumari, Classification of Brain Tumor using Manta Ray Foraging Optimization-Based DeepCNN classifier, Multimedia Research. 3(4) (2020).
[52] Sesham Anand, Archimedes Optimization Algorithm: Heart Disease Prediction, Multimedia Research. 4(3) (2021).
[53] Yuanhao Liu, Hybrid Shark Smell Optimization Based on World Cup Optimization Algorithm for Minimization of THD, Journal of Computational Mechanics, Power System and Control. 3(3) (2020).
[54] B. Kranthi Kiran, Indian Music Classification using Neural network Based Dragon Fly Algorithm, Journal of Computational Mechanics, Power System and Control. 4(3) (2021).
[55] Vaibhav Ankush Thorat, Cloud Intrusion Detection using Modified Crow Search Optimized Based Neural Network, Journal of Networking and Communication Systems.4(2) (2021).
[56] Sankul Rathod, Hybrid Metaheuristic Algorithm for Cluster Head Selection in WSN, Journal of Networking and Communication Systems. 3(4) (2020).
[57] (2021). [Online]. Availabe: https://archive.ics.uci.edu/ml/datasets/Heart+Disease.
[58] Kalletla Sunitha, Automatically Identifying Wild Animals in Camera-Trap Images with Deep Learning, IJETT International Journal of Computer Science and Engineering. 8(5) (2021) 12-16. https://doi.org/10.14445/23488387/IJCSE-V8I5P102
[59] Nidhi Mongoriya, Vinod Patel, Review the Breast Cancer Detection Technique Using Hybrid Machine Learning, IJETT International Journal of Computer Science and Engineering. 8(6) (2021) 5-8. https://doi.org/10.14445/23488387/IJCSE-V8I6P102
[60] V .P. Amadi, N.D Nwiabu, V. I. E. Anireh, Case-Based Reasoning System for the Diagnosis and Treatment of Breast, Cervical and Prostate Cancer, IJETT International Journal of Computer Science and Engineering. 8(8) (2021) 13-20. https://doi.org/10.14445/23488387/IJCSE-V8I8P103