Feature Based Ensemble Learning Model for Breast Cancer Reoccurrence Retrieval

© 2022 by IJETT Journal
Volume-70 Issue-10
Year of Publication : 2022
Authors : Mohan Kumar, Sunil Kumar Khatri, Masoud Mohammadian
DOI : 10.14445/22315381/IJETT-V70I10P220

How to Cite?

Mohan Kumar, Sunil Kumar Khatri, Masoud Mohammadian, "Feature Based Ensemble Learning Model for Breast Cancer Reoccurrence Retrieval," International Journal of Engineering Trends and Technology, vol. 70, no. 10, pp. 210-220, 2022. Crossref, https://doi.org/10.14445/22315381/IJETT-V70I10P220

In recent years, 30% of women have been diagnosed with cancer each year. Improved medical treatment brings many patients into remission, but challenges remain. New cases in the last year are estimated at 14.8%. Breast cancer recurrence raises new challenges, with severe effects that can include loss of life, so if it is detected early, it can be cured. Recent techniques such as machine learning are needed to predict recurrence and thereby reduce mortality to some extent. This paper proposes an ensemble approach that uses the voting method to combine classifiers and detect the two classes of tumor, benign and malignant. The ensemble uses a meta-classifier that implements more than one classifier. Experiments were conducted on the Wisconsin Diagnostic Breast Cancer (WDBC) dataset, and voting techniques were used to improve the results in model evaluation. Logistic regression, support vector machine, RBFF and linear classifiers, a decision tree classifier, and random forest were evaluated on classification accuracy, precision, recall, and F1 measure. The results show that the ensemble models achieved strong performance, reaching 98% accuracy. A model combining AdaBoost with cost sensitivity also achieved reasonable accuracy. The proposed model supports planning and evaluating follow-up visit interventions and advanced treatment recommendations, with the aim of lowering cancer mortality.

Machine learning, Classification, AdaBoost, Ensemble learning.
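
The voting scheme and the four evaluation measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `majority_vote` and `binary_metrics`, the labels "M" (malignant) and "B" (benign), and the three toy prediction lists are all assumptions made for illustration, and no WDBC data is used. It shows hard (majority) voting over the outputs of several classifiers, followed by accuracy, precision, recall, and F1 computed from confusion counts.

```python
from collections import Counter


def majority_vote(predictions):
    """Hard voting: return, per sample, the label predicted by most classifiers.

    `predictions` is a list of per-classifier label lists of equal length.
    An odd number of classifiers avoids ties in the two-class case.
    """
    n_samples = len(predictions[0])
    voted = []
    for i in range(n_samples):
        votes = Counter(p[i] for p in predictions)
        voted.append(votes.most_common(1)[0][0])
    return voted


def binary_metrics(y_true, y_pred, positive="M"):
    """Accuracy, precision, recall, and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


# Toy labels and predictions (hypothetical, not taken from the paper).
y_true = ["M", "B", "B", "M", "B", "M"]
clf_a = ["M", "B", "M", "M", "B", "B"]  # stands in for one base classifier
clf_b = ["M", "B", "B", "B", "B", "M"]  # stands in for a second one
clf_c = ["B", "B", "B", "M", "M", "M"]  # stands in for a third one

ensemble = majority_vote([clf_a, clf_b, clf_c])

for name, pred in [("clf_a", clf_a), ("clf_b", clf_b),
                   ("clf_c", clf_c), ("ensemble", ensemble)]:
    print(name, binary_metrics(y_true, pred))
```

In this toy case the base classifiers err on different samples, so the majority vote corrects each individual mistake. That is the intuition behind voting ensembles, although real classifiers often make correlated errors and the gain is usually smaller.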
