Robust and Explainable Ensemble Based Framework for Liver Disease Classification using Data Balancing and Upsampling

International Journal of Electronics and Communication Engineering |
© 2025 by SSRG - IJECE Journal |
Volume 12 Issue 2 |
Year of Publication : 2025 |
Authors : Aditya Bhongade, Yogita Dubey, Prachi Palsodkar, Punit Fulzele |
How to Cite?
Aditya Bhongade, Yogita Dubey, Prachi Palsodkar, Punit Fulzele, "Robust and Explainable Ensemble Based Framework for Liver Disease Classification using Data Balancing and Upsampling," SSRG International Journal of Electronics and Communication Engineering, vol. 12, no. 2, pp. 1-11, 2025. Crossref, https://doi.org/10.14445/23488549/IJECE-V12I2P101
Abstract:
Liver Disease (LD) is a severe health condition impacting over 2 million lives annually worldwide, as reported by the WHO. Factors such as rising alcohol consumption, increased type 2 diabetes cases, genetic predispositions, and various lifestyle influences are expected to heighten LD prevalence further, underscoring the need for a modern, accurate, and interpretable classification system. This paper aims to develop an effective and transparent Machine Learning (ML) framework using ensemble learning models and Explainable AI (XAI) techniques for LD classification. The proposed framework addresses dataset imbalance and size constraints by employing data balancing and upsampling, enabling the ensemble models to learn complex patterns in clinical data. The performance of each model is evaluated, and the best-performing model, Gradient Boosting (GB), is further analyzed using SHAP, LIME, and ELI5 to interpret its feature impact. GB achieved high classification metrics, including accuracy, precision, recall, specificity, and AUC, with Direct Bilirubin, Alkaline Phosphatase, Alanine Aminotransferase, and Age identified as key influential features. This paper successfully presents a reliable and interpretable ML-based framework for LD classification, combining quantitative performance and explainability, making it highly suitable for clinical application.
Keywords:
Classification, Ensemble learning, Explainable model, Feature analysis, Liver Disease.
References:
[1] Peter Byass, “The Global Burden of Liver Disease: A Challenge for Methods and for Public Health,” BMC Medicine, vol. 12, pp. 1-3, 2014.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Liver Disease, Cleveland Clinic, 2024. [Online]. Avaliable: https://my.clevelandclinic.org/health/diseases/17179-liver-disease
[3] Neda Afreen et al., “A Novel Machine Learning Approach using Boosting Algorithm for Liver Disease Classification,” 5th International Conference on Information Systems and Computer Networks, Mathura, India, pp. 1-5, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[4] G. Shobana, and K. Umamaheswari, “Prediction of Liver Disease Using Gradient Boost Machine Learning Techniques with Feature Scaling,” 5th International Conference on Computing Methodologies and Communication, Erode, India, pp. 1223-1229, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Vinayak Singh, Mahendra Kumar Gourisaria, and Himansu Das, “Performance Analysis of Machine Learning Algorithms for Prediction of Liver Disease,” IEEE 4th International Conference on Computing, Power and Communication Technologies, Kuala Lumpur, Malaysia, pp. 1-7, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Krittika Dutta, Satish Chandra, and Mahendra Kumar Gourisaria, “Early-Stage Detection of Liver Disease through Machine Learning Algorithms,” Advances in Data and Information Sciences, pp. 155-166, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Anton Sokoliuk et al., “Machine Learning Algorithms for Binary Classification of Liver Disease,” IEEE International Conference on Problems of Infocommunications, Science and Technology, Kharkiv, Ukraine, pp. 417-421, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Mounita Ghosh et al., “A Comparative Analysis of Machine Learning Algorithms to Predict Liver Disease,” Intelligent Automation & Soft Computing, vol. 30, no. 3, pp. 917-928, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Fazle Rabbi et al., “Prediction of Liver Disorders Using Machine Learning Algorithms: A Comparative Study,” 2nd International Conference on Advanced Information and Communication Technology, Dhaka, Bangladesh, pp. 111-116, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Ketan Gupta et al., “Liver Disease Prediction Using Machine Learning Classification Techniques,” IEEE 11th International Conference on Communication Systems and Network Technologies, Indore, India, pp. 221-226, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Greeshma Arya et al., “Explainable AI for Enhanced Interpretation of Liver Cirrhosis Biomarkers,” IEEE Access, vol. 11, pp. 123729-123741, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[12] A. Nilofer, and S. Sasikala, “A Comparative Study of Machine Learning Algorithms Using Explainable Artificial Intelligence System for Predicting Liver Disease,” Computing Open, vol. 1, pp. 1-18, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Abrar Alotaibi et al., “Explainable Ensemble-Based Machine Learning Models for Detecting the Presence of Cirrhosis in Hepatitis C Patients,” Computation, vol. 11, no. 6, pp. 1-17, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Xieyi Pei et al., “Machine Learning Algorithms for Predicting Fatty Liver Disease,” Annals of Nutrition and Metabolism, vol. 77, no. 1, pp. 38-45, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Basile Njei et al., “An Explainable Machine Learning Model for Prediction of High-Risk Nonalcoholic Steatohepatitis,” Scientific Reports, vol. 14, pp. 1-9, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Ebenezer Agbozo, and Daniel Musafiri Balu, “Liver Disease Classification-An XAI Approach to Biomedical AI,” Informatica, vol. 48, no. 1, pp. 1-12, 2024.
[CrossRef] [Google Scholar] [Publisher Link]
[17] Cristina Baciu et al., “Artificial Intelligence Applied to Omics Data in Liver Diseases: Enhancing Clinical Predictions,” Frontiers in Artificial Intelligence, vol. 5, pp. 1-10, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[18] R. Pandi Selvam et al., “Explainable Artificial Intelligence with Metaheuristic Feature Selection Technique for Biomedical Data Classification,” Biomedical Data Analysis and Processing Using Explainable (XAI) and Responsive Artificial Intelligence (RAI), pp. 43-57, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Mamatha Bhat et al., “Artificial Intelligence, Machine Learning, and Deep Learning in Liver Transplantation,” Journal of Hepatology, vol. 78, no. 6, pp. 1216-1233, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[20] Ramisetty Kavya et al., “Machine Learning and XAI Approaches for Allergy Diagnosis,” Biomedical Signal Processing and Control, vol. 69, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Seyedeh Neelufar Payrovnaziri et al., “Explainable Artificial Intelligence Models Using Real-World Electronic Health Record Data: A Systematic Scoping Review,” Journal of the American Medical Informatics Association, vol. 27, no. 7, pp. 1173-1185, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Ruey-Kai Sheu, and Mayuresh Sunil Pardeshi, “A Survey on Medical Explainable AI (XAI): Recent Progress, Explainability Approach, Human Interaction and Scoring System,” Sensors, vol. 22, no. 20, pp. 1-42, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Bendi Ramana, and N. Venkateswarlu, “ILPD (Indian Liver Patient Dataset),” UC Irvine Machine Learning Repository, 2022.
[CrossRef] [Publisher Link]