Explainable Model for Agricultural Crop Yield Prediction in Indian Conditions with SHAP Analysis

International Journal of Electronics and Communication Engineering |
© 2025 by SSRG - IJECE Journal |
Volume 12 Issue 1 |
Year of Publication : 2025 |
Authors : Yogita Dubey, Aniket Sakhare, Atharva Tasare, Santosh Kakad, Roshan Umate |
How to Cite?
Yogita Dubey, Aniket Sakhare, Atharva Tasare, Santosh Kakad, Roshan Umate, "Explainable Model for Agricultural Crop Yield Prediction in Indian Conditions with SHAP Analysis," SSRG International Journal of Electronics and Communication Engineering, vol. 12, no. 1, pp. 236-244, 2025. Crossref, https://doi.org/10.14445/23488549/IJECE-V12I1P118
Abstract:
This paper presents an ensemble-based approach to agricultural crop yield forecasting, focusing on the Indian context. The study integrates different forecasting models to improve forecasting accuracy for agricultural complexity data analysis. SHAP (SHapley Additive exPlanations) is used to give a clear idea of the contribution of each factor in the prediction model to improve model interpretation. The dataset used for this study contains yields of 55 crops over 6 seasons in 30 countries in 23 years (1997) -2020 available). Besides demonstrating the method's effectiveness, it emphasizes the need for explicit modeling that can provide valuable insights for better agricultural practices and ultimately contribute to higher yields that will be sustainable in Indian agriculture.
Keywords:
Ensemble learning, Explainable AI, Yield prediction, Quantitative assessment, SHAP analysis.
References:
[1] Andrew Crane-Droesch, “Machine Learning Methods for Crop Yield Prediction and Climate Change Impact Assessment in Agriculture,” Environmental Research Letters, vol. 13, no. 11, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Anna Chlingaryan, Salah Sukkarieh, and Brett Whelan, “Machine Learning Approaches for Crop Yield Prediction and Nitrogen Status Estimation in Precision Agriculture: A Review,” Computers and Electronics in Agriculture, vol. 151, pp. 61-69, 2018.
[CrossRef] [Google Scholar] [Publisher Link]
[3] Aruvansh Nigam et al., “Crop Yield Prediction Using Machine Learning Algorithms,” 2019 Fifth International Conference on Image Information Processing (ICIIP), Shimla, India, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Devdatta A. Bondre, and Santosh Mahagaonkar, “Prediction of Crop Yield and Fertilizer Recommendation Using Machine Learning Algorithms,” International Journal of Engineering Applied Sciences and Technology, vol. 4, no. 5, pp. 371-376, 2019.
[Google Scholar] [Publisher Link]
[5] P.S. Maya Gopal, and R. Bhargavi. “A Novel Approach for Efficient Crop Yield Prediction,” Computers and Electronics in Agriculture, vol. 165, 2019.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Thomas Van Klompenburg, Ayalew Kassahun, and Cagatay Catal, “Crop Yield Prediction Using Machine Learning: A Systematic Literature Review,” Computers and Electronics in Agriculture, vol. 177, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[7] Dhivya Elavarasan, and P.M. Durairaj Vincent, “Crop Yield Prediction Using Deep Reinforcement Learning Model for Sustainable Agrarian Applications,” IEEE Access, vol. 8, pp. 86886-86901, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Farhat Abbas et al., “Crop Yield Prediction through Proximal Sensing and Machine Learning Algorithms,” Agronomy, vol. 10, no. 7, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[9] Potnuru Sai Nishant et al., “Crop Yield Prediction Based on Indian Agriculture Using Machine Learning,” 2020 International Conference for Emerging Technology (INCET), Belgaum, India, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Mamunur Rashid et al., “A Comprehensive Review of Crop Yield Prediction Using Machine Learning Approaches with Special Emphasis on Palm Oil Yield Prediction,” IEEE Access, vol. 9, pp. 63406-63439, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Dilli Paudel et al., “Machine Learning for Large-Scale Crop Yield Forecasting,” Agricultural Systems, vol. 187, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Mummaleti Keerthana, Priyanka Kunapuli, and Durga Bhavani Banavath, “An Ensemble Algorithm for Crop Yield Prediction,” 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), Tirunelveli, India, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Mohsen Shahhosseini, Guiping Hu, and Sotirios V. Archontoulis, “Forecasting Corn Yield with Machine Learning Ensembles,” Frontiers in Plant Science, vol. 11, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[14] Masahiro Ryo, “Explainable Artificial Intelligence and Interpretable Machine Learning for Agricultural Data Analysis,” Artificial Intelligence in Agriculture, vol. 6, pp. 257-265, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[15] Tongxi Hu et al., “Crop Yield Prediction via Explainable AI and Interpretable Machine Learning: Dangers of Black Box Models for Evaluating Climate Change Impacts on Crop Yield,” Agricultural and Forest Meteorology, vol. 336, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[16] Alejandro Morales, and Francisco J. Villalobos, “Using Machine Learning for Crop Yield Prediction in the Past or the Future,” Frontiers in Plant Science, vol. 14, pp. 1-13, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[17] S. Iniyan, V. Akhil Varma, and Ch Teja Naidu, “Crop Yield Prediction Using Machine Learning Techniques,” Advances in Engineering Software, vol. 175, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[18] Ersin Elbasi et al., “Crop Prediction Model Using Machine Learning Algorithms,” Applied Sciences, vol. 13, no. 16, pp. 1-20, 2023.
[CrossRef] [Google Scholar] [Publisher Link]
[19] Akshat Gupta, Akshat Saini, and Abhay Dewedi, Agricultural Crop Yield in Indian States Dataset, Kaggle. [Online]. Available: https://www.kaggle.com/datasets/akshatgupta7/crop-yield-in-indian-states-dataset
[20] Leo Breiman et al., Classification and Regression Trees, 1st ed., Chapman and Hall/CRC, pp. 1-368, 2017.
[CrossRef] [Google Scholar] [Publisher Link]
[21] Andy Liaw, and Matthew Wiener, “Classification and Regression by Random Forest,” R News, vol. 2, no. 3, pp. 18-22, 2002.
[Google Scholar]
[22] Jerome H. Friedman, “Stochastic Gradient Boosting,” Computational Statistics & Data Analysis, vol. 38, no. 4, pp. 367-378, 2002.
[CrossRef] [Google Scholar] [Publisher Link]
[23] Tianqi Chen, and Carlos Guestrin, “XGBoost: A Scalable Tree Boosting System,” Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco California USA, pp. 785-794, 2016.
[CrossRef] [Google Scholar] [Publisher Link]
[24] Trevor Hastie, Robert Tibshirani, and Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 1st ed., Springer New York, NY, 2001.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Fabian Pedregosa et al., “Scikit-Learn: Machine Learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825-2830, 2011.
[Google Scholar]
[26] Scott M. Lundberg, and Su-In Lee, “A Unified Approach to Interpreting Model Predictions,” Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach California USA, pp. 4768-4777, 2017.
[Google Scholar] [Publisher Link]
[27] Aurelien Géron, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, 2nd ed., O'Reilly Media, 2019.
[Publisher Link]
[28] Sebastian Raschka, and Vahid Mirjalili, Python Machine Learning: Machine Learning and Deep Learning with Python, Scikit-learn, and TensorFlow 2, Packt Publishing, pp. 1-772, 2019.
[Google Scholar] [Publisher Link]