ARCHIVES

Original Article

An Explainable Machine Learning Framework for Early Prediction of Student Academic Performance

Deepak Kumar1 Vikas Arora2 Abhinav Pratap Soni3 Anurag Agarwal4
1 2 3 4Department of Computer Science Engineering, Roorkee Institute of Technology, Uttarakhand, India.

Published Online: March-April 2026

Pages: 90-95

Abstract

Early identification of students at risk of poor academic performance is essential for improving learning outcomes and reducing dropout rates in higher education institutions. The increasing adoption of digital learning platforms has resulted in the generation of large-scale educational data, enabling the application of machine learning techniques for predictive analytics. Although existing studies report high prediction accuracy, most machine learning–based models operate as black boxes, limiting their interpretability and practical adoption by educators. To address this limitation, this paper proposes an explainable machine learning framework for early prediction of student academic performance. The proposed approach employs a Random Forest classifier for accurate risk prediction and integrates SHapley Additive exPlanations (SHAP) to provide both global and local interpretability of model decisions. Experiments conducted on the UCI Student Performance dataset demonstrate that the proposed framework achieves an accuracy of 91.14%, an F1-score of 86.79%, and an ROC–AUC of 0.9655, outperforming baseline classifiers. Furthermore, SHAP-based explanations identify key academic and behavioral factors influencing student performance and provide individualized explanations for at-risk students. The results confirm that the proposed framework offers both high predictive performance and transparency, making it suitable for real-world educational decision-support systems

Related Articles

2026

AI-Based Stomach Cancer Detection Using Biomarkers, Medical Images, and Voice Analysis

2026

Hydrogen-Efficient Eco-Driving and Route Planning for Fuel-Cell Electric Vehicles Using Multi-Objective Optimization Under Traffic and Terrain Uncertainty

2026

A Data-Driven Machine Learning Framework for Assessing Patent Commercial Value and Technological Significance

2026

Evaluating Student Academic Performance Through a Benchmark of Fuzzy Reasoning Models

2026

A Hybrid Soft Computing Approach for Managing Uncertainty in Data Analytics

2026

Soft Computing Approaches for Robust Analysis of Imbalanced and Noisy Data

Share Article

X
LinkedIn
Facebook
WhatsApp

Or copy link

https://test.theijire.com/archives/10.59256/ijire.20260702012

*Instagram doesn't support direct link sharing from web. Copy the link and share it in your Instagram story or post.