Enhancing graduate education assessment: a machine learning-based classification of academic performance in medical students

提升研究生教育评估:基于机器学习的医学生学业成绩分类

阅读:1

Abstract

BACKGROUND: Accurately predicting academic performance among medical postgraduate students is crucial for understanding educational outcomes and providing effective early academic guidance. Traditional statistical approaches often struggle to balance predictive performance with interpretability, particularly when handling complex relationships among academic and psychosocial factors. METHODS: A semi-structured survey was administered to medical postgraduate students at a Chinese medical university, yielding a final sample of 1,091 participants. GPA was dichotomized into two categories: outstanding academic performance (GPA ≥ 80) and non-outstanding academic performance (GPA < 80). Feature selection was performed using the Boruta algorithm. Logistic regression and XGBoost models were developed and evaluated on a held-out test set. Model performance was assessed using the area under the receiver operating characteristic curve, accuracy, and complementary validation metrics. Shapley Additive Explanations (SHAP) analysis was applied to interpret the contributions of key predictors. RESULTS: Both models demonstrated acceptable predictive performance. Undergraduate academic achievement emerged as the most influential predictor of GPA classification, followed by selected psychosocial characteristics and foundational academic skills. Shapley Additive Explanations (SHAP) interpretation provided transparent insights into the relative importance and directionality of these predictors. CONCLUSION: This study presents an interpretable machine learning framework for predicting academic performance in medical postgraduate education. By combining predictive modeling with explainable techniques, the proposed approach supports reliable performance assessment while maintaining transparency, offering a methodological foundation for future research and cautious application in educational analytics.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。