Explainable machine learning for mental health prediction from social media behavior: a nested cross-validation study with SHAP and LIME interpretability

基于社交媒体行为的心理健康预测的可解释机器学习:一项嵌套交叉验证研究,结合SHAP和LIME可解释性

阅读:1

Abstract

Social media behavior is a promising source of early indicators for psychological distress; however, predictive models often lack transparency, limiting their adoption in mental health settings. This paper describes an explainable machine learning framework for predicting self-reported depression risk based on behavioral features collected from 481 anonymized social media users. Three supervised learning models were tested using a nested 5 × 5 cross-validation strategy, with Random Forest yielding the strongest performance (accuracy = 84.2%, AUC = 0.88). Model calibration analysis using reliability curves and Expected Calibration Error (ECE) demonstrated that Random Forest provides well-calibrated probability estimates suitable for binary High/Low risk assessment. Explainability was integrated using SHAP to identify key behavioral markers, including screen time, passive scrolling, nighttime usage, and stress-driven engagement. Stability testing across multiple random seeds revealed consistent feature ranking patterns, supporting the reliability of the explanations. To showcase real-world applicability, we outline a prototype XAI-driven digital intervention workflow and present a simulation across representative user profiles, illustrating how interpreted model outputs can inform personalized behavioral recommendations. However, generalizability is limited by a moderately sized dataset reliant on self-reported measures and cross-sectional design. Future work will integrate multimodal behavioral signals, larger cohorts, and clinically validated mental-health assessments. Overall, the study presents a more transparent, computationally grounded approach for interpretable depression-risk prediction from social media behavior, bridging the gap between predictive performance and practical explainability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。