Machine learning-based analysis and prediction of factors influencing mental health among children and adolescents in Jiangsu Province

基于机器学习的江苏省儿童青少年心理健康影响因素分析与预测

阅读:4

Abstract

BACKGROUND: This study investigates the current mental health status among children and adolescents in Jiangsu Province by analyzing symptoms of depression, anxiety, and stress using standardized psychological scales. Machine learning models were utilized to identify key influencing variables and predict mental health outcomes, aiming to establish a rapid psychological well-being assessment framework for this population. OBJECTIVE: A cross-sectional survey was conducted via random cluster sampling across 98 counties (cities/districts) in Jiangsu Province, enrolling 141,725 students (47,502 primary, 47,274 junior high, 11,619 vocational high school students, and 35,330 senior high ). The study focused on prevalent mental health disorders and associated risk factors. METHODS: Depression, anxiety, and stress scores served as dependent variables, with 57 socio-demographic and behavioral factors as independent variables. Five supervised machine learning models (Decision Tree, Naive Bayes, Random Forest, K-Nearest Neighbors (KNN), and XGBoost) were implemented using R software. Model performance was evaluated using accuracy, precision, recall, F1 Score and Area Under the ROC Curve (AUC). Feature importance analysis was conducted to identify key predictors. RESULTS: The study revealed significant mental health disparities: depression (14.9%), anxiety (25.5%), and stress (10.9%) prevalences showed clear gender and regional gradients. Females exhibited higher rates across all conditions (p < 0.05), and urban areas had elevated risks compared to suburban regions. Mental health deterioration escalated with educational stages (e.g., depression from 9.2% in primary to 21.2% in senior high; χ²(trend) = 2274.55, p < 0.05). The XGBoost model demonstrated optimal predictive performance (AUC: depression = 0.799, anxiety = 0.770, stress = 0.762), outperforming other models. Feature importance analysis consistently identified bullying duration, age, and drinking history as top risk factors across both Gain and SHAP methods, while SHAP values additionally emphasized modifiable lifestyle factors (e.g., breakfast frequency) and demographic variables (e.g., gender). CONCLUSIONS: This study identifies bullying, age, and alcohol consumption history as key mental health risk factors among Jiangsu's children and adolescents. These findings emphasize the need for school-based anti-bullying programs, age-specific mental health counseling, and healthy lifestyle education (including alcohol refusal). Lifestyle behaviors like daily breakfast intake should be integrated into dietary interventions for mental health promotion. Urban-rural and gender disparities necessitate targeted support for urban adolescent females, while educational stage differences highlight the criticality of early prevention.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。