The Performance of Wearable Device-Based Artificial Intelligence in Detecting Depression: Systematic Review and Meta-Analysis

基于可穿戴设备的人工智能在抑郁症检测中的性能:系统评价和荟萃分析

阅读:1

Abstract

BACKGROUND: In recent years, advances in wearable sensor technology and artificial intelligence (AI) have provided new possibilities for detecting and monitoring depression. OBJECTIVE: This study systematically reviewed and meta-analyzed the diagnostic and predictive performance of wearable device-based AI models for detecting depression and predicting depressive episodes and explored factors influencing outcomes. METHODS: Following PRISMA-DTA (Preferred Reporting Items for a Systematic Review and Meta-Analysis of Diagnostic Test Accuracy) guidelines, the PubMed, Embase, Web of Science, and PsycINFO databases were searched from inception to May 27, 2025. Eligible studies used AI algorithms on wearable device data for depression detection or episode prediction. Sensitivity, specificity, diagnostic odds ratio, and area under the curve (AUC) were pooled using a bivariate random effects model. Risk of bias was assessed using Prediction Model Risk of Bias Assessment Tool plus artificial intelligence (PROBAST+ AI), and certainty of evidence was assessed using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) tool. RESULTS: We included 16 studies (32 datasets) with 1189 patients and 13,593 samples. For depression detection, pooled sensitivity and specificity were 0.89 (95% CI 0.83-0.93) and 0.93 (95% CI 0.87-0.96), with a diagnostic odds ratio of 110.47 (95% CI 33.33-366.17) and AUC of 0.96 (95% CI 0.94-0.98). Random forest models showed the best performance (sensitivity=0.89, specificity=0.91, AUC=0.97). Subgroup analyses indicated that study design, AI method, reference standard, and input type significantly affected diagnostic accuracy (P<.05). For depressive episode prediction (3 datasets), pooled sensitivity was 0.86 (95% CI 0.80-0.91), and pooled specificity was 0.65 (95% CI 0.59-0.71). The overall risk of bias was low to moderate, with no evidence of publication bias. CONCLUSIONS: Wearable device-based AI models achieved high accuracy for detecting depression and moderate utility in predicting episodes. However, heterogeneity, reliance on retrospective and public datasets, and lack of standardized methods limited generalizability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。