Harnessing multimodal approaches for depression detection using large language models and facial expressions

利用大型语言模型和面部表情的多模态方法进行抑郁症检测

阅读：1

作者：Sadeghi,Misha,Richer,Robert,Egger,Bernhard,Schindler-Gmelch,Lena,Rupp,Lydia Helene,Rahimi,Farnaz,Berking,Matthias,Eskofier,Bjoern M

期刊：		影响因子：
时间：	2024	起止号：	2024 Dec 23;3(1):66
doi：	10.1038/s44184-024-00112-8	研究方向：	神经科学
疾病类型：	抑郁症

Abstract

Detecting depression is a critical component of mental health diagnosis, and accurate assessment is essential for effective treatment. This study introduces a novel, fully automated approach to predicting depression severity using the E-DAIC dataset. We employ Large Language Models (LLMs) to extract depression-related indicators from interview transcripts, utilizing the Patient Health Questionnaire-8 (PHQ-8) score to train the prediction model. Additionally, facial data extracted from video frames is integrated with textual data to create a multimodal model for depression severity prediction. We evaluate three approaches: text-based features, facial features, and a combination of both. Our findings show the best results are achieved by enhancing text data with speech quality assessment, with a mean absolute error of 2.85 and root mean square error of 4.02. This study underscores the potential of automated depression detection, showing text-only models as robust and effective while paving the way for multimodal analysis.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。