A multimodal embedding model for sepsis data representation

Abstract

Sepsis research has long been constrained by limited labeled data and models designed for specific tasks that primarily rely on tabular inputs, overlooking the valuable insights contained in clinical text. To address these limitations, we propose the Sepsis Data Representation Model (SepsisDRM), an embedding model that jointly processes tabular and textual data to capture comprehensive patient representations. Trained on a dataset comprising 19,526 sepsis patients, SepsisDRM demonstrates strong generalization across diverse sepsis-related tasks without task-specific tuning. It effectively stratifies patients into four clinically interpretable phenotypes and achieves robust performance in predicting 28-day outcomes, with AUC scores of 0.92, 0.94, and 0.78 on retrospective, prospective, and external datasets, respectively. As the first embedding model developed specifically for sepsis, SepsisDRM establishes a novel paradigm for sepsis research and offers a promising approach for studies in other fields that involve the integration of both tabular and textual data.
