Abstract
OBJECTIVE: We aimed to evaluate the zero-shot performance of open-source generative large language models (LLMs) on clinical information extraction from Dutch medical reports, using the Diagnostic Report Analysis: General Optimization of NLP (DRAGON) benchmark.

METHODS: We developed and released llm_extractinator, a scalable, open-source framework for automating information extraction from clinical texts with LLMs. We evaluated nine multilingual open-source LLMs on the 28 tasks of the DRAGON benchmark, covering classification, regression, and named entity recognition (NER). All tasks were performed in a zero-shot setting. Model performance was quantified with task-specific metrics and aggregated into a DRAGON utility score. Additionally, we investigated the effect of in-context translation of the reports to English.

RESULTS: Llama-3.3-70B achieved the highest utility score (0.760), followed by Phi-4-14B (0.751), Qwen-2.5-14B (0.748), and DeepSeek-R1-14B (0.744). These models matched or outperformed a fine-tuned RoBERTa baseline on 17 of the 28 tasks, particularly in regression and structured classification. NER performance was low across all models, and translation to English consistently reduced performance.

DISCUSSION: Generative LLMs demonstrated strong zero-shot capabilities on clinical natural language processing tasks that involve structured inference. Models of around 14B parameters performed well overall, with Llama-3.3-70B leading, albeit at high computational cost. Generative models excelled at regression but were hindered by the token-level output format that NER requires. The performance drop after translation to English underscores the need for native-language support.

CONCLUSION: Open-source generative LLMs provide a viable zero-shot alternative for clinical information extraction from Dutch medical texts, particularly in low-resource and multilingual settings.
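To make the zero-shot setup concrete, the sketch below shows what a single extraction call of the kind evaluated here could look like against a locally hosted open-source model. This is a minimal hypothetical example, not the llm_extractinator API: the use of an Ollama server, the model tag, the example task, and the JSON output schema are all assumptions made for illustration.

```python
# Hypothetical sketch of one zero-shot extraction call; not the
# llm_extractinator API. Assumes a local Ollama server is running
# (default endpoint http://localhost:11434).
import json
import requests

# Illustrative zero-shot prompt: no examples are given, only the task
# description and the required JSON output schema.
PROMPT_TEMPLATE = """You are a clinical information extraction system.
Read the following Dutch medical report and answer in JSON with a single
key "biopsy_performed" set to true or false.

Report:
{report}
"""

def extract(report: str, model: str = "phi4:14b") -> dict:
    """Send one zero-shot extraction request and parse the JSON answer."""
    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={
            "model": model,                                  # assumed model tag
            "prompt": PROMPT_TEMPLATE.format(report=report),
            "format": "json",   # ask Ollama to constrain the output to JSON
            "stream": False,
        },
        timeout=300,
    )
    resp.raise_for_status()
    return json.loads(resp.json()["response"])

if __name__ == "__main__":
    # Synthetic example report, not taken from the benchmark.
    print(extract("Er werd een biopsie verricht van de laesie in de linker onderkwab."))
```

Constraining the model to emit JSON, as sketched here, matches the structured classification and regression tasks on which the abstract reports strong results; token-level outputs for NER do not map as cleanly onto this generate-and-parse pattern.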