Initial Insights Into an Institutional Secure Large Language Model for Magnetic Resonance Imaging Examination Requests: Retrospective Study

对磁共振成像检查申请的机构安全大型语言模型的初步认识：回顾性研究

阅读：1

作者：Hallinan,James Thomas Patrick Decourcy,Leow,Naomi Wenxin,Low,Yi Xian,Lee,Aric,Ong,Wilson,Chan,Matthew Ding Zhou,Devi,Ganakirthana Kalpenya,He,Stephanie Shengjie,Loh,Daniel De-Liang,Lim,Desmond Shi Wei,Low,Xi Zhen,Lim,Mei Chin,Yong,Clement,Sng,Weizhong Jonathan,Teo,Ee Chin,Tan,Jiong Hao,Kumar,Naresh,Makmur,Andrew,Ting,Yonghan

期刊：	Journal of Medical Internet Research	影响因子：	6.000
时间：	2026	起止号：	2026 Apr 7;28:e82579
doi：	10.2196/82579

Abstract

BACKGROUND: Incomplete clinical details on magnetic resonance imaging (MRI) examination requests (MERs) can lead to suboptimal protocol selection. An institutional secure large language model (sLLM) with access to manually retrieved salient data from the electronic medical record (EMR) may improve request completeness and protocol accuracy across multiple MRI subspecialties. OBJECTIVE: The objective of this study was to compare clinician MERs with sLLM-augmented MERs for information quality and to evaluate the protocoling accuracy of the sLLM versus board-certified radiologists across body, musculoskeletal, and neuroradiology MRI. METHODS: This retrospective study included 608 random outpatient MRI examinations performed between September 2023 and July 2024 (body 206, musculoskeletal 203, neuroradiology 199). The cohort comprised 528 patients (mean 51.2 years, SD 19.2; range 4-93; n=279, 52.8% women, n=249, 47.2% men). MERs without EMR access were excluded. A privately hosted Anthropic Claude 3.5 model (temperature 0) augmented each MER with manually retrieved salient EMR data and, via rule-based parsing, mapped the extracted elements onto predefined institutional criteria to recommend region or coverage and contrast use. Two experienced radiologists established a consensus reference standard. Two board-certified general radiologists (Rad 3 and Rad 4) and the sLLM were compared with this standard. Clinical information quality was graded using the Reason-for-Exam Imaging Reporting and Data System (RI-RADS). Interrater reliability was quantified with Gwet AC1. Paired accuracies were compared with the McNemar test to determine whether there was a statistically significant difference. RESULTS: Interreader agreement for RI-RADS was almost perfect for sLLM-augmented MERs (AC1 0.97, 95% CI 0.94-0.99) and moderate for clinician MERs (AC1 0.43, 95% CI 0.34-0.52). Limited or deficient clinical information (RI-RADS C/D) fell to 0% to 0.7% (0/608 to 4/608) with sLLM augmentation vs 4.1% to 20.4% (25/608 to 124/608) for clinician MERs. Overall protocol accuracy was 93.1% (566/608; 95% CI 89.6-96.6) for the sLLM, 91.4% (556/608; 95% CI 87.6-95.3) for Rad 3, and 92.1% (560/608; 95% CI 88.4-95.8) for Rad 4 (sLLM vs Rad 3 P=.23 vs Rad 4 P=.40). Region or coverage accuracy was similar (sLLM: 579/608, 95.2%; Rad 3: 585/608, 96.2%; Rad 4: 573/608, 94.2%; P=.46 and P=.36). Contrast decisions were more accurate using the sLLM at 94.4% (574/608; 95% CI 91.3-97.5) vs Rad 3 at 92.1% (560/608; 95% CI 88.4-95.8; P=.027) and were not significantly different to Rad 4 at 92.9% (565/608; 95% CI 89.4-96.4; P=.16). Subspecialty analyses showed similar patterns, with the sLLM outperforming Rad 4 for musculoskeletal MRI contrast decisions (96.6% vs 91.1%; P=.006) and matching readers elsewhere. Manual review indicated that sLLM improvements arose from EMR details not listed on the MER (infection/inflammation, tumor history, prior surgery). No clinically significant hallucinations were identified in a manual review of discordant cases. CONCLUSIONS: Across body, musculoskeletal, and neuroradiology MRI, sLLM-augmented examination requests improved clinical context and enhanced contrast selection while demonstrating accuracy comparable to general radiologists for region or coverage. Integrating sLLMs into routine vetting workflows may reduce manual workload in protocol selection for more efficient, standardized protocoling.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用；引用内容仅为补充信息，不代表本站立场。

2、若认为本页面引用内容涉及侵权，请及时与本站联系，我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容，需注明“来源：[生知库]”并获得授权；使用引用内容的，需自行联系原作者获得许可。

4、投稿及合作请联系：info@biocloudy.com。

肿瘤免疫

炎症

T细胞

转录调控

凋亡

线粒体

巨噬细胞

传染病

自噬

氧化应激

磷酸化

血管生成

肠道菌群

囊泡

中性粒细胞

单细胞

药物研究

外泌体

细胞衰老

3D/类器官

缺氧低氧

DNA甲基化

铁死亡

乙酰化

miRNA

组蛋白修饰

泛素化

炎性小体

代谢重编程

焦亡

树突状细胞

m6A/m5C/m7G

空间多组学

肿瘤微环境

细胞基因治疗

lncRNA

内质网应激

治疗耐药

Treg

相分离

免疫代谢

上皮间质转化

染色质重塑

脂质过氧化

蛋白质稳态

cGAS-STING

铁代谢

低氧缺氧

乳酸化

碱基编辑

脂代谢

蛋白降解

NK 细胞

circRNA

肠脑轴

MDSC

肿瘤异质性

氨基酸代谢

piRNA

细胞极性

NETosis

翻译调控

氧化脂质

RNA 编辑

溶酶体功能

细胞干性

琥珀酰化

CAR-NK

冷应激

Tfh

巴豆酰化

器官芯片

铜死亡

器官纤维化

表观遗传记忆

线粒体未折叠蛋白反应

空间代谢组

自噬流

MAIT 细胞

程序性坏死

丙酰化

肠肝轴