Clinical decision support of advanced large language models in endodontic disease

根管疾病中高级大型语言模型的临床决策支持

阅读:2

Abstract

BACKGROUND/PURPOSE: Large language models (LLMs) exhibit significant potential for clinical decision support, yet their application in endodontic disease remains underexplored. MATERIALS AND METHODS: This study assessed the decision-making capabilities of three advanced LLMs (GPT-4o, Claude 3.5, and Grok2) in specialized endodontic contexts. A question bank of 421 multiple-choice questions was constructed across 27 core endodontic topics, including theory, procedures, and 35 complex cases. The three LLMs were tested using standardized prompts, with performance evaluated via topic-stratified accuracy analysis. RESULTS: Claude 3.5 achieved the highest overall accuracy (73.39 %), followed by Grok2 (66.27 %) and GPT-4o (46.32 %). Grok2 excelled in complex case analysis (69.57 %). The models performed strongly in theoretical domains (e.g., clinical examination, structural function, pharmacology) but showed limitations in complex scenarios and procedural techniques. CONCLUSION: LLMs hold promise as endodontic decision support tools, though domain-specific refinement is essential for effective clinical application.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。