RTPhy-ChatBot: A RAG-Based intelligent assistant for radiotherapy physics using LLaMA3 and AAPM reports

RTPhy-ChatBot:基于 RAG 的放射治疗物理学智能助手,使用 LLaMA3 和 AAPM 报告

阅读:1

Abstract

BACKGROUND: Medical physics plays a crucial role in radiotherapy, with ongoing technological advancements aimed at improving treatment outcomes. However, the rapid pace of innovation presents challenges for medical physicists, who must continuously acquire and integrate complex information for effective decision-making and communication. PURPOSE: To support efficient knowledge acquisition, we developed RTPhy-ChatBot, an intelligent assistant tailored to radiotherapy physics. The objective was to create a reliable and precise tool to assist medical physicists in their daily work. METHODS: The knowledge base for RTPhy-ChatBot was constructed from publications by the American Association of Physicists in Medicine (AAPM), which were converted into markdown format, segmented, and embedded using the bge-base-en-v1.5 model. RTPhy-ChatBot employed the Meta-LLaMA3-8B-Instruct model for response generation. We compared its performance with several commercial large language models (LLMs) across 20 template questions and evaluated the impact of zero-shot chain-of-thought (CoT) reasoning. In addition to expert scoring by senior medical physicists, we conducted Rouge score analysis against synthesized reference answers. RESULTS: RTPhy-ChatBot demonstrated strong performance in answering radiotherapy physics questions. Across 20 questions, it achieved an average score of 4.0 ± 0.9, compared to 3.9 ± 1.1 for Gemini-2.0-Flash, 4.0 ± 1.4 for GPT-4o, and 3.8 ± 1.2 for Moonshot-v1. It excelled in questions involving specific quality assurance standards. Rouge analysis yielded scores of 0.5127 (Rouge-1), 0.2119 (Rouge-2), and 0.2748 (Rouge-L), closely matching commercial LLMs. CONCLUSIONS: RTPhy-ChatBot proved to be an effective intelligent assistant for radiotherapy physics, delivering accurate, referenced responses grounded in AAPM publications. Despite lacking online access, it matched or exceeded the performance of commercial LLMs in domain-specific tasks. This pilot study highlights the potential of domain-specific assistants in supporting clinical workflows.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。