Comparative evaluation of ChatGPT-5.0, DeepSeek-R1, and Gemini-2.5 pro in real-world outpatient prescription counseling: A multidimensional analysis

ChatGPT-5.0、DeepSeek-R1 和 Gemini-2.5 pro 在真实世界门诊处方咨询中的比较评价:一项多维度分析

阅读:1

Abstract

OBJECTIVE: To compare the performance of ChatGPT-5.0, DeepSeek-R1, and Gemini-2.5 Pro in real-world outpatient prescription counseling and evaluate their applicability across clinical contexts. METHODS: Fifty authentic prescriptions from four departments were submitted to the three models using standardized Chinese prompts. Responses were independently rated by three associate chief pharmacists across five dimensions-accuracy, relevance, clarity, practicality, and completeness-on a 5-point Likert scale. Rank-based non-parametric tests were applied for overall and subgroup analyses. RESULTS: Significant inter-model differences were observed in most dimensions (P < 0.05). DeepSeek excelled in clarity and practicality, ChatGPT achieved the highest accuracy and completeness, while Gemini consistently scored lower. Department-specific analyses revealed distinct contextual advantages. All models exhibited high response stability. CONCLUSIONS: LLMs demonstrate promising yet heterogeneous performance in outpatient medication counseling. DeepSeek and ChatGPT showed superior overall quality, supporting their potential as assistive "AI pharmacists" under professional supervision. However, several limitations should be acknowledged, including a modest sample size, reliance on expert evaluation rather than patient feedback, and context-specific findings that may limit generalizability.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。