A novel evaluation benchmark for medical LLMs illuminating safety and effectiveness in clinical domains
Journal: npj Digital Medicine
Impact factor: 15.1
DOI: 10.1038/s41746-025-02277-8
Authors: Wang, Shirui; Tang, Zhihui; Yang, Huaxia; Gong, Qiuhong; Gu, Tiantian; Ma, Hongyang; Wang, Yongxin; Sun, Wubin; Lian, Zeliang; Mao, Kehang; Jiang, Yinan; Huang, Zhicheng; Ma, Lingyun; Shen, Wenjie; Ji, Yajie; Tan, Yunhui; Wang, Chunbo; Gao, Yunlu; Ye, Qianling; Lin, Rui; Chen, Mingyu; Niu, Lijuan; Wang, Zhihao; Yu, Peng; Lang, Mengran; Liu, Yue; Zhang, Huimin; Shen, Haitao; Chen, Long; Zhao, Qiguang; Liu, Si-Xuan; Zhou, Lina; Gao, Hua; Ye, Dongqiang; Meng, Lingmin; Yu, Youtao; Liang, Naixin; Wu, Jianxiong