Evaluating clinical AI summaries with large language models as judges
Journal: npj Digital Medicine
Impact factor: 15.1
DOI: 10.1038/s41746-025-02005-2
Croxford, Emma; Gao, Yanjun; First, Elliot; Pellegrino, Nicholas; Schnier, Miranda; Caskey, John; Oguss, Madeline; Wills, Graham; Chen, Guanhua; Dligach, Dmitriy; Churpek, Matthew M; Mayampurath, Anoop; Liao, Frank; Goswami, Cherodeep; Wong, Karen K; Patterson, Brian W; Afshar, Majid