Fine-grained evaluation of a domain-specific Q&A dataset to support trustworthy medical language models
Journal: Health Information Science and Systems
Impact factor: 3.4
DOI: 10.1007/s13755-026-00458-7
Fonseca, Rafael da C; Rios, Ricardo A; Castaldoni, Rodrigo; Carvalho, Adrielle A; Lopes, Tiago J S; Andrade, Caio L B; Bispo, Braian V G; Mota, Laís R; Rios, Tatiane N