BEnchmarking Large Language Models for Ophthalmology (BELO): An Expert-Curated Data Set and Evaluation Framework for Knowledge and Reasoning
眼科大型语言模型基准测试(BELO):专家整理的知识与推理数据集和评估框架
期刊:Ophthalmology Science
影响因子:4.6
doi:10.1016/j.xops.2025.101050
Srinivasan, Sahana; Ai, Xuguang; Lo, Thaddaeus Wai Soon; Gilson, Aidan; Zou, Minjie; Zou, Ke; Kim, Hyunjae; Yang, Mingjia; Pushpanathan, Krithi; Yew, Samantha Min Er; Loke, Wan Ting; Goh, Jocelyn Hui Lin; Chen, Yibing; Kong, Yiming; Fu, Emily Yuelei; Ong, Michelle; Nwanyanwu, Kristen; Dave, Amisha; Li, Kelvin Zhenghao; Sun, Chen-Hsin; Chia, Mark; Yang, Gabriel Dawei; Wong, Wendy Meihua; Chen, David Ziyou; Liu, Dianbo; Singer, Maxwell; Antaki, Fares; Del Priore, Lucian V; Jonas, Jost B; Adelman, Ron; Chen, Qingyu; Tham, Yih-Chung