Evaluating language models for mathematics through interactions
通过交互评估数学语言模型
期刊:Proceedings of the National Academy of Sciences of the United States of America
影响因子:9.1
doi:10.1073/pnas.2318124121
Collins, Katherine M; Jiang, Albert Q; Frieder, Simon; Wong, Lionel; Zilka, Miri; Bhatt, Umang; Lukasiewicz, Thomas; Wu, Yuhuai; Tenenbaum, Joshua B; Hart, William; Gowers, Timothy; Li, Wenda; Weller, Adrian; Jamnik, Mateja