Olympiad-level formal mathematical reasoning with reinforcement learning
奥林匹克级别的强化学习形式数学推理
期刊:Nature
影响因子:48.5
doi:10.1038/s41586-025-09833-y
Hubert, Thomas; Mehta, Rishi; Sartran, Laurent; Horváth, Miklós Z; Žužić, Goran; Wieser, Eric; Huang, Aja; Schrittwieser, Julian; Schroecker, Yannick; Masoom, Hussain; Bertolli, Ottavia; Zahavy, Tom; Mandhane, Amol; Yung, Jessica; Beloshapka, Iuliya; Ibarz, Borja; Veeriah, Vivek; Yu, Lei; Nash, Oliver; Lezeau, Paul; Mercuri, Salvatore; Sönne, Calle; Mehta, Bhavik; Davies, Alex; Zheng, Daniel; Pedregosa, Fabian; Li, Yin; von Glehn, Ingrid; Rowland, Mark; Albanie, Samuel; Velingker, Ameya; Schmitt, Simon; Lockhart, Edward; Hughes, Edward; Michalewski, Henryk; Sonnerat, Nicolas; Hassabis, Demis; Kohli, Pushmeet; Silver, David