Choice behaviour of animals is characterized by two main tendencies: taking actions that led to rewards and repeating past actions(1,2). Theory suggests that these strategies may be reinforced by different types of dopaminergic teaching signals: reward prediction error to reinforce value-based associations and movement-based action prediction errors to reinforce value-free repetitive associations(3-6). Here we use an auditory discrimination task in mice to show that movement-related dopamine activity in the tail of the striatum encodes the hypothesized action prediction error signal. Causal manipulations reveal that this prediction error serves as a value-free teaching signal that supports learning by reinforcing repeated associations. Computational modelling and experiments demonstrate that action prediction errors alone cannot support reward-guided learning, but when paired with the reward prediction error circuitry they serve to consolidate stable sound-action associations in a value-free manner. Together we show that there are two types of dopaminergic prediction errors that work in tandem to support learning, each reinforcing different types of association in different striatal areas.
Dopaminergic action prediction errors serve as a value-free teaching signal.
多巴胺能作用预测误差可作为无价值的教学信号
阅读:8
作者:Greenstreet Francesca, Vergara Hernando Martinez, Johansson Yvonne, Pati Sthitapranjya, Schwarz Laura, Lenzi Stephen C, Geerts Jesse P, Wisdom Matthew, Gubanova Alina, Rollik Lars B, Kaur Jasvin, Moskovitz Theodore, Cohen Joseph, Thompson Emmett, Margrie Troy W, Clopath Claudia, Stephenson-Jones Marcus
| 期刊: | Nature | 影响因子: | 48.500 |
| 时间: | 2025 | 起止号: | 2025 Jul;643(8074):1333-1342 |
| doi: | 10.1038/s41586-025-09008-9 | 研究方向: | 信号转导 |
特别声明
1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。
2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。
3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。
4、投稿及合作请联系:info@biocloudy.com。
