Duration between rewards controls the rate of behavioral and dopaminergic learning

奖励间隔时间控制着行为学习和多巴胺学习的速度。

阅读:1

Abstract

Learning the causes of rewards is crucial for survival. Cue-reward associative learning is controlled in the brain by mesolimbic dopamine. It is widely believed that dopamine drives learning by conveying a reward prediction error. Dopamine-based learning algorithms are generally 'trial-based': learning progresses sequentially across individual cue-outcome experiences. A foundational assumption of these models is that the more cue-reward pairings one experiences over a fixed duration, the more one learns this association. By identifying a new biological principle governing learning, we disprove this assumption. Specifically, across many conditions in mice, we show that behavioral and dopaminergic learning rates are proportional to the duration between rewards (or punishments). Due to this rule, the overall learning over a fixed duration is independent of the number of cue-outcome experiences. A dopamine-based model of retrospective learning explains these findings, thereby providing a unified account of the biological mechanisms of learning.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。