[ONOS Seminar Series] Professor. Vijay Mohan K Namboodiri : Dopamine and causal learning
Date
Description
A hallmark of intelligence is the ability to learn associations between causes and effects (e.g., environmental cues and associated rewards). The near consensus understanding of the last few decades is that animals learn cause-effect associations from errors in the prediction of the effect (e.g., a reward prediction error or RPE). This theory has been hugely influential in neuroscience as decades of evidence suggested that mesolimbic dopamine (DA)— known to be critical for associative learning—appears to signal RPE. Though some evidence questioned whether DA signals RPE, the RPE hypothesis remained the best explanation of learning because no other normative theory of learning explained experimental observations inconsistent with RPE while also capturing phenomena explained by RPE.
However, my lab recently proposed a new theory of associative learning (named ANCCR, read “anchor”) which postulates that animals learn associations by retrospectively identifying causes of meaningful effects such as rewards and that mesolimbic dopamine conveys that a current event is meaningful (Jeong et al., Science, 2022, Burke et al., BioRxiv, 2023, Garr et al., BioRxiv, 2023). The core idea is simple: you can learn to predict the future by retrodicting the past, and you retrodict the past only after meaningful events. Here, I will present the basic formulation of this theory, experimental data focused on distinguishing predictions of ANCCR and RPE, and unpublished experimental results demonstrating that behavioral and dopaminergic learning rates from cue-reward experiences are quantitatively scaled by reward sparsity, providing an effective solution to “few-shot learning” from sparse experiences.
Subscribe to the OIST Calendar: Right-click to download, then open in your calendar application.