Citation

Li C, Brenner J, Boesky A, Ramanathan S, Kreiman G. 2024. Neuron-level Prediction and Noise can Implement Flexible Reward-Seeking Behavior. bioRxiv : the preprint server for biology. Pubmed: 38826332 DOI:10.1101/2024.05.22.595306

Abstract

We show that neural networks can implement reward-seeking behavior using only local predictive updates and internal noise. These networks are capable of autonomous interaction with an environment and can switch between explore and exploit behavior, which we show is governed by attractor dynamics. Networks can adapt to changes in their architectures, environments, or motor interfaces without any external control signals. When networks have a choice between different tasks, they can form preferences that depend on patterns of noise and initialization, and we show that these preferences can be biased by network architectures or by changing learning rates. Our algorithm presents a flexible, biologically plausible way of interacting with environments without requiring an explicit environmental reward function, allowing for behavior that is both highly adaptable and autonomous. Code is available at https://github.com/ccli3896/PaN.

Related Faculty

Photo of Sharad Ramanathan

Sharad Ramanathan investigates how multi-potent stem cells make fate decisions to give rise to complex human tissues, and how the dynamics of key neurons in the nervous system drive behavioral decisions.

Search Menu