Options
On the Asymptotic Equivalence Between Differential Hebbian and Temporal Difference Learning
ISSN
0899-7667
Date Issued
2008
Author(s)
DOI
10.1162/neco.2008.04-08-750
Abstract
In this theoretical contribution, we provide mathematical proof that two of the most important classes of network learning—correlation-based differential Hebbian learning and reward-based temporal difference learning—are asymptotically equivalent when timing the learning with a modulatory signal. This opens the opportunity to consistently reformulate most of the abstract reinforcement learning framework from a correlation-based perspective more closely related to the biophysics of neurons.