Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation

Published on Sep 17, 2009 (7440 views)

Richard S. Sutton

Sutton, Szepesvari and Maei (2009) recently introduced the first temporal-difference learning algorithm compatible with both linear function approximation and off-policy training, and whose complexity

Related categories

Machine Learning
