On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient thumbnail
Pause
Mute
Subtitles not available
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient

Published on Mar 25, 20113578 Views

Likelihood ratio policy gradient methods have been some of the most successful reinforcement learning algorithms, especially for learning on physical systems. We describe how the likelihood ratio poli

Related categories