Predictive Representations for Policy Gradient in POMDPs thumbnail
Pause
Mute
Subtitles not available
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Predictive Representations for Policy Gradient in POMDPs

Published on Aug 26, 20093743 Views

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive State Representations

Related categories