Generalized Advantage Estimation
E163182
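Generalized Advantage Estimation (GAE) is a reinforcement learning technique for policy gradient methods. It estimates the advantage function as an exponentially weighted average of multi-step temporal-difference estimates, with a parameter λ that trades bias against variance. This lowers the variance of the policy gradient estimate and improves sample efficiency.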
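A minimal sketch of how such an estimate can be computed for a single trajectory, using the standard backward recursion over temporal-difference residuals, δ_t = r_t + γ·V(s_{t+1}) − V(s_t) and A_t = δ_t + γλ·A_{t+1}. The function name `compute_gae`, its argument layout, and the default values of `gamma` and `lam` are illustrative assumptions, not taken from this entry. The sketch also assumes `values` holds one more entry than `rewards`, with the last entry being the bootstrap value of the final state (0.0 if the episode terminated there).

```python
def compute_gae(rewards, values, gamma=0.99, lam=0.95):
    """Return GAE advantages for one trajectory (hypothetical helper).

    rewards: list of T rewards r_0 .. r_{T-1}
    values:  list of T+1 value estimates V(s_0) .. V(s_T);
             values[T] is the bootstrap value (assumed 0.0 at a terminal state)
    """
    if len(values) != len(rewards) + 1:
        raise ValueError("values must have exactly one more entry than rewards")

    advantages = [0.0] * len(rewards)
    running = 0.0  # holds A_{t+1} while iterating backwards
    for t in reversed(range(len(rewards))):
        # One-step temporal-difference residual
        delta = rewards[t] + gamma * values[t + 1] - values[t]
        # Exponentially weighted sum of future residuals
        running = delta + gamma * lam * running
        advantages[t] = running
    return advantages


if __name__ == "__main__":
    print(compute_gae([1.0, 1.0, 1.0], [0.0, 0.0, 0.0, 0.0], gamma=1.0, lam=1.0))
```

Under these assumptions, setting `lam=0` reduces the estimate to the one-step TD residual (lower variance, more bias), while `lam=1` recovers the discounted return minus the value baseline (less bias, higher variance).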
Referenced by (2)
| Subject (surface form when different) | Predicate |
|---|---|
| John Schulman (“High-Dimensional Continuous Control Using Generalized Advantage Estimation”) → | authorOf |
| John Schulman → | notableWork |