Asynchronous Advantage Actor-Critic
E428319
UNEXPLORED
Asynchronous Advantage Actor-Critic is a deep reinforcement learning algorithm that trains multiple parallel agents to learn both policy and value functions efficiently and stably.