SAC

E426679 UNEXPLORED

SAC (Soft Actor-Critic) is a popular off-policy deep reinforcement learning algorithm that optimizes both expected return and policy entropy to achieve stable and efficient learning in continuous control tasks.

Jump to: Referenced by

Referenced by (2)

Full triples — surface form annotated when it differs from this entity's canonical label.