SAC
E426679
UNEXPLORED
SAC (Soft Actor-Critic) is a popular off-policy deep reinforcement learning algorithm that optimizes both expected return and policy entropy to achieve stable and efficient learning in continuous control tasks.
Jump to:
Referenced by
Referenced by (2)
Full triples — surface form annotated when it differs from this entity's canonical label.