isModelFree

P132456
predicate

Indicates that the subject (a behavior, decision, or control process) does not rely on an internal model of the environment’s dynamics, and instead acts on directly learned value estimates or cached experience.

Sample triples (1)

Subject Object
Q-learning true
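
To illustrate why Q-learning carries the value true, the following is a minimal sketch of a tabular Q-learning update. It is not part of this entry's data. The function name `q_update`, the state and action labels, and the `alpha` and `gamma` values are all hypothetical choices made for illustration. The point it shows is that the update consumes only an observed transition (state, action, reward, next state) and never queries a transition or reward model.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Apply one tabular Q-learning step from a single observed transition.

    No model of the environment's dynamics is consulted: the only inputs
    are the sampled (state, action, reward, next_state) tuple and the
    current table of cached value estimates.
    """
    # Greedy estimate of the value of the next state, read from the cache.
    best_next = max(q[(next_state, a)] for a in actions)
    # Temporal-difference target built from the observed reward alone.
    td_target = reward + gamma * best_next
    # Move the cached estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical two-state example: all estimates start at 0.0.
q = defaultdict(float)
actions = ["left", "right"]

first = q_update(q, "s0", "right", 1.0, "s1", actions)
second = q_update(q, "s1", "right", 0.0, "s0", actions)
```

A model-based method would, by contrast, need an explicit transition or reward function in order to plan ahead. Here the table `q` is the only learned structure.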