isModelFree
P132456
predicate
Indicates that the behavior, decision, or control process does not rely on an internal model of the environment’s dynamics, but instead uses direct value estimates or cached experiences.
Sample triples (1)
| Subject | Object |
|---|---|
| Q-learning | true ⓘ |