rewardSignal
P31983
predicate
Indicates that one entity provides a signal representing feedback or incentive (such as a reward or penalty) to guide another entity’s behavior or learning process.
Sample triples (2)
| Subject | Object |
|---|---|
| AlphaZero | game result win‑draw‑loss → |
| Atari deep Q-network | game score changes → |