Triple
T17694160
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Hindsight Policy Gradients |
E441117
|
entity |
| Predicate | instanceOf |
P0
|
FINISHED |
| Object | goal-conditioned reinforcement learning method |
C15711
|
CONCEPT FINISHED |
Provenance (1 batch)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69d8b9e940b081908b862bb0e6e89b0d |
elicitation | completed |
Created at: April 10, 2026, 10:04 a.m.