Triple

T824090
Position Surface form Disambiguated ID Type / Status
Subject OpenAI Baselines E17813 entity
Predicate implementsAlgorithm P8649 FINISHED
Object PPO2 E98479 NE FINISHED
Description: PPO2 is the GPU-enabled implementation of the Proximal Policy Optimization (PPO) reinforcement learning algorithm in OpenAI Baselines, designed for stable and efficient policy gradient training in continuous and discrete control tasks.

Provenance (5 batches)

Stage Batch ID Job type Status
creating batch_69a4937c9c188190aaa216f6b466f452 elicitation completed
NER batch_69a4b2b503d48190bd4f33548a22d5fe ner completed
NED1 batch_69a76d93af548190818c14a370e0914a ned_source_triple completed
NED2 batch_69a7860e656c8190a08a9999662ba1f1 ned_description completed
NEDg batch_69a781f5536c81908175d58b6b75adba nedg completed
Created at: March 1, 2026, 7:38 p.m.