Triple

T4586023
Position Surface form Disambiguated ID Type / Status
Subject Double DQN E101969 entity
Predicate basedOn P98 FINISHED
Object Q-learning
Q-learning is a model-free reinforcement learning algorithm that learns an action-value function to optimize decision-making by estimating the expected cumulative reward for each state-action pair.
E455376 NE FINISHED

Provenance (5 batches)

Stage Batch ID Job type Status
creating batch_69bd43d4ce208190b53158c882b222e3 elicitation completed
NER batch_69bd5906a43c81908fb11bf8f94be122 ner completed
NED1 batch_69bde0aa114881909fe446bf86c675e7 ned_source_triple completed
NED2 batch_69bde1b2efb48190a5ab83fa6c257df2 ned_description completed
NEDg batch_69bde14843148190a0b5fa0ad1d805d9 nedg completed
Created at: March 20, 2026, 1:10 p.m.