Triple
T18269630
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | HDX |
E437572
|
entity |
| Predicate | usesSoftware |
P10387
|
FINISHED |
| Object | CKAN |
—
|
NE NERFINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: CKAN | Statement: [HDX, usesSoftware, CKAN]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: CKAN Context triple: [HDX, usesSoftware, CKAN]
-
A.
CKAN
chosen
CKAN is an open-source data management system widely used by governments and organizations to publish, catalog, and share open data.
-
B.
Open Data Index
Open Data Index is a global initiative that evaluates and ranks the openness and accessibility of government data across countries.
-
C.
ONS Open Geography portal
The ONS Open Geography portal is an online platform providing access to official UK geographic data, boundaries, and related statistical geospatial resources published by the Office for National Statistics.
-
D.
Data.gov portal
The Data.gov portal is the U.S. government’s central online repository for accessing and downloading open data sets from federal agencies.
-
E.
Dataverse
Dataverse is Microsoft's cloud-based data platform that securely stores, manages, and structures business data for use across Power Platform applications and services.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (2 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d8b913351c8190932b6a426de04b41 |
completed | April 10, 2026, 8:47 a.m. |
| NER | Named-entity recognition | batch_69e4ff7d4f88819084123ed6c9e7e5b8 |
completed | April 19, 2026, 4:14 p.m. |
Created at: April 10, 2026, 10:34 a.m.