Triple
T17422517
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Jowzjan Province |
E423650
|
entity |
| Predicate | hasDistrict |
P459
|
FINISHED |
| Object | Sheberghan District |
—
|
NE NERFINISHED |
How this triple was built (3 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Sheberghan District | Statement: [Jowzjan Province, hasDistrict, Sheberghan District]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Sheberghan District Context triple: [Jowzjan Province, hasDistrict, Sheberghan District]
-
A.
Khogyani District
Khogyani District is an administrative district in eastern Afghanistan known for its largely rural Pashtun population and its location within Nangarhar Province near the Pakistan border.
-
B.
Nushki District
Nushki District is an administrative district in the Balochistan province of Pakistan, known for its arid landscape and strategic location near the Afghan border.
-
C.
Mirzaka District
Mirzaka District is an administrative district located within Paktia Province in eastern Afghanistan.
-
D.
Ghorak District
Ghorak District is a rural administrative district in southern Afghanistan, located within Kandahar Province and known for its mountainous terrain and security challenges.
-
E.
Rusafa District
Rusafa District is a central administrative area of Baghdad, Iraq, known for encompassing key cultural and historical landmarks, including major monuments and public institutions.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Sheberghan District Target entity description: Sheberghan District is an administrative district in northern Afghanistan centered around the city of Sheberghan, a regional hub for trade and agriculture.
-
A.
Khogyani District
Khogyani District is an administrative district in eastern Afghanistan known for its largely rural Pashtun population and its location within Nangarhar Province near the Pakistan border.
-
B.
Nushki District
Nushki District is an administrative district in the Balochistan province of Pakistan, known for its arid landscape and strategic location near the Afghan border.
-
C.
Mirzaka District
Mirzaka District is an administrative district located within Paktia Province in eastern Afghanistan.
-
D.
Ghorak District
Ghorak District is a rural administrative district in southern Afghanistan, located within Kandahar Province and known for its mountainous terrain and security challenges.
-
E.
Rusafa District
Rusafa District is a central administrative area of Baghdad, Iraq, known for encompassing key cultural and historical landmarks, including major monuments and public institutions.
- F. None of above. chosen
Provenance (2 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d889d88b6081908bada047f5b3ba51 |
completed | April 10, 2026, 5:25 a.m. |
| NER | Named-entity recognition | batch_69e44237f2cc819083ca0e7e00d828fb |
completed | April 19, 2026, 2:47 a.m. |
Created at: April 10, 2026, 5:46 a.m.