Triple
T9577433
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Galoli language |
E231079
|
entity |
| Predicate | primaryRegion |
P1103
|
FINISHED |
| Object | Dili District |
E672566
|
NE FINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Dili District | Statement: [Galoli language, primaryRegion, Dili District]
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Dili District Context triple: [Galoli language, primaryRegion, Dili District]
-
A.
Dili district
chosen
Dili district was the former administrative designation for the area now organized as Dili municipality, which includes East Timor’s capital city.
-
B.
Butaritari District
Butaritari District is an administrative division in Kiribati encompassing the island and surrounding area of Butaritari in the Gilbert Islands.
-
C.
Tembuku District
Tembuku District is an administrative district in the regency of Bangli on the island of Bali, Indonesia.
-
D.
Achin District
Achin District is an administrative district in eastern Afghanistan known for its mountainous terrain and history of militant activity.
-
E.
Awaran District
Awaran District is a sparsely populated and underdeveloped administrative district in southwestern Balochistan, Pakistan, known for its rugged terrain and recurring seismic activity.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69ca848091c48190bc313d6620d09555 |
elicitation | completed |
| NER | batch_69cd99ad7d108190a0b8c975351ea727 |
ner | completed |
| NED1 | batch_69d17907de488190be97e58b05b6c6f2 |
ned_source_triple | completed |
Created at: March 30, 2026, 8:05 p.m.