Triple

T9577433
Position | Surface form | Disambiguated ID | Type / Status
Subject | Galoli language | E231079 | entity
Predicate | primaryRegion | P1103 | FINISHED
Object | Dili District | E672566 | NE FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity. This step produced the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Dili District | Statement: [Galoli language, primaryRegion, Dili District]
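
The instruction asks for a JSON object with `phrase` and `is_ne`. A minimal sketch of how such a reply could be checked before use follows. The function name `parse_ner_response` and the example reply are hypothetical; the model's actual reply is not recorded here, and the example value is only chosen to agree with the NE type listed for the object.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a reply of the form {"phrase": str, "is_ne": bool}.

    Returns the is_ne flag; raises ValueError if the reply is malformed
    or refers to a different phrase than the one that was sent.
    """
    data = json.loads(raw)  # json.JSONDecodeError is a subclass of ValueError
    if not isinstance(data, dict):
        raise ValueError("reply is not a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    if phrase != expected_phrase:
        raise ValueError("reply refers to a different phrase")
    return is_ne


# Hypothetical reply, assumed for illustration only.
example_reply = '{"phrase": "Dili District", "is_ne": true}'
result = parse_ner_response(example_reply, "Dili District")
```

Checking the echoed `phrase` against the input is one way to catch replies that were matched to the wrong request in a batch; whether the original pipeline performs this check is not stated.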

Disambiguation candidates (1 decision)

The exact options shown to the model at each disambiguation step, with the chosen option marked. These options are the evidence behind this triple's disambiguated IDs.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Dili District
Context triple: [Galoli language, primaryRegion, Dili District]
  • A. Dili district (chosen)
    Dili district was the former administrative designation for the area now organized as Dili municipality, which includes East Timor’s capital city.
  • B. Butaritari District
    Butaritari District is an administrative division in Kiribati encompassing the island and surrounding area of Butaritari in the Gilbert Islands.
  • C. Tembuku District
    Tembuku District is an administrative district in the regency of Bangli on the island of Bali, Indonesia.
  • D. Achin District
    Achin District is an administrative district in eastern Afghanistan known for its mountainous terrain and history of militant activity.
  • E. Awaran District
    Awaran District is a sparsely populated and underdeveloped administrative district in southwestern Balochistan, Pakistan, known for its rugged terrain and recurring seismic activity.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
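
The lettered layout above (candidates first, then two fallback options) can be sketched as a small mapping from letters to labels. This is an illustrative reconstruction, not the pipeline's own code: the helper names `build_options` and `resolve_choice` are hypothetical, and treating both fallback options as "no entity selected" is an assumption.

```python
from string import ascii_uppercase

# Fallback options, copied from the list shown to the model.
NONE_OPTION = "None of above."
UNSURE_OPTION = (
    "Unsure - the case is ambiguous/there is not enough information to decide."
)


def build_options(candidates):
    """Assign letters A, B, C, ... to the candidates, then to the two fallbacks."""
    labels = list(candidates) + [NONE_OPTION, UNSURE_OPTION]
    return dict(zip(ascii_uppercase, labels))


def resolve_choice(options, letter):
    """Return the chosen candidate label, or None if a fallback was picked."""
    label = options[letter]
    if label in (NONE_OPTION, UNSURE_OPTION):
        return None
    return label


candidates = [
    "Dili district",
    "Butaritari District",
    "Tembuku District",
    "Achin District",
    "Awaran District",
]
options = build_options(candidates)
chosen = resolve_choice(options, "A")  # the option marked as chosen above
```

With five candidates the fallbacks land on F and G, matching the list above.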

Provenance (3 batches)

Stage | Batch ID | Job type | Status
creating | batch_69ca848091c48190bc313d6620d09555 | elicitation | completed
NER | batch_69cd99ad7d108190a0b8c975351ea727 | ner | completed
NED1 | batch_69d17907de488190be97e58b05b6c6f2 | ned_source_triple | completed
Created at: March 30, 2026, 8:05 p.m.