Triple

T7518756
Position Surface form Disambiguated ID Type / Status
Subject Sawu language E177711 entity
Predicate closelyRelatedTo P37 FINISHED
Object Dhao language E231083 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Dhao language | Statement: [Sawu language, closelyRelatedTo, Dhao language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Dhao language
Context triple: [Sawu language, closelyRelatedTo, Dhao language]
  • A. Dhanka language
    Dhanka is an Indo-Aryan tribal language variety considered a dialect within the Bhili language group of western India.
  • B. Thadou language
    Thadou language is a Kuki-Chin language spoken primarily by the Thadou people in northeastern India and neighboring regions.
  • C. Dawan language chosen
    The Dawan language is an Austronesian language spoken primarily in West Timor, Indonesia, by the Atoni people.
  • D. Dawro language
    The Dawro language is an Omotic language of southwestern Ethiopia spoken by the Dawro people and closely related to Wolaytta.
  • E. Lhao Vo language
    Lhao Vo is a lesser-known Tibeto-Burman language spoken by an ethnic minority community in parts of Northeast India and nearby regions.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c69f2891148190a484f3b8222c6f1b completed March 27, 2026, 3:15 p.m.
NER Named-entity recognition batch_69c6f5f850c081909e697219071293fc completed March 27, 2026, 9:26 p.m.
NED1 Entity disambiguation (via context triple) batch_69c8462610b481909fa74023852b0154 completed March 28, 2026, 9:20 p.m.
Created at: March 27, 2026, 3:46 p.m.