Triple

T21473564
Position Surface form Disambiguated ID Type / Status
Subject Satawan dialect E529793 entity
Predicate isVarietyOf P2074 FINISHED
Object Mortlockese language NE NERFINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Mortlockese language | Statement: [Satawan dialect, isVarietyOf, Mortlockese language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Mortlockese language
Context triple: [Satawan dialect, isVarietyOf, Mortlockese language]
  • A. Mortlockese language chosen
    Mortlockese is an Austronesian language of the Chuukic branch spoken primarily in the Mortlock Islands of Chuuk State in the Federated States of Micronesia.
  • B. Aneityum language
    The Aneityum language is an Oceanic Austronesian language spoken by the indigenous people of Aneityum Island in Vanuatu.
  • C. Saraveca language
    The Saraveca language is an extinct Arawakan language once spoken in Bolivia, known from very limited historical documentation.
  • D. Lo-Toga language
    The Lo-Toga language is an Oceanic language spoken on the Torres Islands of northern Vanuatu, closely related to the neighboring Hiw language and known for its highly conservative phonology and grammar.
  • E. Camorta language
    The Camorta language is an Austroasiatic language spoken by the Nicobarese people on Camorta Island in India’s Nicobar Islands.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (2 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69e0c459acb481909bb6ee452a0045c7 completed April 16, 2026, 11:13 a.m.
NER Named-entity recognition batch_69e9ea156dac819087c4594d022d3df6 completed April 23, 2026, 9:44 a.m.
Created at: April 16, 2026, 6:19 p.m.