Triple

T6510207
Position Surface form Disambiguated ID Type / Status
Subject Carolinean people E150107 entity
Predicate usesLanguage P238 FINISHED
Object Sonsorolese language E440258 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Sonsorolese language | Statement: [Carolinean people, usesLanguage, Sonsorolese language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Sonsorolese language
Context triple: [Carolinean people, usesLanguage, Sonsorolese language]
  • A. Sonsorolese language chosen
    The Sonsorolese language is a Micronesian language spoken by the inhabitants of Sonsorol and nearby atolls in Palau, closely related to other Carolinian languages.
  • B. Saho language
    The Saho language is an Afroasiatic Cushitic language spoken primarily by the Saho people in Eritrea and northern Ethiopia.
  • C. Siuslaw language
    The Siuslaw language is an extinct Native American language once spoken along the central Oregon coast, often classified within the proposed Penutian language family.
  • D. Agutaynen language
    Agutaynen is an Austronesian language spoken by the Agutaynen people of Palawan in the Philippines.
  • E. Sanglechi language
    The Sanglechi language is an Eastern Iranian language spoken by a small community in the Sanglech Valley region of Afghanistan and Tajikistan.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c687ef291081909d437f035eef1cda completed March 27, 2026, 1:36 p.m.
NER Named-entity recognition batch_69c69f398f10819096342f3646cefcc2 completed March 27, 2026, 3:16 p.m.
NED1 Entity disambiguation (via context triple) batch_69c6cb5dd5b88190b0928b44ebc91609 completed March 27, 2026, 6:24 p.m.
Created at: March 27, 2026, 1:43 p.m.