Triple

T10697343
Position Surface form Disambiguated ID Type / Status
Subject Murle language E252178 entity
Predicate neighboringLanguage P16383 FINISHED
Object Toposa language E249869 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Toposa language | Statement: [Murle language, neighboringLanguage, Toposa language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Toposa language
Context triple: [Murle language, neighboringLanguage, Toposa language]
  • A. Toposa language chosen
    The Toposa language is an Eastern Nilotic language spoken primarily by the Toposa people of South Sudan and neighboring regions.
  • B. Curripaco language
    The Curripaco language is an Arawakan language spoken by the Curripaco people of the Northwest Amazon region in Brazil, Colombia, and Venezuela.
  • C. Opata language
    The Opata language is an extinct Uto-Aztecan language once spoken by the Opata people of northern Mexico, particularly in the present-day state of Sonora.
  • D. Patamona language
    The Patamona language is an indigenous Cariban language spoken by the Patamona people of the Guiana Highlands in Guyana and northern Brazil.
  • E. Teke-Kukuya language
    The Teke-Kukuya language is a Bantu language spoken by the Teke-Kukuya people in the Republic of the Congo and neighboring regions of Central Africa.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d6aa5cbabc8190973e683950d89faf completed April 8, 2026, 7:19 p.m.
NER Named-entity recognition batch_69d6fd89390c8190969ab2b4a79d5818 completed April 9, 2026, 1:14 a.m.
NED1 Entity disambiguation (via context triple) batch_69d998e447e0819098e839e9e121a21f completed April 11, 2026, 12:42 a.m.
Created at: April 8, 2026, 9:12 p.m.