Triple

T11458604
Position Surface form Disambiguated ID Type / Status
Subject Shabo people E271591 entity
Predicate endangeredLanguageSpeakerOf P99674 FINISHED
Object Shabo language E247404 NE FINISHED

How this triple was built (3 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Shabo language | Statement: [Shabo people, endangeredLanguageSpeakerOf, Shabo language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Shabo language
Context triple: [Shabo people, endangeredLanguageSpeakerOf, Shabo language]
  • A. Shabo language chosen
    Shabo language is a highly endangered and poorly documented language spoken by a small community in southwestern Ethiopia, often considered a linguistic isolate with uncertain genetic affiliation.
  • B. Shekkacho language
    The Shekkacho language is an Afroasiatic Omotic language spoken primarily by the Shekka people in the Sheka Zone of southwestern Ethiopia.
  • C. Sabaot language
    The Sabaot language is a Nilotic language spoken by the Sabaot people of western Kenya and eastern Uganda, closely associated with the Kalenjin language cluster.
  • D. Shawiya language
    The Shawiya language is a Berber (Amazigh) language spoken primarily by the Shawiya people of the Aurès Mountains and surrounding regions in northeastern Algeria.
  • E. Logba language
    The Logba language is a Niger-Congo language spoken by the Logba people of southeastern Ghana.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: endangeredLanguageSpeakerOf
Context triple: [Shabo people, endangeredLanguageSpeakerOf, Shabo language]
  • A. endangeredLanguageProjectID
    Indicates that there is an associated project identifier specifically for work or documentation related to an endangered language.
  • B. lastNativeSpeakersDiedOut
    Indicates that the final remaining native speakers of a language or dialect have died, resulting in the loss of native speech for that language.
  • C. hasNativeSpeakers
    Indicates that a language or dialect is spoken as a first language by one or more people or populations.
  • D. includesEndangeredLanguages
    Indicates that the subject contains, encompasses, or otherwise involves one or more languages classified as endangered.
  • E. languageEndangermentStatus
    Indicates the degree to which a language is at risk of falling out of use or becoming extinct.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d6aadff8888190a13f253f0d460874 completed April 8, 2026, 7:22 p.m.
NER Named-entity recognition batch_69d822f2138081909408c7916cef99c9 completed April 9, 2026, 10:06 p.m.
NED1 Entity disambiguation (via context triple) batch_69e6040733648190a10f9553b3ac87a7 completed April 20, 2026, 10:46 a.m.
PD Predicate disambiguation batch_69d80867ff248190bb157fa9e355353b completed April 9, 2026, 8:13 p.m.
PDg Predicate description generation batch_69d822ef46988190a1c360da4ee14fef completed April 9, 2026, 10:06 p.m.
Created at: April 8, 2026, 9:35 p.m.