Triple

T7866939
Position Surface form Disambiguated ID Type / Status
Subject Urambo District E182639 entity
Predicate languageUsed P238 FINISHED
Object Swahili E2738 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Swahili | Statement: [Urambo District, languageUsed, Swahili]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Swahili
Context triple: [Urambo District, languageUsed, Swahili]
  • A. Swahili language chosen
    Swahili is a major Bantu language widely spoken in East and Central Africa, serving as a regional lingua franca and an official language in several countries including Tanzania and Kenya.
  • B. Gikuyu
    Gikuyu is an alternative name for the Kikuyu, the largest ethnic group in Kenya known for their Bantu language and significant cultural and political influence in the country.
  • C. Luganda
    Luganda is a major Bantu language spoken primarily in Uganda, serving as a key lingua franca and cultural language of the Baganda people.
  • D. Chichewa
    Chichewa is a major Bantu language spoken primarily in Malawi and neighboring countries, serving as a national and widely used lingua franca in the region.
  • E. Tumbuka
    Tumbuka is a Bantu language spoken primarily in northern Malawi and parts of Zambia and Tanzania.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69ca82894d9081908a832bfce71a4714 completed March 30, 2026, 2:02 p.m.
NER Named-entity recognition batch_69cb38464274819080f182b53783fa84 completed March 31, 2026, 2:58 a.m.
NED1 Entity disambiguation (via context triple) batch_69cbdf6fb33881908cf7bd68915aa6b4 completed March 31, 2026, 2:51 p.m.
Created at: March 30, 2026, 4:54 p.m.