Triple

T8193913
Position Surface form Disambiguated ID Type / Status
Subject Central Province, Kenya E191379 entity
Predicate language P15 FINISHED
Object Gikuyu E267659 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Gikuyu | Statement: [Central Province, Kenya, language, Gikuyu]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Gikuyu
Context triple: [Central Province, Kenya, language, Gikuyu]
  • A. Gikuyu chosen
    Gikuyu is an alternative name for the Kikuyu, the largest ethnic group in Kenya known for their Bantu language and significant cultural and political influence in the country.
  • B. Tumbuka
    Tumbuka is a Bantu language spoken primarily in northern Malawi and parts of Zambia and Tanzania.
  • C. Swahili people
    The Swahili people are a Bantu ethnic group native to the East African coast, historically known as maritime traders and cultural intermediaries blending African, Arab, and Persian influences.
  • D. Swahili language
    Swahili is a major Bantu language widely spoken in East and Central Africa, serving as a regional lingua franca and an official language in several countries including Tanzania and Kenya.
  • E. Kikongo
    Kikongo is a Bantu language widely spoken in Central Africa, particularly in the western regions of the Democratic Republic of the Congo and neighboring countries.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69ca82c6e9548190a4c5ca14516e4417 completed March 30, 2026, 2:03 p.m.
NER Named-entity recognition batch_69cb5c1f02248190adbe56a7d6be3419 completed March 31, 2026, 5:31 a.m.
NED1 Entity disambiguation (via context triple) batch_69cceda1b22c8190acc1a2cd0fe36b70 completed April 1, 2026, 10:04 a.m.
Created at: March 30, 2026, 5:42 p.m.