Triple

T8112816
Position | Surface form | Disambiguated ID | Type / Status
Subject | Rukwa Region | E189396 | entity
Predicate | officialLanguage | P236 | FINISHED
Object | Swahili | E2738 | NE FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity. This step produced the object's NE type shown above.

Instruction
Given a phrase, classify whether it is an English named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Swahili | Statement: [Rukwa Region, officialLanguage, Swahili]
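
As an illustration of the request and response shapes described in the instruction, here is a minimal Python sketch. The function names `format_ner_input` and `parse_ner_response` are hypothetical, and the response string is an assumed example consistent with the NE type recorded for the object. The model's actual raw output is not shown on this page.

```python
import json


def format_ner_input(phrase, statement):
    """Render the input line in the 'Phrase: ... | Statement: [...]' layout."""
    return f"Phrase: {phrase} | Statement: [{', '.join(statement)}]"


def parse_ner_response(raw):
    """Parse a JSON response and check the two fields the instruction asks for."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("response must be a JSON object")
    if not isinstance(data.get("phrase"), str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(data.get("is_ne"), bool):
        raise ValueError("`is_ne` must be a boolean")
    return data["phrase"], data["is_ne"]


prompt_input = format_ner_input(
    "Swahili", ["Rukwa Region", "officialLanguage", "Swahili"]
)

# Assumed example response, not the model's recorded output.
example_response = '{"phrase": "Swahili", "is_ne": true}'
phrase, is_ne = parse_ner_response(example_response)
```

The strict `bool` check rejects answers such as `"true"` (a string) or `1`, which would otherwise pass silently as truthy values.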

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the chosen option marked. These options are the evidence behind this triple's disambiguated IDs.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Swahili
Context triple: [Rukwa Region, officialLanguage, Swahili]
  • A. Swahili language (chosen)
    Swahili is a major Bantu language widely spoken in East and Central Africa, serving as a regional lingua franca and an official language in several countries including Tanzania and Kenya.
  • B. Gikuyu
    Gikuyu is an alternative name for the Kikuyu, the largest ethnic group in Kenya known for their Bantu language and significant cultural and political influence in the country.
  • C. Luganda
    Luganda is a major Bantu language spoken primarily in Uganda, serving as a key lingua franca and cultural language of the Baganda people.
  • D. Chichewa
    Chichewa is a major Bantu language spoken primarily in Malawi and neighboring countries, serving as a national and widely used lingua franca in the region.
  • E. Tumbuka
    Tumbuka is a Bantu language spoken primarily in northern Malawi and parts of Zambia and Tanzania.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
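
The lettered layout above can be sketched in Python as follows. This is only an assumption about how such a list might be assembled and how a returned letter might be mapped back to a candidate. The helper names `letter_options` and `resolve_choice` are hypothetical and do not come from the pipeline itself.

```python
import string

# The two fallback options, worded as they appear in the list above.
FALLBACKS = [
    "None of above.",
    "Unsure - the case is ambiguous/there is not enough information to decide.",
]


def letter_options(candidates):
    """Assign A, B, C, ... to the candidates, then to the fallback options."""
    labels = list(candidates) + FALLBACKS
    if len(labels) > len(string.ascii_uppercase):
        raise ValueError("too many options for single-letter labels")
    return {string.ascii_uppercase[i]: label for i, label in enumerate(labels)}


def resolve_choice(options, letter):
    """Return the chosen candidate label, or None if a fallback was picked."""
    label = options[letter]
    return None if label in FALLBACKS else label


candidates = ["Swahili language", "Gikuyu", "Luganda", "Chichewa", "Tumbuka"]
options = letter_options(candidates)
chosen = resolve_choice(options, "A")
```

Returning `None` for options F and G keeps "no match" and "unsure" from being mistaken for an entity label downstream.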

Provenance (3 batches)

Stage | Batch ID | Job type | Status
creating | batch_69ca82baad008190ab2859712b9b1607 | elicitation | completed
NER | batch_69cb432d7dfc8190b9c980f32c7b4623 | ner | completed
NED1 | batch_69cc9433a5848190aac09a2589061b53 | ned_source_triple | completed
Created at: March 30, 2026, 5:32 p.m.