Triple
T8112816
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Rukwa Region |
E189396
|
entity |
| Predicate | officialLanguage |
P236
|
FINISHED |
| Object | Swahili |
E2738
|
NE FINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Swahili | Statement: [Rukwa Region, officialLanguage, Swahili]
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Swahili Context triple: [Rukwa Region, officialLanguage, Swahili]
-
A.
Swahili language
chosen
Swahili is a major Bantu language widely spoken in East and Central Africa, serving as a regional lingua franca and an official language in several countries including Tanzania and Kenya.
-
B.
Gikuyu
Gikuyu is an alternative name for the Kikuyu, the largest ethnic group in Kenya known for their Bantu language and significant cultural and political influence in the country.
-
C.
Luganda
Luganda is a major Bantu language spoken primarily in Uganda, serving as a key lingua franca and cultural language of the Baganda people.
-
D.
Chichewa
Chichewa is a major Bantu language spoken primarily in Malawi and neighboring countries, serving as a national and widely used lingua franca in the region.
-
E.
Tumbuka
Tumbuka is a Bantu language spoken primarily in northern Malawi and parts of Zambia and Tanzania.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69ca82baad008190ab2859712b9b1607 |
elicitation | completed |
| NER | batch_69cb432d7dfc8190b9c980f32c7b4623 |
ner | completed |
| NED1 | batch_69cc9433a5848190aac09a2589061b53 |
ned_source_triple | completed |
Created at: March 30, 2026, 5:32 p.m.