Triple
T10697343
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Murle language |
E252178
|
entity |
| Predicate | neighboringLanguage |
P16383
|
FINISHED |
| Object | Toposa language |
E249869
|
NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Toposa language | Statement: [Murle language, neighboringLanguage, Toposa language]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Toposa language Context triple: [Murle language, neighboringLanguage, Toposa language]
-
A.
Toposa language
chosen
The Toposa language is an Eastern Nilotic language spoken primarily by the Toposa people of South Sudan and neighboring regions.
-
B.
Curripaco language
The Curripaco language is an Arawakan language spoken by the Curripaco people of the Northwest Amazon region in Brazil, Colombia, and Venezuela.
-
C.
Opata language
The Opata language is an extinct Uto-Aztecan language once spoken by the Opata people of northern Mexico, particularly in the present-day state of Sonora.
-
D.
Patamona language
The Patamona language is an indigenous Cariban language spoken by the Patamona people of the Guiana Highlands in Guyana and northern Brazil.
-
E.
Teke-Kukuya language
The Teke-Kukuya language is a Bantu language spoken by the Teke-Kukuya people in the Republic of the Congo and neighboring regions of Central Africa.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d6aa5cbabc8190973e683950d89faf |
completed | April 8, 2026, 7:19 p.m. |
| NER | Named-entity recognition | batch_69d6fd89390c8190969ab2b4a79d5818 |
completed | April 9, 2026, 1:14 a.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69d998e447e0819098e839e9e121a21f |
completed | April 11, 2026, 12:42 a.m. |
Created at: April 8, 2026, 9:12 p.m.