Triple
T15213080
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Southern Mansi | E363565 | entity |
| Predicate | sharesFeaturesWith | P5696 | FINISHED |
| Object | Khanty language | E357085 | NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Khanty language | Statement: [Southern Mansi, sharesFeaturesWith, Khanty language]
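The instruction above asks for a JSON object with `phrase` and `is_ne`. A minimal sketch of how such a reply could be validated follows. The function name `parse_ner_response` and the example reply are hypothetical: the page does not show the raw model output, so the reply string here is illustrative only.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a NER reply and return its is_ne flag.

    Hypothetical helper: the real pipeline's parsing code is not shown
    on this page.
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    if phrase != expected_phrase:
        raise ValueError("reply refers to a different phrase")
    return is_ne


# Illustrative reply (an assumption, not copied from pipeline logs).
example_reply = '{"phrase": "Khanty language", "is_ne": true}'
is_named_entity = parse_ner_response(example_reply, "Khanty language")
```

Checking that `is_ne` is a real boolean rather than a string such as `"true"` keeps a malformed reply from being silently treated as a positive classification.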
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Khanty language
Context triple: [Southern Mansi, sharesFeaturesWith, Khanty language]
- A. Khanty language (chosen): The Khanty language is a Uralic language spoken by the Khanty people of western Siberia, closely related to Mansi and traditionally used in the Khanty-Mansi Autonomous Okrug of Russia.
- B. Enets language: Enets language is a critically endangered Samoyedic language of the Uralic family spoken by a small Indigenous community in northern Siberia, Russia.
- C. Nenets language: The Nenets language is a Uralic Samoyedic language spoken by the Nenets people of northern Arctic Russia.
- D. Selkup language: The Selkup language is a critically endangered Uralic (Samoyedic) language spoken by the indigenous Selkup people of western Siberia in Russia.
- E. Khakas language: Khakas language is a Turkic language spoken primarily by the Khakas people in the Republic of Khakassia in south-central Siberia, Russia.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
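The option list above pairs lettered candidates with two fixed fallback choices. A rough sketch of how such a list might be rendered and how a chosen letter might be mapped back to a candidate is given below. The names `build_options` and `resolve_choice` are hypothetical, the descriptions are shortened from the ones shown above, and the actual pipeline code is not part of this page.

```python
from string import ascii_uppercase

# Fallback options always appended after the candidates.
FALLBACKS = [
    "None of above.",
    "Unsure - the case is ambiguous/there is not enough information to decide.",
]


def build_options(candidates):
    """Render (label, description) pairs as lettered option lines."""
    lines = [
        f"{letter}. {label}: {description}"
        for letter, (label, description) in zip(ascii_uppercase, candidates)
    ]
    for offset, text in enumerate(FALLBACKS):
        lines.append(f"{ascii_uppercase[len(candidates) + offset]}. {text}")
    return lines


def resolve_choice(letter, candidates):
    """Return the chosen candidate label, or None for a fallback option."""
    key = letter.strip().upper()
    if len(key) != 1 or key not in ascii_uppercase:
        raise ValueError(f"not an option letter: {letter!r}")
    index = ascii_uppercase.index(key)
    if index < len(candidates):
        return candidates[index][0]
    if index < len(candidates) + len(FALLBACKS):
        return None  # "None of above" or "Unsure"
    raise ValueError(f"option {key} was not offered")


# Labels from the list above; descriptions abbreviated for the sketch.
candidates = [
    ("Khanty language", "Uralic language of western Siberia"),
    ("Enets language", "Samoyedic language of northern Siberia"),
    ("Nenets language", "Samoyedic language of northern Arctic Russia"),
    ("Selkup language", "Samoyedic language of western Siberia"),
    ("Khakas language", "Turkic language of south-central Siberia"),
]
options = build_options(candidates)
chosen = resolve_choice("A", candidates)
```

Returning `None` for the fallback letters lets the caller tell "no candidate matched" apart from an invalid reply, which raises instead.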
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d85a0b78bc8190b6e5ad51a2c4cfc5 | completed | April 10, 2026, 2:01 a.m. |
| NER | Named-entity recognition | batch_69e0076c9e2481909d7a464b2172f4bf | completed | April 15, 2026, 9:47 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69feef687bdc819090ec8a4e1af9f422 | completed | May 9, 2026, 8:25 a.m. |
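The note above says the object chain reads in chronological order. A small sketch of how that could be checked from the "When" column follows. The helper `parse_when` is hypothetical, and the parsing assumes the timestamp format shown in this table and an English locale for month names.

```python
from datetime import datetime


def parse_when(text):
    """Parse a timestamp such as 'April 15, 2026, 9:47 p.m.'."""
    # strptime's %p expects AM/PM, so normalise the dotted form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# Object-chain steps listed in the Provenance table, in pipeline order.
object_chain = [
    ("NER", "April 15, 2026, 9:47 p.m."),
    ("NED1", "May 9, 2026, 8:25 a.m."),
]

times = [parse_when(when) for _, when in object_chain]
# True when each step's batch ran no earlier than the previous one.
in_order = all(earlier <= later for earlier, later in zip(times, times[1:]))
```

Only the object chain is compared here, since the note states that predicate and elicitation batches can belong to a different wave.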
Created at: April 10, 2026, 3:11 a.m.