Triple

T15213080
Position Surface form Disambiguated ID Type / Status
Subject Southern Mansi E363565 entity
Predicate sharesFeaturesWith P5696 FINISHED
Object Khanty language E357085 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Khanty language | Statement: [Southern Mansi, sharesFeaturesWith, Khanty language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Khanty language
Context triple: [Southern Mansi, sharesFeaturesWith, Khanty language]
  • A. Khanty language chosen
    The Khanty language is a Uralic language spoken by the Khanty people of western Siberia, closely related to Mansi and traditionally used in the Khanty-Mansi Autonomous Okrug of Russia.
  • B. Enets language
    Enets language is a critically endangered Samoyedic language of the Uralic family spoken by a small Indigenous community in northern Siberia, Russia.
  • C. Nenets language
    The Nenets language is a Uralic Samoyedic language spoken by the Nenets people of northern Arctic Russia.
  • D. Selkup language
    The Selkup language is a critically endangered Uralic (Samoyedic) language spoken by the indigenous Selkup people of western Siberia in Russia.
  • E. Khakas language
    Khakas language is a Turkic language spoken primarily by the Khakas people in the Republic of Khakassia in south-central Siberia, Russia.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d85a0b78bc8190b6e5ad51a2c4cfc5 completed April 10, 2026, 2:01 a.m.
NER Named-entity recognition batch_69e0076c9e2481909d7a464b2172f4bf completed April 15, 2026, 9:47 p.m.
NED1 Entity disambiguation (via context triple) batch_69feef687bdc819090ec8a4e1af9f422 completed May 9, 2026, 8:25 a.m.
Created at: April 10, 2026, 3:11 a.m.