Triple

T5460233
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Tajik language | E122576 | entity |
| Predicate | historicallyPartOf | P5057 | FINISHED |
| Object | Persian language continuum | E3587 | NE, FINISHED |

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch and timestamp of each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Persian language continuum | Statement: [Tajik language, historicallyPartOf, Persian language continuum]
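The instruction above asks for a JSON object with two fields, `phrase` and `is_ne`. A minimal sketch of how such a reply could be checked is shown here. The function name `parse_ner_response` and the sample reply are hypothetical; the page does not show the model's actual output or how the pipeline validates it.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a NER reply and return its `is_ne` flag.

    Hypothetical helper: only the two field names come from the
    instruction text; the checks themselves are assumptions.
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")
    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    if phrase != expected_phrase:
        raise ValueError("reply refers to a different phrase")
    return is_ne


# Illustrative reply only, not a recorded model output.
sample_reply = '{"phrase": "Persian language continuum", "is_ne": true}'
result = parse_ner_response(sample_reply, "Persian language continuum")
```

Rejecting a reply whose `phrase` differs from the input is a design choice made for this sketch; it guards against replies being matched to the wrong item when many phrases are processed in one batch.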
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Persian language continuum
Context triple: [Tajik language, historicallyPartOf, Persian language continuum]
  • A. Arabic language continuum
    The Arabic language continuum is the collection of closely related, often mutually intelligible Arabic dialects and varieties spoken across the Arab world, ranging from colloquial regional forms to standardized literary Arabic.
  • B. Iranian languages
    Iranian languages are a branch of the Indo-Iranian family of Indo-European languages, historically spoken across Iran, Central Asia, and surrounding regions, including major languages such as Persian (Farsi), Pashto, and Kurdish.
  • C. Persian language (chosen)
    Persian language is a major modern Iranian language spoken primarily in Iran, Afghanistan, and Tajikistan, known for its rich literary tradition and historical influence across the Middle East and Central Asia.
  • D. Northeastern Iranian languages
    Northeastern Iranian languages are a branch of the Iranian language family spoken historically and presently in parts of Central Asia and northeastern Iran, including languages such as Sogdian, Yaghnobi, and some modern Pamir languages.
  • E. Southeastern Iranian languages
    Southeastern Iranian languages are a subgroup of the Iranian branch of the Indo-Iranian language family, historically spoken in eastern Iran and surrounding regions and including languages such as Pashto and Ossetian.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
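The option list above can be modeled as a mapping from letter to candidate, with the two fallback options (F and G) yielding no entity. This is a sketch under assumptions: the names `CANDIDATES`, `NO_MATCH`, and `resolve_choice` are hypothetical, and the page does not document how the pipeline treats an F or G answer.

```python
from typing import Optional

# Candidate labels copied from the options shown for this step.
CANDIDATES = {
    "A": "Arabic language continuum",
    "B": "Iranian languages",
    "C": "Persian language",
    "D": "Northeastern Iranian languages",
    "E": "Southeastern Iranian languages",
}

# "None of above" and "Unsure" select no candidate (assumed handling).
NO_MATCH = {"F", "G"}


def resolve_choice(letter: str) -> Optional[str]:
    """Map the chosen option letter to a candidate label, or None."""
    key = letter.strip().upper()
    if key in CANDIDATES:
        return CANDIDATES[key]
    if key in NO_MATCH:
        return None
    raise ValueError(f"unknown option letter: {letter!r}")


picked = resolve_choice("C")
```

For this triple the highlighted pick is option C, so `picked` holds the label "Persian language".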

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69bd46424248819085282ddf50a565f3 | completed | March 20, 2026, 1:06 p.m. |
| NER | Named-entity recognition | batch_69bd9200a3988190a06f253f99e68224 | completed | March 20, 2026, 6:29 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69bf414c39a4819098f2862f3c4594c0 | completed | March 22, 2026, 1:09 a.m. |
Created at: March 20, 2026, 2:08 p.m.
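To check that the batch timestamps in the provenance table read in order, they can be parsed and compared. The sketch below assumes every timestamp uses the form shown on this page (month name, day, year, then `h:mm a.m.` or `h:mm p.m.`) and an English locale; the helper name `parse_when` is hypothetical, and no time zone is stated on the page, so the values are treated as naive local times.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse a timestamp such as 'March 20, 2026, 1:06 p.m.'.

    Hypothetical helper: handles only the format shown on this page.
    """
    # strptime's %p expects AM/PM, so normalize the dotted form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# (step, when) pairs copied from the provenance table.
rows = [
    ("creating", "March 20, 2026, 1:06 p.m."),
    ("NER", "March 20, 2026, 6:29 p.m."),
    ("NED1", "March 22, 2026, 1:09 a.m."),
]

times = [parse_when(when) for _, when in rows]
in_order = times == sorted(times)
```

Other renderings that the same page style might produce, such as a time without minutes, are not covered by this format string and would raise `ValueError`.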