Triple
T5460233
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Tajik language | E122576 | entity |
| Predicate | historicallyPartOf | P5057 | FINISHED |
| Object | Persian language continuum | E3587 | NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Persian language continuum | Statement: [Tajik language, historicallyPartOf, Persian language continuum]
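The instruction above asks for a JSON object with `phrase` (string) and `is_ne` (boolean). A minimal sketch of how such a response could be validated is shown here. The names `NerResult` and `parse_ner_response` are hypothetical, and the example response is illustrative only: this page does not record the model's raw output, only the resulting "NE" status in the table.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class NerResult:
    """Parsed form of the NER step's JSON answer (hypothetical helper type)."""
    phrase: str
    is_ne: bool


def parse_ner_response(raw: str) -> NerResult:
    """Parse and type-check a response with the two fields the instruction names."""
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    if not isinstance(phrase, str):
        raise ValueError("phrase must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("is_ne must be a boolean")
    return NerResult(phrase=phrase, is_ne=is_ne)


# Illustrative response only; the actual model output is not shown on this page.
example = '{"phrase": "Persian language continuum", "is_ne": true}'
result = parse_ner_response(example)
```

Checking the type of `is_ne` explicitly guards against answers such as `"yes"` or `1`, which are valid JSON but do not match the requested boolean.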
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Persian language continuum | Context triple: [Tajik language, historicallyPartOf, Persian language continuum]
- A. Arabic language continuum
  The Arabic language continuum is the collection of closely related, often mutually intelligible Arabic dialects and varieties spoken across the Arab world, ranging from colloquial regional forms to standardized literary Arabic.
- B. Iranian languages
  Iranian languages are a branch of the Indo-Iranian family of Indo-European languages, historically spoken across Iran, Central Asia, and surrounding regions, including major languages such as Persian (Farsi), Pashto, and Kurdish.
- C. Persian language (chosen)
  Persian language is a major modern Iranian language spoken primarily in Iran, Afghanistan, and Tajikistan, known for its rich literary tradition and historical influence across the Middle East and Central Asia.
- D. Northeastern Iranian languages
  Northeastern Iranian languages are a branch of the Iranian language family spoken historically and presently in parts of Central Asia and northeastern Iran, including languages such as Sogdian, Yaghnobi, and some modern Pamir languages.
- E. Southeastern Iranian languages
  Southeastern Iranian languages are a subgroup of the Iranian branch of the Indo-Iranian language family, historically spoken in eastern Iran and surrounding regions and including languages such as Pashto and Ossetian.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
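The option list above follows a simple pattern: candidates are lettered in order, and two fallback answers ("None of above." and "Unsure") take the next two letters. A minimal sketch of that labelling is given here, assuming the letters are assigned positionally. The function names are hypothetical and this is not the pipeline's actual code.

```python
from string import ascii_uppercase
from typing import Optional

# Fallback answers copied from the option list above.
FALLBACKS = (
    "None of above.",
    "Unsure - the case is ambiguous/there is not enough information to decide.",
)


def label_options(candidates: list[str]) -> dict[str, str]:
    """Assign A, B, C, ... to the candidates, then to the two fallbacks."""
    texts = list(candidates) + list(FALLBACKS)
    return dict(zip(ascii_uppercase, texts))


def resolve_choice(candidates: list[str], letter: str) -> Optional[str]:
    """Return the chosen candidate name, or None if a fallback was picked."""
    options = label_options(candidates)
    key = letter.strip().upper()
    if key not in options:
        raise ValueError(f"unknown option letter: {letter!r}")
    position = list(options).index(key)
    return candidates[position] if position < len(candidates) else None


candidates = [
    "Arabic language continuum",
    "Iranian languages",
    "Persian language",
    "Northeastern Iranian languages",
    "Southeastern Iranian languages",
]
chosen = resolve_choice(candidates, "C")
```

With the five candidates shown on this page, `C` resolves to "Persian language", while `F` and `G` resolve to `None`, so a caller can tell a real pick from a fallback.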
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69bd46424248819085282ddf50a565f3 | completed | March 20, 2026, 1:06 p.m. |
| NER | Named-entity recognition | batch_69bd9200a3988190a06f253f99e68224 | completed | March 20, 2026, 6:29 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69bf414c39a4819098f2862f3c4594c0 | completed | March 22, 2026, 1:09 a.m. |
Created at: March 20, 2026, 2:08 p.m.
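The Provenance note says the object chain reads in order. A small sketch of how that could be checked from the displayed timestamps follows, assuming the "March 20, 2026, 6:29 p.m." display format and an English locale for month names. The helper names are hypothetical.

```python
from datetime import datetime


def parse_batch_time(text: str) -> datetime:
    """Parse a display timestamp such as 'March 20, 2026, 6:29 p.m.'."""
    # strptime's %p expects AM/PM, so normalise the dotted lowercase form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# Object-chain steps and batch times as listed in the Provenance table.
OBJECT_CHAIN = [
    ("NER", "March 20, 2026, 6:29 p.m."),
    ("NED1", "March 22, 2026, 1:09 a.m."),
]


def is_chronological(steps) -> bool:
    """True if each step's batch time is not earlier than the previous one."""
    times = [parse_batch_time(when) for _, when in steps]
    return all(earlier <= later for earlier, later in zip(times, times[1:]))


in_order = is_chronological(OBJECT_CHAIN)
```

Because the timestamps are batch-level, this only confirms the order of the batches, not the order in which individual items were processed within them.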