Triple
T4976238
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Judeo-Tajik |
E111770
|
entity |
| Predicate | phonologyInfluencedBy |
P50775
|
FINISHED |
| Object | Tajik phonology |
E122576
|
NE FINISHED |
How this triple was built (3 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Tajik phonology | Statement: [Judeo-Tajik, phonologyInfluencedBy, Tajik phonology]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Tajik phonology Context triple: [Judeo-Tajik, phonologyInfluencedBy, Tajik phonology]
-
A.
Tajik language
chosen
The Tajik language is a variety of Persian spoken primarily in Tajikistan and written in the Cyrillic script.
-
B.
Judeo-Tajik
Judeo-Tajik is a Jewish ethnolect of the Tajik (Persian) language historically spoken by Central Asian Bukharan Jews, written in Hebrew script and enriched with Hebrew and Aramaic loanwords.
-
C.
Pamir languages
The Pamir languages are a group of Eastern Iranian languages spoken primarily in the mountainous Pamir region of Tajikistan, Afghanistan, and surrounding areas.
-
D.
Karakalpak language
The Karakalpak language is a Turkic language spoken primarily by the Karakalpak people in Karakalpakstan, an autonomous republic within Uzbekistan.
-
E.
Nuristani languages
Nuristani languages are a small, distinct group of Indo-Iranian languages spoken primarily in the remote Nuristan region of eastern Afghanistan.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
PD
Predicate disambiguation
gpt-5-mini-2025-08-07
Target predicate: phonologyInfluencedBy Context triple: [Judeo-Tajik, phonologyInfluencedBy, Tajik phonology]
-
A.
hasPhonologicalBasisFor
Indicates that one entity serves as the phonological source, motivation, or foundation for another entity.
-
B.
phonologyRelation
chosen
Indicates a relationship between linguistic elements based on their phonological properties, such as sound patterns, features, or structures.
-
C.
hasPhonologicalParameters
Indicates that an entity is associated with specific phonological features or parameters that characterize its sound structure.
-
D.
hasOwnPhonology
Indicates that an entity possesses its own distinct phonological system or set of sound patterns, separate from those of other entities.
-
E.
hasPhonologicalChange
Indicates a relationship where one linguistic form undergoes a change in its sound structure relative to another form or earlier state.
- F. None of above.
Provenance (4 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69bd441a0eb481908050fa4273b19eae |
completed | March 20, 2026, 12:56 p.m. |
| NER | Named-entity recognition | batch_69bd730a7590819088ab8d49c5c88c2f |
completed | March 20, 2026, 4:17 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69be8a0634b48190acb4a1834f5647cf |
completed | March 21, 2026, 12:07 p.m. |
| PD | Predicate disambiguation | batch_69bd7146e6e881908a55ab2756b631f6 |
completed | March 20, 2026, 4:09 p.m. |
Created at: March 20, 2026, 1:33 p.m.