Triple
T21201135
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Tabari |
E522455
|
entity |
| Predicate | subgroup |
P10
|
FINISHED |
| Object | Caspian languages |
—
|
NE NERFINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Caspian languages | Statement: [Tabari, subgroup, Caspian languages]
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Caspian languages Context triple: [Tabari, subgroup, Caspian languages]
-
A.
Caspian languages
chosen
Caspian languages are a group of closely related Northwestern Iranian languages spoken mainly along the southern coast of the Caspian Sea in northern Iran.
-
B.
Pamir languages
The Pamir languages are a group of Eastern Iranian languages spoken primarily in the mountainous Pamir region of Tajikistan, Afghanistan, and surrounding areas.
-
C.
Pashayi languages
The Pashayi languages are a small group of closely related Indo-Aryan languages spoken primarily by ethnic Pashayi communities in eastern Afghanistan.
-
D.
Northeast Caucasian languages
The Northeast Caucasian languages are a diverse family of languages spoken primarily in the northeastern Caucasus region, including languages such as Chechen, Avar, and Lezgian.
-
E.
Kermanic languages
Kermanic languages are a subgroup of Northwestern Iranian languages spoken primarily in and around Iran’s Kerman region, characterized by distinct phonological and lexical features within the Iranian branch of the Indo-Iranian language family.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (2 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69e0b5112d8881909510b2dcdc93106d |
elicitation | completed |
| NER | batch_69e73430c5a08190aeb6a62eec0f43a3 |
ner | completed |
Created at: April 16, 2026, 3:18 p.m.