Triple

T21201135
Position Surface form Disambiguated ID Type / Status
Subject Tabari E522455 entity
Predicate subgroup P10 FINISHED
Object Caspian languages NE NERFINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Caspian languages | Statement: [Tabari, subgroup, Caspian languages]

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Caspian languages
Context triple: [Tabari, subgroup, Caspian languages]
  • A. Caspian languages chosen
    Caspian languages are a group of closely related Northwestern Iranian languages spoken mainly along the southern coast of the Caspian Sea in northern Iran.
  • B. Pamir languages
    The Pamir languages are a group of Eastern Iranian languages spoken primarily in the mountainous Pamir region of Tajikistan, Afghanistan, and surrounding areas.
  • C. Pashayi languages
    The Pashayi languages are a small group of closely related Indo-Aryan languages spoken primarily by ethnic Pashayi communities in eastern Afghanistan.
  • D. Northeast Caucasian languages
    The Northeast Caucasian languages are a diverse family of languages spoken primarily in the northeastern Caucasus region, including languages such as Chechen, Avar, and Lezgian.
  • E. Kermanic languages
    Kermanic languages are a subgroup of Northwestern Iranian languages spoken primarily in and around Iran’s Kerman region, characterized by distinct phonological and lexical features within the Iranian branch of the Indo-Iranian language family.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (2 batches)

Stage Batch ID Job type Status
creating batch_69e0b5112d8881909510b2dcdc93106d elicitation completed
NER batch_69e73430c5a08190aeb6a62eec0f43a3 ner completed
Created at: April 16, 2026, 3:18 p.m.