Triple

T5244851
Position Surface form Disambiguated ID Type / Status
Subject Northwest Caucasian languages E118433 entity
Predicate partOf P40 FINISHED
Object Caucasian languages E57217 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Caucasian languages | Statement: [Northwest Caucasian languages, partOf, Caucasian languages]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Caucasian languages
Context triple: [Northwest Caucasian languages, partOf, Caucasian languages]
  • A. Northeast Caucasian languages
    The Northeast Caucasian languages are a diverse family of languages spoken primarily in the northeastern Caucasus region, including languages such as Chechen, Avar, and Lezgian.
  • B. Caucasian languages (areal grouping) chosen
    Caucasian languages (areal grouping) refers to a proposed geographical grouping of the diverse indigenous language families spoken in the Caucasus region, such as Kartvelian, Northwest Caucasian, and Northeast Caucasian, without implying a proven genetic relationship among them.
  • C. Balto-Slavic languages
    The Balto-Slavic languages are a branch of the Indo-European language family that includes the Baltic and Slavic languages, such as Russian, Polish, and Lithuanian.
  • D. Slavic languages
    Slavic languages are a branch of the Indo-European language family spoken primarily in Eastern and Central Europe, known for features such as rich consonant clusters, aspectual verb systems, and extensive case inflection.
  • E. Oghur languages
    Oghur languages are an early, now mostly extinct branch of the Turkic language family, historically spoken by steppe peoples such as the Bulgars and Khazars and represented today primarily by Chuvash.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69bd4468aacc8190a8196f71855cdf4f completed March 20, 2026, 12:58 p.m.
NER Named-entity recognition batch_69bd7b4fa0ec8190bce3da09aa768726 completed March 20, 2026, 4:52 p.m.
NED1 Entity disambiguation (via context triple) batch_69bef82f2738819080c5a0963f6dbbb0 completed March 21, 2026, 7:57 p.m.
Created at: March 20, 2026, 1:49 p.m.