Triple

T5189236
Position Surface form Disambiguated ID Type / Status
Subject Kumyk E117107 entity
Predicate influencedBy P9 FINISHED
Object Caucasian languages E57217 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Caucasian languages | Statement: [Kumyk, influencedBy, Caucasian languages]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Caucasian languages
Context triple: [Kumyk, influencedBy, Caucasian languages]
  • A. Northeast Caucasian languages
    The Northeast Caucasian languages are a diverse family of languages spoken primarily in the northeastern Caucasus region, including languages such as Chechen, Avar, and Lezgian.
  • B. Caucasian languages (areal grouping) chosen
    Caucasian languages (areal grouping) refers to a proposed geographical grouping of the diverse indigenous language families spoken in the Caucasus region, such as Kartvelian, Northwest Caucasian, and Northeast Caucasian, without implying a proven genetic relationship among them.
  • C. Balto-Slavic languages
    The Balto-Slavic languages are a branch of the Indo-European language family that includes the Baltic and Slavic languages, such as Russian, Polish, and Lithuanian.
  • D. Slavic languages
    Slavic languages are a branch of the Indo-European language family spoken primarily in Eastern and Central Europe, known for features such as rich consonant clusters, aspectual verb systems, and extensive case inflection.
  • E. Oghur languages
    Oghur languages are an early, now mostly extinct branch of the Turkic language family, historically spoken by steppe peoples such as the Bulgars and Khazars and represented today primarily by Chuvash.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69bd44620ff48190bcac01782107a397 completed March 20, 2026, 12:58 p.m.
NER Named-entity recognition batch_69bd79c732b48190af62dfffcbc5e3a6 completed March 20, 2026, 4:45 p.m.
NED1 Entity disambiguation (via context triple) batch_69bee08dfe648190a98b5a61f857d593 completed March 21, 2026, 6:16 p.m.
Created at: March 20, 2026, 1:46 p.m.