Triple

T6364692
Position | Surface form | Disambiguated ID | Type / Status
Subject | Eastern Indo-Aryan languages | E143195 | entity
Predicate | hasMajorLanguage | P207 | FINISHED
Object | Kurmali language | E407557 | NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch and timestamp of each step are in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify whether it is an English named entity (e.g., persons, organizations, works of art) in Latin script or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement in which the phrase occurs as the object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Kurmali language | Statement: [Eastern Indo-Aryan languages, hasMajorLanguage, Kurmali language]
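The reply format requested by the instruction can be checked with a short validator. This is a minimal sketch, assuming the model returns a raw JSON string; the function name `parse_ner_response` and the sample reply are hypothetical and not taken from the pipeline.

```python
import json


def parse_ner_response(raw: str) -> tuple[str, bool]:
    """Validate a NER reply: a JSON object with `phrase` (str) and `is_ne` (bool)."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    return phrase, is_ne


# Hypothetical reply, chosen to agree with the object's "NE" type on this page.
sample = '{"phrase": "Kurmali language", "is_ne": true}'
print(parse_ner_response(sample))
```

Rejecting non-boolean values such as the string "true" keeps a malformed reply from being silently treated as a positive classification.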
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Kurmali language
Context triple: [Eastern Indo-Aryan languages, hasMajorLanguage, Kurmali language]
  • A. Kurmali language (chosen)
    Kurmali language is an Indo-Aryan tribal language spoken primarily in eastern India, especially in Jharkhand, West Bengal, and Odisha, by the Kurmi and related communities.
  • B. Kamviri language
    The Kamviri language is a Nuristani language spoken primarily by the Kam people in parts of eastern Afghanistan and neighboring regions of Pakistan.
  • C. Kumzari language
    The Kumzari language is an endangered Southwestern Iranian language spoken primarily by the Kumzari people in the Musandam Peninsula of Oman.
  • D. Khumi language
    The Khumi language is a lesser-known Tibeto-Burman language spoken primarily by the Khumi people in parts of Myanmar and neighboring regions.
  • E. Kambera language
    Kambera language is an Austronesian language spoken primarily on the island of Sumba in eastern Indonesia, known for its complex morphology and rich oral tradition.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
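The multiple-choice step above can be modelled as a mapping from option letter to candidate, with F and G treated as "no entity selected". This is only a sketch: the name `resolve_choice` and the dictionary layout are assumptions for illustration, and the pipeline's actual reply format is not shown on this page.

```python
from typing import Optional

# Candidates as listed in the NED1 step; F and G carry no entity.
OPTIONS = {
    "A": "Kurmali language",
    "B": "Kamviri language",
    "C": "Kumzari language",
    "D": "Khumi language",
    "E": "Kambera language",
    "F": None,  # None of above
    "G": None,  # Unsure
}


def resolve_choice(letter: str) -> Optional[str]:
    """Map an option letter to its candidate, or None for F/G."""
    key = letter.strip().upper()
    if key not in OPTIONS:
        raise ValueError(f"unknown option: {letter!r}")
    return OPTIONS[key]


print(resolve_choice("A"))
```

Keeping F and G as explicit keys lets a caller distinguish a deliberate non-answer from an unparseable reply, which raises an error instead.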

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level: stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate and elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69c008d8c61081908bcaf61510d881ed | completed | March 22, 2026, 3:20 p.m.
NER | Named-entity recognition | batch_69c0680ed0148190b6e310b15b3449ff | completed | March 22, 2026, 10:07 p.m.
NED1 | Entity disambiguation (via context triple) | batch_69c62d7a1cbc8190a27a0a8e8b466ad5 | completed | March 27, 2026, 7:10 a.m.
Created at: March 22, 2026, 4:32 p.m.
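To confirm that the batch timestamps in the Provenance table read in pipeline order, they can be parsed and sorted. This is a minimal sketch, assuming the "Month D, YYYY, H:MM a.m./p.m." format shown on this page and an English (C) locale for `%B` and `%p`; the helper name `parse_when` is hypothetical.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse a timestamp such as 'March 22, 2026, 3:20 p.m.'."""
    # Normalise "a.m."/"p.m." to "AM"/"PM" so that %p can match.
    cleaned = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(cleaned, "%B %d, %Y, %I:%M %p")


# Step names and timestamps copied from the Provenance table, deliberately shuffled.
batches = [
    ("NED1", "March 27, 2026, 7:10 a.m."),
    ("creating", "March 22, 2026, 3:20 p.m."),
    ("NER", "March 22, 2026, 10:07 p.m."),
]

ordered = sorted(batches, key=lambda b: parse_when(b[1]))
print([step for step, _ in ordered])  # ['creating', 'NER', 'NED1']
```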