Triple
T6364692
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Eastern Indo-Aryan languages | E143195 | entity |
| Predicate | hasMajorLanguage | P207 | FINISHED |
| Object | Kurmali language | E407557 | NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Kurmali language | Statement: [Eastern Indo-Aryan languages, hasMajorLanguage, Kurmali language]
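The instruction above asks for a JSON object with a string `phrase` and a boolean `is_ne`. A minimal sketch of how such a reply could be checked follows; the function name `parse_ner_response` and the example reply are hypothetical, since this record does not show the model's raw output.

```python
import json


def parse_ner_response(raw: str) -> dict:
    """Parse a reply and check it has the two fields the instruction names."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    return {"phrase": phrase, "is_ne": is_ne}


# Hypothetical reply (assumption): the actual model output is not part of this record.
example_reply = '{"phrase": "Kurmali language", "is_ne": true}'
result = parse_ner_response(example_reply)
```

Rejecting a non-boolean `is_ne` (for example the string "yes") keeps a loosely formatted reply from being silently read as a positive classification.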
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Kurmali language | Context triple: [Eastern Indo-Aryan languages, hasMajorLanguage, Kurmali language]
- A. Kurmali language (chosen): Kurmali language is an Indo-Aryan tribal language spoken primarily in eastern India, especially in Jharkhand, West Bengal, and Odisha, by the Kurmi and related communities.
- B. Kamviri language: The Kamviri language is a Nuristani language spoken primarily by the Kam people in parts of eastern Afghanistan and neighboring regions of Pakistan.
- C. Kumzari language: The Kumzari language is an endangered Southwestern Iranian language spoken primarily by the Kumzari people in the Musandam Peninsula of Oman.
- D. Khumi language: The Khumi language is a lesser-known Tibeto-Burman language spoken primarily by the Khumi people in parts of Myanmar and neighboring regions.
- E. Kambera language: Kambera language is an Austronesian language spoken primarily on the island of Sumba in eastern Indonesia, known for its complex morphology and rich oral tradition.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
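The option list above is five lettered candidates followed by two fallback options (none of the above, unsure). A minimal sketch of mapping a picked letter back to a candidate follows; the helper `resolve_choice` and the convention of returning `None` for a fallback are assumptions for illustration, not the pipeline's actual code.

```python
from string import ascii_uppercase

# Candidate names in the order shown in the option list (A to E).
candidates = [
    "Kurmali language",
    "Kamviri language",
    "Kumzari language",
    "Khumi language",
    "Kambera language",
]

FALLBACK_COUNT = 2  # "None of above" and "Unsure" follow the candidates


def resolve_choice(letter: str, options: list) -> "str | None":
    """Return the candidate for a picked letter, or None for a fallback option."""
    if len(letter) != 1:
        raise ValueError(f"expected a single letter, got {letter!r}")
    # str.index raises ValueError if the character is not an ASCII letter.
    index = ascii_uppercase.index(letter.upper())
    if index < len(options):
        return options[index]
    if index < len(options) + FALLBACK_COUNT:
        return None
    raise ValueError(f"letter {letter!r} is outside the option range")


picked = resolve_choice("A", candidates)
```

With the five candidates above, "A" resolves to the chosen entry, while "F" and "G" resolve to `None` so that a caller can tell "no match" apart from a concrete entity.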
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69c008d8c61081908bcaf61510d881ed | completed | March 22, 2026, 3:20 p.m. |
| NER | Named-entity recognition | batch_69c0680ed0148190b6e310b15b3449ff | completed | March 22, 2026, 10:07 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69c62d7a1cbc8190a27a0a8e8b466ad5 | completed | March 27, 2026, 7:10 a.m. |
Created at: March 22, 2026, 4:32 p.m.
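The batch timestamps in this section use a "Month D, YYYY, H:MM a.m./p.m." style. A minimal sketch of parsing them and checking that the listed steps ran in order follows; `parse_batch_time` is a hypothetical helper, and it assumes an English locale and that every timestamp includes minutes.

```python
from datetime import datetime


def parse_batch_time(text: str) -> datetime:
    """Parse a timestamp such as 'March 22, 2026, 3:20 p.m.'."""
    # strptime's %p expects AM/PM, so normalise the dotted form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# Step names and timestamps copied from the Provenance table.
batches = [
    ("creating", "March 22, 2026, 3:20 p.m."),
    ("NER", "March 22, 2026, 10:07 p.m."),
    ("NED1", "March 27, 2026, 7:10 a.m."),
]

times = [parse_batch_time(when) for _, when in batches]
# True when each batch ran no earlier than the one listed before it.
in_order = all(earlier <= later for earlier, later in zip(times, times[1:]))
```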