Triple

T7573620
Position | Surface form | Disambiguated ID | Type / Status
Subject | Magahi language | E179307 | entity
Predicate | relatedTo | P37 | FINISHED
Object | Hindi language | E5054 | NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch and timestamp of each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Hindi language | Statement: [Magahi language, relatedTo, Hindi language]
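
The instruction above asks for a JSON object with `phrase` and `is_ne`. A minimal sketch of how such a response could be checked before use follows. The function name `parse_ner_response` and the sample response string are hypothetical; the model's actual raw output is not part of this record.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a response shaped as the NER instruction describes and return is_ne."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("response must be a JSON object")
    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    # bool is checked explicitly so that 0/1 or "true" are rejected.
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    if phrase != expected_phrase:
        raise ValueError("response is about a different phrase")
    return is_ne


# Hypothetical response text, assumed for illustration only.
example = '{"phrase": "Hindi language", "is_ne": true}'
result = parse_ner_response(example, "Hindi language")
```

Rejecting non-boolean values for `is_ne` keeps a loosely formatted reply from being read as a classification.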
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Hindi language
Context triple: [Magahi language, relatedTo, Hindi language]
  • A. Hindustani language
    The Hindustani language is a major Indo-Aryan language of South Asia that encompasses the closely related standardized forms of Hindi and Urdu and serves as a key lingua franca across northern India and Pakistan.
  • B. Hindi (chosen)
    Hindi is an Indo-Aryan language widely spoken across northern and central India and used in government, education, media, and popular culture.
  • C. Swati language
    Swati language is a Bantu language of the Nguni group spoken primarily in Eswatini and parts of South Africa.
  • D. Punjabi language
    Punjabi language is an Indo-Aryan language widely spoken in the Punjab region of India and Pakistan and among large diaspora communities worldwide.
  • E. Marathi language
    Marathi language is an Indo-Aryan language predominantly spoken in the Indian state of Maharashtra and surrounding regions, with a rich literary tradition and official status in the state.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
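
The option list above can be read as a mapping from a letter to one of three outcomes: a linked candidate, "none of the above", or "unsure". The sketch below shows one way to resolve a pick under that reading. It assumes the model answers with a single option letter, which this record does not confirm, and the names `OPTIONS` and `resolve_choice` are hypothetical.

```python
# Candidate labels copied from the options shown for this step.
OPTIONS = {
    "A": "Hindustani language",
    "B": "Hindi",
    "C": "Swati language",
    "D": "Punjabi language",
    "E": "Marathi language",
}
NONE_OF_ABOVE = "F"
UNSURE = "G"


def resolve_choice(answer: str):
    """Map an option letter to (outcome, label); assumes a one-letter answer."""
    letter = answer.strip().rstrip(".").upper()
    if letter in OPTIONS:
        return ("linked", OPTIONS[letter])
    if letter == NONE_OF_ABOVE:
        return ("none", None)
    if letter == UNSURE:
        return ("unsure", None)
    raise ValueError(f"unknown option: {answer!r}")


# The pick recorded for this triple is option B.
picked = resolve_choice("B")
```

Keeping "none" and "unsure" as separate outcomes preserves the distinction the option list itself makes between F and G.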

Provenance (3 batches)

The batch behind each pipeline step, in order, with the time it ran. Timestamps are batch-level: stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate and elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69c69f316e50819081a271c85c06f918 | completed | March 27, 2026, 3:16 p.m.
NER | Named-entity recognition | batch_69c6f948e1e08190ad807292365a0c27 | completed | March 27, 2026, 9:40 p.m.
NED1 | Entity disambiguation (via context triple) | batch_69c856ea9a2c8190a81762ac509c4c97 | completed | March 28, 2026, 10:32 p.m.
Created at: March 27, 2026, 3:51 p.m.