Triple

T10761117
Position Surface form Disambiguated ID Type / Status
Subject Saharsa Junction railway station E253827 entity
Predicate district P2709 FINISHED
Object Saharsa district E51711 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Saharsa district | Statement: [Saharsa Junction railway station, district, Saharsa district]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Saharsa district
Context triple: [Saharsa Junction railway station, district, Saharsa district]
  • A. Saharsa district chosen
    Saharsa district is an administrative region in the Indian state of Bihar, known for its Maithili-speaking population and location in the fertile Kosi river basin.
  • B. Saharsa
    Saharsa is a city in the northeastern Indian state of Bihar, known as a major agricultural and commercial center in the Kosi river region.
  • C. Hardoi district
    Hardoi district is an administrative district in the Indian state of Uttar Pradesh, known for its predominantly agricultural landscape and location in the central part of the state.
  • D. Samastipur district
    Samastipur district is an administrative region in the Indian state of Bihar, known for its agricultural economy and cultural use of the Maithili language.
  • E. Simdega district
    Simdega district is an administrative region in the Indian state of Jharkhand known for its significant indigenous population and use of tribal languages such as Kharia.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d6aa5f54f4819082d0bbcb6f8797e6 completed April 8, 2026, 7:19 p.m.
NER Named-entity recognition batch_69d731a14c7481909c6f4f9b15dc130f completed April 9, 2026, 4:57 a.m.
NED1 Entity disambiguation (via context triple) batch_69de55bb98ec8190914031643c1c7a97 completed April 14, 2026, 2:56 p.m.
Created at: April 8, 2026, 9:16 p.m.