Triple

T10761133
Position Surface form Disambiguated ID Type / Status
Subject Saharsa Junction railway station E253827 entity
Predicate isKeyJunctionFor P38489 FINISHED
Object Saharsa district E51711 NE FINISHED

How this triple was built (3 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Saharsa district | Statement: [Saharsa Junction railway station, isKeyJunctionFor, Saharsa district]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Saharsa district
Context triple: [Saharsa Junction railway station, isKeyJunctionFor, Saharsa district]
  • A. Saharsa district chosen
    Saharsa district is an administrative region in the Indian state of Bihar, known for its Maithili-speaking population and location in the fertile Kosi river basin.
  • B. Saharsa
    Saharsa is a city in the northeastern Indian state of Bihar, known as a major agricultural and commercial center in the Kosi river region.
  • C. Hardoi district
    Hardoi district is an administrative district in the Indian state of Uttar Pradesh, known for its predominantly agricultural landscape and location in the central part of the state.
  • D. Samastipur district
    Samastipur district is an administrative region in the Indian state of Bihar, known for its agricultural economy and cultural use of the Maithili language.
  • E. Simdega district
    Simdega district is an administrative region in the Indian state of Jharkhand known for its significant indigenous population and use of tribal languages such as Kharia.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: isKeyJunctionFor
Context triple: [Saharsa Junction railway station, isKeyJunctionFor, Saharsa district]
  • A. hasJunctionWith
    Indicates that one entity meets or intersects with another at a shared junction point.
  • B. hasJunctionIn chosen
    Indicates that one entity contains or includes a junction located within the spatial or structural extent of another entity.
  • C. isNumberedJunctionOf
    Indicates that a junction (such as a road or rail intersection) has been assigned an official identifying number within a network.
  • D. isJunctionOnLine
    Indicates that a particular junction lies on, or is part of, a specified line.
  • E. hasJunctionCount
    Indicates the number of junctions associated with or contained in a given entity.
  • F. None of above.

Provenance (4 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d6aa5f54f4819082d0bbcb6f8797e6 completed April 8, 2026, 7:19 p.m.
NER Named-entity recognition batch_69d731a14c7481909c6f4f9b15dc130f completed April 9, 2026, 4:57 a.m.
NED1 Entity disambiguation (via context triple) batch_69de84b88ba08190afddea2976d12465 completed April 14, 2026, 6:17 p.m.
PD Predicate disambiguation batch_69d6f311529c819080ca5493d55d6050 completed April 9, 2026, 12:30 a.m.
Created at: April 8, 2026, 9:16 p.m.