Triple

T17482916
Position Surface form Disambiguated ID Type / Status
Subject Alipur E425708 entity
Predicate countrySubdivisionName P2356 FINISHED
Object Punjab NE NERFINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Punjab | Statement: [Alipur, countrySubdivisionName, Punjab]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Punjab
Context triple: [Alipur, countrySubdivisionName, Punjab]
  • A. Punjab chosen
    Punjab is a historically and culturally rich region of South Asia, known for its fertile agricultural lands, Sikh heritage, and partition between modern-day India and Pakistan.
  • B. Panjab
    Panjab is a town in Afghanistan’s Hazarajat region that serves as an important local center within Bamyan Province.
  • C. Gujrat
    Gujrat is a city in Pakistan’s Punjab province known for its industrial activity, historical significance, and position along the Grand Trunk Road between Lahore and Islamabad.
  • D. Haryana
    Haryana is a northern Indian state known for its significant agricultural output, rapid industrial growth, and proximity to the national capital, New Delhi.
  • E. Sindh
    Sindh is a southeastern province of Pakistan known for its historical Indus Valley heritage, major cities like Karachi and Hyderabad, and a rich Sindhi cultural and linguistic tradition.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (2 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d889dccf7481909264a1844a2e9100 completed April 10, 2026, 5:25 a.m.
NER Named-entity recognition batch_69e451c0db14819098922453131fb40a completed April 19, 2026, 3:53 a.m.
Created at: April 10, 2026, 5:48 a.m.