Triple

T22111290
Position | Surface form | Disambiguated ID | Type / Status
Subject | DD India | E546423 | entity
Predicate | sisterChannel | P5818 | FINISHED
Object | DD Urdu | | NE NERFINISHED

How this triple was built (3 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick marked "chosen"), and the generated description. The batch and timestamp of each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: DD Urdu | Statement: [DD India, sisterChannel, DD Urdu]
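The instruction above asks for a JSON object with a string `phrase` and a boolean `is_ne`. A minimal sketch of how such a reply could be parsed and validated is shown here. The function name `parse_ner_response` and the example reply are assumptions for illustration; the model's actual raw output is not shown on this page.

```python
import json


def parse_ner_response(raw: str) -> tuple[str, bool]:
    """Parse a NER reply of the form {"phrase": ..., "is_ne": ...} and check its types."""
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    if not isinstance(phrase, str):
        raise ValueError("phrase must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("is_ne must be a boolean")
    return phrase, is_ne


# Hypothetical reply for the input above (assumed, not copied from the pipeline).
example_reply = '{"phrase": "DD Urdu", "is_ne": true}'
print(parse_ner_response(example_reply))
```

Checking the type of `is_ne` explicitly guards against replies such as `"yes"` or `1`, which `json.loads` would accept but which do not match the requested schema.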
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: DD Urdu
Context triple: [DD India, sisterChannel, DD Urdu]
  • A. Dhakaiya Urdu
    Dhakaiya Urdu is a distinctive Urdu-based vernacular spoken by the Urdu-speaking community of Old Dhaka, shaped by centuries of contact with Bengali and local cultures.
  • B. Urdu language
    Urdu is a major South Asian language, written in a Perso-Arabic script and widely used in Pakistan and parts of India in literature, media, and everyday communication.
  • C. Karachi Urdu
    Karachi Urdu is an urban dialect of Urdu shaped by the speech of Muhajir communities in Karachi, marked by distinctive pronunciation, vocabulary, and influences from local languages.
  • D. Urdu literature
    Urdu literature is the body of written works produced in the Urdu language, encompassing poetry, prose, and drama that reflect the cultural, religious, and social life of South Asia.
  • E. Urdu script
    Urdu script is a Perso-Arabic–based writing system, written in a right-to-left, calligraphic style, used primarily for the Urdu language and several related South Asian languages.
  • F. None of above. (chosen)
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: DD Urdu
Target entity description: DD Urdu is an Indian public television channel that broadcasts primarily in the Urdu language, offering cultural, educational, and entertainment programming.
  • A. Dhakaiya Urdu
    Dhakaiya Urdu is a distinctive Urdu-based vernacular spoken by the Urdu-speaking community of Old Dhaka, shaped by centuries of contact with Bengali and local cultures.
  • B. Urdu language
    Urdu is a major South Asian language, written in a Perso-Arabic script and widely used in Pakistan and parts of India in literature, media, and everyday communication.
  • C. Karachi Urdu
    Karachi Urdu is an urban dialect of Urdu shaped by the speech of Muhajir communities in Karachi, marked by distinctive pronunciation, vocabulary, and influences from local languages.
  • D. Urdu literature
    Urdu literature is the body of written works produced in the Urdu language, encompassing poetry, prose, and drama that reflect the cultural, religious, and social life of South Asia.
  • E. Urdu script
    Urdu script is a Perso-Arabic–based writing system, written in a right-to-left, calligraphic style, used primarily for the Urdu language and several related South Asian languages.
  • F. None of above. (chosen)
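Both disambiguation steps present the same five candidates as lettered options, followed by "None of above."; NED1 additionally offers an "Unsure" option. The following sketch shows one way such a lettered menu might be assembled. The helper `build_options` is hypothetical and is not taken from the pipeline; only the option texts come from the listings above.

```python
from string import ascii_uppercase

NONE_OPTION = "None of above."
UNSURE_OPTION = (
    "Unsure - the case is ambiguous/there is not enough information to decide."
)


def build_options(candidates: list[str], include_unsure: bool) -> dict[str, str]:
    """Assign letters A, B, C, ... to the candidates, then append the fallback options."""
    labels = list(candidates) + [NONE_OPTION]
    if include_unsure:
        labels.append(UNSURE_OPTION)
    return dict(zip(ascii_uppercase, labels))


candidates = [
    "Dhakaiya Urdu",
    "Urdu language",
    "Karachi Urdu",
    "Urdu literature",
    "Urdu script",
]

ned1_options = build_options(candidates, include_unsure=True)   # options A to G
ned2_options = build_options(candidates, include_unsure=False)  # options A to F

for letter, label in ned1_options.items():
    print(f"{letter}. {label}")
```

In this record both steps picked option F, which is consistent with the Object row of the triple carrying no disambiguated ID.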

Provenance (2 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69e11e38b3848190ac3a4fa97d56e65a | completed | April 16, 2026, 5:36 p.m.
NER | Named-entity recognition | batch_69f12949cc7881908898ca7dc130f57f | completed | April 28, 2026, 9:40 p.m.
Created at: April 16, 2026, 8:31 p.m.