Triple

T6539497
Position Surface form Disambiguated ID Type / Status
Subject Ghulam Hussain Hidayatullah E168247 entity
Predicate languageSpoken P151 FINISHED
Object Sindhi E12831 NE FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Sindhi | Statement: [Ghulam Hussain Hidayatullah, languageSpoken, Sindhi]

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Sindhi
Context triple: [Ghulam Hussain Hidayatullah, languageSpoken, Sindhi]
  • A. Sindhi chosen
    Sindhi is an Indo-Aryan language spoken primarily in Pakistan and India, known for its rich literary tradition and distinct script variants.
  • B. Punjabi
    Punjabi refers to the ethnolinguistic group native to the Punjab region of South Asia, known for its distinct language, culture, and traditions shared across parts of India and Pakistan.
  • C. Kashmiri
    Kashmiri refers to the ethnic group native to the Kashmir Valley in the northern part of the Indian subcontinent, known for its distinct language, culture, and traditions.
  • D. Saraiki
    Saraiki is an Indo-Aryan language spoken primarily in central and southern Pakistan, especially in the southern Punjab region.
  • E. Dogri
    Dogri is an Indo-Aryan language spoken primarily in the Jammu region of India and surrounding areas, recognized as one of the official languages of India.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

Stage Batch ID Job type Status
creating batch_69c68a51564081909e93aee0dbd9cca3 elicitation completed
NER batch_69c6add5d3848190a0d70dc4013ab756 ner completed
NED1 batch_69c6d53b861c81908adc984a3067d4ef ned_source_triple completed
Created at: March 27, 2026, 1:50 p.m.