Triple

T5459718
Position Surface form Disambiguated ID Type / Status
Subject Shanshan E122566 entity
Predicate usedScript P6524 FINISHED
Object Kharosthi script E125350 NE FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Kharosthi script | Statement: [Shanshan, usedScript, Kharosthi script]

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Kharosthi script
Context triple: [Shanshan, usedScript, Kharosthi script]
  • A. Kharoṣṭhī script chosen
    The Kharoṣṭhī script is an ancient right-to-left writing system used in northwestern South Asia, especially in the Gandhāra region, primarily for early Buddhist and administrative texts.
  • B. Shahmukhi script
    Shahmukhi script is a Perso-Arabic–based writing system primarily used for writing the Punjabi language in Pakistan.
  • C. Brahmi script
    The Brahmi script is one of the oldest writing systems of the Indian subcontinent, serving as the ancestor of most modern South and Southeast Asian scripts.
  • D. Tirhuta script
    Tirhuta script is a traditional Brahmic writing system historically used for the Maithili language of the Mithila region in India and Nepal.
  • E. Khojki script
    The Khojki script is a historical writing system used primarily by the Nizari Ismaili community of South Asia to record religious and literary texts in languages such as Sindhi and Gujarati.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

Stage Batch ID Job type Status
creating batch_69bd46424248819085282ddf50a565f3 elicitation completed
NER batch_69bd91f353c481909ae1a73ae419fb9a ner completed
NED1 batch_69bf4149c6c081909f57e214dad54777 ned_source_triple completed
Created at: March 20, 2026, 2:08 p.m.