Triple
T5459718
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Shanshan |
E122566
|
entity |
| Predicate | usedScript |
P6524
|
FINISHED |
| Object | Kharosthi script |
E125350
|
NE FINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Kharosthi script | Statement: [Shanshan, usedScript, Kharosthi script]
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Kharosthi script Context triple: [Shanshan, usedScript, Kharosthi script]
-
A.
Kharoṣṭhī script
chosen
The Kharoṣṭhī script is an ancient right-to-left writing system used in northwestern South Asia, especially in the Gandhāra region, primarily for early Buddhist and administrative texts.
-
B.
Shahmukhi script
Shahmukhi script is a Perso-Arabic–based writing system primarily used for writing the Punjabi language in Pakistan.
-
C.
Brahmi script
The Brahmi script is one of the oldest writing systems of the Indian subcontinent, serving as the ancestor of most modern South and Southeast Asian scripts.
-
D.
Tirhuta script
Tirhuta script is a traditional Brahmic writing system historically used for the Maithili language of the Mithila region in India and Nepal.
-
E.
Khojki script
The Khojki script is a historical writing system used primarily by the Nizari Ismaili community of South Asia to record religious and literary texts in languages such as Sindhi and Gujarati.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69bd46424248819085282ddf50a565f3 |
elicitation | completed |
| NER | batch_69bd91f353c481909ae1a73ae419fb9a |
ner | completed |
| NED1 | batch_69bf4149c6c081909f57e214dad54777 |
ned_source_triple | completed |
Created at: March 20, 2026, 2:08 p.m.