Triple

T22472362
Position | Surface form | Disambiguated ID | Type / Status
Subject | Baicheng | E555534 | entity
Predicate | language | P15 | FINISHED
Object | Mandarin Chinese | | NE NERFINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity. This step produced the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Mandarin Chinese | Statement: [Baicheng, language, Mandarin Chinese]
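
The input line and the JSON reply described in the instruction can be sketched as follows. This is a minimal illustration, not the pipeline's actual code: the function names `format_ner_input` and `parse_ner_response` are hypothetical, and the sample reply in the usage note is an assumed payload rather than the model's recorded output.

```python
import json


def format_ner_input(phrase: str, statement: tuple[str, str, str]) -> str:
    """Build the input line in the layout shown above (hypothetical helper)."""
    subject, predicate, obj = statement
    return f"Phrase: {phrase} | Statement: [{subject}, {predicate}, {obj}]"


def parse_ner_response(raw: str) -> tuple[str, bool]:
    """Parse a reply and check the two fields the instruction asks for."""
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    # The instruction specifies a string and a boolean; reject anything else.
    if not isinstance(phrase, str) or not isinstance(is_ne, bool):
        raise ValueError("expected `phrase` as string and `is_ne` as boolean")
    return phrase, is_ne
```

For example, `parse_ner_response('{"phrase": "Mandarin Chinese", "is_ne": true}')` would return the phrase together with `True`, while a reply whose `is_ne` is the string `"true"` would be rejected.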

Disambiguation candidates (1 decision)

These are the exact options the model was shown at each disambiguation step, with the option it chose marked. They are the evidence behind this triple's disambiguated IDs.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Mandarin Chinese
Context triple: [Baicheng, language, Mandarin Chinese]
  • A. Mandarin Chinese (chosen)
    Mandarin Chinese is the most widely spoken variety of Chinese and a major world language used across mainland China, Taiwan, and many overseas Chinese communities.
  • B. Hanyu
    Hanyu is a Chinese given name shared by various individuals, including notable figures in fields such as acting, sports, and academia.
  • C. Min Chinese
    Min Chinese is a major and diverse branch of the Sinitic language family, comprising several mutually unintelligible varieties spoken primarily in China’s southeastern coastal regions and among overseas Chinese communities.
  • D. Simplified Chinese
    Simplified Chinese is a standardized form of written Chinese that uses characters with reduced strokes, primarily employed in mainland China and Singapore.
  • E. Standard Chinese
    Standard Chinese is the official standardized form of the Chinese language, based primarily on the Beijing dialect of Mandarin and used as the national lingua franca of China.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
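
The lettered option list above can be modelled roughly as below. This is a sketch under assumptions: the names `label_options`, `resolve_choice`, and `FALLBACKS` are hypothetical, and the record does not say how the pipeline handles the two fallback options, so mapping them to `None` is only one plausible choice.

```python
import string

# The two non-candidate options, worded as they appear in the list above.
FALLBACKS = [
    "None of above.",
    "Unsure - the case is ambiguous/there is not enough information to decide.",
]


def label_options(candidates: list[str]) -> dict[str, str]:
    """Assign letters A, B, C, ... to the candidates, then to the fallbacks."""
    options = list(candidates) + FALLBACKS
    return dict(zip(string.ascii_uppercase, options))


def resolve_choice(labelled: dict[str, str], letter: str):
    """Return the chosen candidate, or None if a fallback was picked (assumed)."""
    choice = labelled[letter]
    return None if choice in FALLBACKS else choice
```

With the five candidates from this step, the fallbacks land on F and G as in the list, and resolving the letter A yields "Mandarin Chinese".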

Provenance (2 batches)

Stage | Batch ID | Job type | Status
creating | batch_69e11e52c2048190952dc5df209b9bed | elicitation | completed
NER | batch_69f15be0d3c08190851537660cda619c | ner | completed
Created at: April 16, 2026, 8:49 p.m.