Triple

T17943726
Position   Surface form      Disambiguated ID  Type / Status
Subject    Qixia             E448647           entity
Predicate  language          P15               FINISHED
Object     Mandarin Chinese  -                 NE, NER FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch and timestamp of each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Mandarin Chinese | Statement: [Qixia, language, Mandarin Chinese]
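The instruction above asks for a JSON object with `phrase` and `is_ne`. As a minimal sketch of how such a reply might be checked before it is stored, the Python below validates both fields. The function name `parse_ner_response` and the sample reply are hypothetical; the model's actual reply is not shown on this page.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a NER reply and return its `is_ne` flag.

    Hypothetical helper: it assumes the reply is a JSON object with
    exactly the two fields named in the instruction.
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    if phrase != expected_phrase:
        raise ValueError("reply refers to a different phrase")
    return is_ne


# Illustrative reply only, not the recorded model output.
sample_reply = '{"phrase": "Mandarin Chinese", "is_ne": true}'
is_ne = parse_ner_response(sample_reply, "Mandarin Chinese")
```

Checking the echoed `phrase` against the input is one way to catch replies that were matched to the wrong item in a batch.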
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Mandarin Chinese
Context triple: [Qixia, language, Mandarin Chinese]
  • A. Mandarin Chinese (chosen)
    Mandarin Chinese is the most widely spoken variety of Chinese and a major world language used across mainland China, Taiwan, and many overseas Chinese communities.
  • B. Hanyu
    Hanyu is a Chinese given name shared by various individuals, including notable figures in fields such as acting, sports, and academia.
  • C. Min Chinese
    Min Chinese is a major and diverse branch of the Sinitic language family, comprising several mutually unintelligible varieties spoken primarily in China’s southeastern coastal regions and among overseas Chinese communities.
  • D. Simplified Chinese
    Simplified Chinese is a standardized form of written Chinese that uses characters with reduced strokes, primarily employed in mainland China and Singapore.
  • E. Standard Chinese
    Standard Chinese is the official standardized form of the Chinese language, based primarily on the Beijing dialect of Mandarin and used as the national lingua franca of China.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
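The lettered list above pairs each candidate with a letter and appends two fallback options. A minimal sketch of that labelling, and of mapping a chosen letter back to a candidate, might look like the following. The helper names `label_options` and `resolve_choice` are hypothetical and are not taken from the pipeline itself.

```python
from string import ascii_uppercase

# Candidate names and fallback options as they appear in the list above.
CANDIDATES = [
    "Mandarin Chinese",
    "Hanyu",
    "Min Chinese",
    "Simplified Chinese",
    "Standard Chinese",
]
FALLBACKS = [
    "None of above.",
    "Unsure - the case is ambiguous/there is not enough information to decide.",
]


def label_options(candidates, fallbacks=FALLBACKS):
    """Assign letters A, B, C, ... to the candidates, then to the fallbacks."""
    return dict(zip(ascii_uppercase, [*candidates, *fallbacks]))


def resolve_choice(letter, candidates, fallbacks=FALLBACKS):
    """Return the chosen candidate, or None if a fallback was picked."""
    options = list(label_options(candidates, fallbacks))
    if letter not in options:
        raise ValueError(f"unknown option letter: {letter!r}")
    index = options.index(letter)
    return candidates[index] if index < len(candidates) else None


picked = resolve_choice("A", CANDIDATES)
```

Returning `None` for the fallback letters keeps "no match" and "unsure" from being stored as if they were entity names.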

Provenance (2 batches)

The batch behind each pipeline step, in order, with the time it ran. Timestamps are batch-level: stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate and elicitation batches can sit in a different wave.

Step      Stage                     Batch ID                                Status     When
creating  Elicitation               batch_69d8b9f8cca8819099836916c56b7c95  completed  April 10, 2026, 8:51 a.m.
NER       Named-entity recognition  batch_69e4ad97611c8190a861467ae51f6c48  completed  April 19, 2026, 10:25 a.m.
Created at: April 10, 2026, 10:21 a.m.