Triple
T7790041
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Ganxian Hakka |
E187352
|
entity |
| Predicate | hasAncestor |
P369
|
FINISHED |
| Object | Middle Chinese |
E125417
|
NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Middle Chinese | Statement: [Ganxian Hakka, hasAncestor, Middle Chinese]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Middle Chinese Context triple: [Ganxian Hakka, hasAncestor, Middle Chinese]
-
A.
Middle Chinese
chosen
Middle Chinese is the historical stage of the Chinese language spoken during the Sui, Tang, and early Song dynasties, serving as the ancestor of many modern Chinese varieties and a key reference for historical phonology.
-
B.
Old Chinese
Old Chinese is the earliest attested stage of the Chinese language, spoken during the Shang and Zhou dynasties and reconstructed from ancient texts and inscriptions.
-
C.
Early Mandarin
Early Mandarin is a historical stage of the Chinese language that developed after Middle Chinese and laid the foundation for modern Mandarin varieties.
-
D.
Proto-Sinitic
Proto-Sinitic is the reconstructed common ancestor of the Sinitic (Chinese) languages, from which all modern varieties of Chinese are believed to have descended.
-
E.
Mandarin Chinese
Mandarin Chinese is the most widely spoken variety of Chinese and a major world language used across mainland China, Taiwan, and many overseas Chinese communities.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69ca82af2d2c8190963861f5e0b8bf21 |
completed | March 30, 2026, 2:03 p.m. |
| NER | Named-entity recognition | batch_69cae7ea13f08190a60c5f1863bce816 |
completed | March 30, 2026, 9:15 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69caf62c8568819090c058b0c55b7865 |
completed | March 30, 2026, 10:16 p.m. |
Created at: March 30, 2026, 4:25 p.m.