Triple
T16931965
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Ulungur Lake |
E410730
|
entity |
| Predicate | hasLocalNameLanguage |
P15
|
FINISHED |
| Object | Uyghur |
E80070
|
NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Uyghur | Statement: [Ulungur Lake, hasLocalNameLanguage, Uyghur]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Uyghur Context triple: [Ulungur Lake, hasLocalNameLanguage, Uyghur]
-
A.
Uyghur language
The Uyghur language is a Turkic language spoken primarily by the Uyghur people in China’s Xinjiang region, written in several scripts and serving as a major language of Central Asia.
-
B.
Uyghurs
chosen
The Uyghurs are a Turkic-speaking, predominantly Muslim ethnic group native to the Xinjiang region of northwest China, with a distinct culture, language, and history.
-
C.
Uyghur Arabic alphabet
The Uyghur Arabic alphabet is a Perso-Arabic–based script adapted to represent the sounds of the Uyghur language, historically used by Uyghur communities in Central Asia.
-
D.
Uyghur Cyrillic alphabet
The Uyghur Cyrillic alphabet is a Cyrillic-based script historically used by Uyghur communities, particularly in the former Soviet Union, to write the Uyghur language.
-
E.
Uyghur Latin alphabet
The Uyghur Latin alphabet is a romanized writing system developed for the Uyghur language, used primarily in digital communication and linguistic transcription.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d886c886688190967be07322597ac9 |
completed | April 10, 2026, 5:12 a.m. |
| NER | Named-entity recognition | batch_69e3cf25a6dc8190a2b9d9c4d2adc5fd |
completed | April 18, 2026, 6:36 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_6a00dc024d7c8190b8055833ed8908d5 |
completed | May 10, 2026, 7:26 p.m. |
Created at: April 10, 2026, 5:30 a.m.