Triple
T5122693
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Longgang District | E115506 | entity |
| Predicate | languageUsed | P238 | FINISHED |
| Object | Hakka | E34449 | NE FINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Hakka | Statement: [Longgang District, languageUsed, Hakka]
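As a rough illustration of the request and response shape described in the instruction, the sketch below builds the input line and validates a reply. The function names `build_ner_input` and `parse_ner_response` are hypothetical, and the sample reply is an assumed example consistent with the object's NE type above, not the model's recorded output.

```python
import json


def build_ner_input(phrase, statement):
    """Format the NER input line in the same layout as the Input shown above."""
    subject, predicate, obj = statement
    return f"Phrase: {phrase} | Statement: [{subject}, {predicate}, {obj}]"


def parse_ner_response(raw):
    """Parse a JSON reply and check the two fields the instruction asks for."""
    data = json.loads(raw)
    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    return phrase, is_ne


# Assumed example reply (illustrative only, not taken from the batch output).
sample_reply = '{"phrase": "Hakka", "is_ne": true}'
prompt_line = build_ner_input(
    "Hakka", ("Longgang District", "languageUsed", "Hakka")
)
parsed = parse_ner_response(sample_reply)
```

Checking the types explicitly keeps a malformed reply (for example `"is_ne": "yes"`) from being silently treated as a positive classification.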
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Hakka | Context triple: [Longgang District, languageUsed, Hakka]
- A. Hakka (chosen): Hakka is a Sinitic language spoken primarily by the Hakka people across southern China and various overseas Chinese communities.
- B. Hokkien: Hokkien is a Southern Min Chinese language variety widely spoken in Taiwan, Southeast Asia, and parts of southern China, known for its rich tonal system and distinct vocabulary from Mandarin.
- C. Chōmin: Chōmin was the pen name of Nakae Chōmin, a prominent Japanese political theorist, journalist, and early advocate of liberal democracy during the Meiji era.
- D. Guanggu: Guanggu is a major high-tech development zone in Wuhan, China, known as an innovation hub for the optics and electronics industries.
- E. Hui: The Hui are a predominantly Muslim ethnic group in China known for their integration of Islamic faith with Han Chinese language and cultural practices.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
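One way to turn a selected option letter back into a decision is sketched below. It assumes the layout used in this step: candidates lettered from A, followed by one "None of above" option and one "Unsure" option. The function name `resolve_choice` and the status strings are hypothetical and are not taken from the pipeline.

```python
# Candidate labels in the order they were shown for NED1.
CANDIDATES = ["Hakka", "Hokkien", "Chōmin", "Guanggu", "Hui"]


def resolve_choice(letter, candidates):
    """Map an option letter to (status, label).

    Assumes candidates occupy the first len(candidates) letters and are
    followed by "None of above" and then "Unsure".
    """
    letter = letter.strip().upper()
    if len(letter) != 1 or not ("A" <= letter <= "Z"):
        raise ValueError(f"not a single option letter: {letter!r}")
    index = ord(letter) - ord("A")
    if index < len(candidates):
        return ("chosen", candidates[index])
    if index == len(candidates):
        return ("none_of_above", None)
    if index == len(candidates) + 1:
        return ("unsure", None)
    raise ValueError(f"option {letter} is out of range")


decision = resolve_choice("A", CANDIDATES)
```

Keeping "None of above" and "Unsure" as distinct statuses preserves the difference between a rejected candidate set and an undecidable case.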
Provenance (3 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69bd4442ade0819087b9461f892b206b | elicitation | completed |
| NER | batch_69bd78045e448190961db0ca7692370e | ner | completed |
| NED1 | batch_69bec4b401a481909abf6660401c47dc | ned_source_triple | completed |
Created at: March 20, 2026, 1:42 p.m.