Triple

T5122693
Position Surface form Disambiguated ID Type / Status
Subject Longgang District E115506 entity
Predicate languageUsed P238 FINISHED
Object Hakka E34449 NE FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Hakka | Statement: [Longgang District, languageUsed, Hakka]

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Hakka
Context triple: [Longgang District, languageUsed, Hakka]
  • A. Hakka chosen
    Hakka is a Sinitic language spoken primarily by the Hakka people across southern China and various overseas Chinese communities.
  • B. Hokkien
    Hokkien is a Southern Min Chinese language variety widely spoken in Taiwan, Southeast Asia, and parts of southern China, known for its rich tonal system and distinct vocabulary from Mandarin.
  • C. Chōmin
    Chōmin was the pen name of Nakae Chōmin, a prominent Japanese political theorist, journalist, and early advocate of liberal democracy during the Meiji era.
  • D. Guanggu
    Guanggu is a major high-tech development zone in Wuhan, China, known as an innovation hub for the optics and electronics industries.
  • E. Hui
    The Hui are a predominantly Muslim ethnic group in China known for their integration of Islamic faith with Han Chinese language and cultural practices.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

Stage Batch ID Job type Status
creating batch_69bd4442ade0819087b9461f892b206b elicitation completed
NER batch_69bd78045e448190961db0ca7692370e ner completed
NED1 batch_69bec4b401a481909abf6660401c47dc ned_source_triple completed
Created at: March 20, 2026, 1:42 p.m.