Triple

T16704775
Position | Surface form | Disambiguated ID | Type / Status
Subject | Chan (surname 詹) | E405939 | entity
Predicate | canAlsoRomanizePronunciationOf | P2508 | FINISHED
Object | Cantonese reading of 占 | LITERAL | FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity. This step is what produced the object's LITERAL type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Cantonese reading of 占 | Statement: [Chan (surname 詹), canAlsoRomanizePronunciationOf, Cantonese reading of 占]
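
The instruction above asks for a JSON object with `phrase` and `is_ne`. As a minimal sketch of how such a response might be checked, assuming Python: the function name `parse_ner_response`, the sample response string, and the mapping from `is_ne` to a type label are all hypothetical and not taken from the pipeline itself.

```python
import json
from typing import Tuple


def parse_ner_response(raw: str) -> Tuple[str, bool]:
    """Parse a response and check it has the two fields the instruction requests."""
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    if not isinstance(phrase, str):
        raise ValueError("phrase must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("is_ne must be a boolean")
    return phrase, is_ne


# Hypothetical response, chosen to be consistent with the LITERAL type
# recorded for the object; the actual model output is not shown in this record.
sample = '{"phrase": "Cantonese reading of 占", "is_ne": false}'
phrase, is_ne = parse_ner_response(sample)

# Assumed mapping: a phrase that is not a named entity is typed as LITERAL.
object_type = "entity" if is_ne else "LITERAL"
```

Checking the field types explicitly guards against responses such as `"is_ne": "false"`, which parse as valid JSON but do not match the requested boolean.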

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the option it chose marked "chosen". This is the evidence behind this triple's disambiguated IDs.

PD: Predicate disambiguation (gpt-5-mini-2025-08-07)
Target predicate: canAlsoRomanizePronunciationOf
Context triple: [Chan (surname 詹), canAlsoRomanizePronunciationOf, Cantonese reading of 占]
  • A. hasRomanizationOf (chosen)
    Indicates that one entity is a romanized representation (written in the Latin alphabet) of the other entity’s original script form.
  • B. hasMacronRomanization
    Indicates that an entity is associated with a Romanized form of text that uses macrons to mark long vowels.
  • C. hasRomanizationStandard
    Indicates that an entity’s romanized form follows a specified romanization standard or system.
  • D. hasHakkaRomanization
    Indicates that an entity is associated with a specific representation of its name or term in Hakka Romanization.
  • E. romanizesVowel
    Indicates the action of converting a vowel from a non-Roman writing system into its corresponding representation in the Roman (Latin) alphabet.
  • F. None of above.
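
The option list above can be read as a lettered menu whose last entry is a fallback. As a minimal sketch, assuming the model answers with a single option letter (the reply format is not shown in this record), the letter might be resolved back to a predicate name as follows. `resolve_choice` is a hypothetical helper, not part of the pipeline.

```python
from string import ascii_uppercase
from typing import List, Optional

# Candidate predicates in the order they were shown (options A to E).
CANDIDATES = [
    "hasRomanizationOf",
    "hasMacronRomanization",
    "hasRomanizationStandard",
    "hasHakkaRomanization",
    "romanizesVowel",
]


def resolve_choice(letter: str, candidates: List[str]) -> Optional[str]:
    """Map an option letter to a candidate name.

    The letter immediately after the last candidate stands for
    "None of above" and resolves to None.
    """
    letter = letter.strip().upper()
    if len(letter) != 1 or letter not in ascii_uppercase:
        raise ValueError(f"not a single option letter: {letter!r}")
    index = ascii_uppercase.index(letter)
    if index < len(candidates):
        return candidates[index]
    if index == len(candidates):
        return None  # the "None of above" option
    raise ValueError(f"option {letter} is out of range")


chosen = resolve_choice("A", CANDIDATES)
```

Returning `None` for the fallback keeps "no suitable predicate" distinct from an invalid reply, which raises instead.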

Provenance (3 batches)

Stage | Batch ID | Job type | Status
creating | batch_69d8838db21081909589220fd71440a4 | elicitation | completed
NER | batch_69e3833496dc8190ae4b4a03ba04d69d | ner | completed
PD | batch_69e319c379f88190ac0adf812486f598 | pd | completed
Created at: April 10, 2026, 5:19 a.m.