Triple

T10299396
Position Surface form Disambiguated ID Type / Status
Subject Bac Lieu E241585 entity
Predicate hasEthnicGroup P1898 FINISHED
Object Khmer E6451 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Khmer | Statement: [Bac Lieu, hasEthnicGroup, Khmer]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Khmer
Context triple: [Bac Lieu, hasEthnicGroup, Khmer]
  • A. Khmer chosen
    Khmer is the Austroasiatic language spoken primarily in Cambodia, where it serves as the official and most widely used national language.
  • B. Mon-Khmer
    Mon-Khmer is a major branch of the Austroasiatic language family that includes numerous languages spoken across mainland Southeast Asia and parts of South Asia.
  • C. Middle Khmer
    Middle Khmer is the historical stage of the Khmer language spoken in Cambodia roughly between the Angkorian period and the emergence of modern Khmer, serving as a key transitional form in the Austroasiatic language family.
  • D. Khmer Reamker
    Khmer Reamker is the Cambodian epic poem and national literary classic that adapts the Indian Ramayana into Khmer cultural, religious, and artistic traditions.
  • E. Khmeric languages
    Khmeric languages are a branch of the Austroasiatic language family that includes Khmer and its close relatives spoken primarily in Cambodia and neighboring regions.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d381aaafc08190af475ef58dc16aba completed April 6, 2026, 9:49 a.m.
NER Named-entity recognition batch_69d4d2ee10f88190b1615c49b8f24a26 completed April 7, 2026, 9:48 a.m.
NED1 Entity disambiguation (via context triple) batch_69d71d35d5908190bb87100c81f2948a completed April 9, 2026, 3:29 a.m.
Created at: April 6, 2026, 11:44 a.m.