Triple

T9051968
Position | Surface form | Disambiguated ID | Type / Status
Subject | Mon Khmer | E216904 | entity
Predicate | hasSubbranch | P1185 | FINISHED
Object | Bahnaric languages | E775526 | NE FINISHED
Object description: The Bahnaric languages are a group of related Austroasiatic languages spoken primarily by indigenous communities in Vietnam, Laos, and Cambodia.

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Bahnaric languages | Statement: [Mon Khmer, hasSubbranch, Bahnaric languages]
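
The instruction asks for a JSON object with `phrase` and `is_ne`. The page does not show how the pipeline builds the input or checks the reply, so the following is only a minimal sketch of one way to do both. The names `NerResult`, `build_ner_input`, and `parse_ner_response` are hypothetical, and the reply string in the usage note is illustrative rather than the recorded model output.

```python
import json
from dataclasses import dataclass
from typing import Tuple


@dataclass
class NerResult:
    """Hypothetical container for the two fields the instruction requests."""
    phrase: str
    is_ne: bool


def build_ner_input(phrase: str, triple: Tuple[str, str, str]) -> str:
    """Format the input line in the same shape as the Input shown above."""
    subject, predicate, obj = triple
    return f"Phrase: {phrase} | Statement: [{subject}, {predicate}, {obj}]"


def parse_ner_response(raw: str) -> NerResult:
    """Parse a reply and check that `phrase` is a string and `is_ne` a boolean."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    return NerResult(phrase=phrase, is_ne=is_ne)
```

For example, `parse_ner_response('{"phrase": "Bahnaric languages", "is_ne": true}')` yields a result with `is_ne` set to `True`, which would correspond to the NE type recorded for the object. A reply such as `{"phrase": "x", "is_ne": "yes"}` is rejected because `is_ne` is not a boolean.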

Disambiguation candidates (2 decisions)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted. This is the evidence behind this triple's disambiguated IDs.

NED1: Entity disambiguation (via context triple), gpt-5-mini-2025-08-07
Target entity: Bahnaric languages
Context triple: [Mon Khmer, hasSubbranch, Bahnaric languages]
  • A. Hmongic languages
    Hmongic languages are a branch of the Hmong-Mien language family spoken primarily by Hmong and related ethnic groups in southern China and Southeast Asia.
  • B. Kove–Mangseng languages
    The Kove–Mangseng languages are a small subgroup of closely related Oceanic languages spoken in parts of Papua New Guinea.
  • C. Vietic languages
    Vietic languages are a branch of the Austroasiatic language family spoken primarily in Vietnam and neighboring areas, encompassing Vietnamese and several closely related minority languages.
  • D. Chamic languages
    The Chamic languages are a branch of the Austronesian language family spoken primarily in mainland Southeast Asia and parts of Indonesia, notable for heavy contact influence from neighboring Austroasiatic and Tai-Kadai languages.
  • E. Chimuan languages
    The Chimuan languages are an extinct group of pre-Columbian languages once spoken along the northern coast of Peru, most notably associated with the Chimú civilization.
  • F. None of above. (chosen)
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2: Entity disambiguation (via description), gpt-5-mini-2025-08-07
Target entity: Bahnaric languages
Target entity description: The Bahnaric languages are a group of related Austroasiatic languages spoken primarily by indigenous communities in Vietnam, Laos, and Cambodia.
  • A. Hmongic languages
    Hmongic languages are a branch of the Hmong-Mien language family spoken primarily by Hmong and related ethnic groups in southern China and Southeast Asia.
  • B. Kove–Mangseng languages
    The Kove–Mangseng languages are a small subgroup of closely related Oceanic languages spoken in parts of Papua New Guinea.
  • C. Vietic languages
    Vietic languages are a branch of the Austroasiatic language family spoken primarily in Vietnam and neighboring areas, encompassing Vietnamese and several closely related minority languages.
  • D. Chamic languages
    The Chamic languages are a branch of the Austronesian language family spoken primarily in mainland Southeast Asia and parts of Indonesia, notable for heavy contact influence from neighboring Austroasiatic and Tai-Kadai languages.
  • E. Chimuan languages
    The Chimuan languages are an extinct group of pre-Columbian languages once spoken along the northern coast of Peru, most notably associated with the Chimú civilization.
  • F. None of above. (chosen)
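
Both steps present the same five candidates under letters A to E, followed by a "None of above." option. NED1 also offers an "Unsure" option. A rough sketch of how such a lettered list could be assembled and a chosen letter mapped back to a candidate is given below. The function names are hypothetical, and the pipeline's own code is not shown on this page.

```python
from string import ascii_uppercase
from typing import List, Optional, Tuple

# Fallback labels copied from the option lists above.
NONE_LABEL = "None of above."
UNSURE_LABEL = "Unsure - the case is ambiguous/there is not enough information to decide."


def build_options(candidates: List[str], include_unsure: bool) -> List[Tuple[str, str]]:
    """Return (letter, label) pairs: candidates first, then the fallback options."""
    labels = list(candidates) + [NONE_LABEL]
    if include_unsure:
        labels.append(UNSURE_LABEL)
    if len(labels) > len(ascii_uppercase):
        raise ValueError("too many options for single-letter labels")
    return list(zip(ascii_uppercase, labels))


def resolve_choice(options: List[Tuple[str, str]], letter: str, n_candidates: int) -> Optional[str]:
    """Map a chosen letter to a candidate name, or None for a fallback option."""
    key = letter.strip().upper()
    lookup = dict(options)
    if key not in lookup:
        raise ValueError(f"unknown option letter: {letter!r}")
    index = ascii_uppercase.index(key)
    return lookup[key] if index < n_candidates else None


candidates = [
    "Hmongic languages",
    "Kove–Mangseng languages",
    "Vietic languages",
    "Chamic languages",
    "Chimuan languages",
]
ned1_options = build_options(candidates, include_unsure=True)   # letters A to G
ned2_options = build_options(candidates, include_unsure=False)  # letters A to F
```

Under this sketch, the recorded choice "F" resolves to `None` in both steps, meaning that no listed candidate was accepted as a match for "Bahnaric languages".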

How the object was described

The object's one-sentence description was generated by prompting gpt-5.1 with the object name and this triple as context.

Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Bahnaric languages
Triple: [Mon Khmer, hasSubbranch, Bahnaric languages]
Generated description
The Bahnaric languages are a group of related Austroasiatic languages spoken primarily by indigenous communities in Vietnam, Laos, and Cambodia.
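
The response format requires a single line beginning with "Description:". As a sketch only, the input could be assembled and the prefix stripped from the reply as follows. `build_description_input` and `parse_description` are hypothetical helper names, not part of the pipeline shown here.

```python
from typing import Tuple


def build_description_input(entity: str, triple: Tuple[str, str, str]) -> str:
    """Format the Entity/Triple input in the same shape as the Input shown above."""
    subject, predicate, obj = triple
    return f"Entity: {entity}\nTriple: [{subject}, {predicate}, {obj}]"


def parse_description(raw: str) -> str:
    """Strip the required 'Description:' prefix and return the sentence."""
    prefix = "Description:"
    text = raw.strip()
    if not text.startswith(prefix):
        raise ValueError("reply does not start with 'Description:'")
    sentence = text[len(prefix):].strip()
    if not sentence:
        raise ValueError("empty description")
    return sentence
```

Rejecting replies without the prefix, rather than accepting them silently, keeps malformed outputs from being passed on to the description-based disambiguation step (NED2).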

Provenance (5 batches)

Stage | Batch ID | Job type | Status
creating | batch_69ca83d362e88190ae44b4e4dc194209 | elicitation | completed
NER | batch_69cc7a700de48190aa9f61d850e01cbd | ner | completed
NED1 | batch_69cfebc90bf88190bbcdab07ca93f569 | ned_source_triple | completed
NED2 | batch_69cff0ea8c388190bd95233db9c69038 | ned_description | completed
NEDg | batch_69cfecf7fce08190a9b80044a2ae9745 | nedg | completed
Created at: March 30, 2026, 7:10 p.m.