Triple

T6775021
Position Surface form Disambiguated ID Type / Status
Subject Naman E155133 entity
Predicate isPartOf P10 FINISHED
Object Central-Eastern Malakula languages E617134 NE FINISHED
The Central-Eastern Malakula languages are a subgroup of closely related Oceanic languages spoken on the island of Malakula in Vanuatu.

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick marked), and the generated description. The batch and timestamp of each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Central-Eastern Malakula languages | Statement: [Naman, isPartOf, Central-Eastern Malakula languages]
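The instruction above asks for a JSON object with `phrase` and `is_ne`. A minimal sketch of how such a reply could be checked before use follows. The function name `parse_ner_response` and the sample reply are hypothetical; the recorded model output is not shown on this page, and the sample only mirrors the "NE" type listed for the object.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Return the is_ne flag from a NER reply after validating its shape.

    Hypothetical helper: the page only specifies the two fields,
    `phrase` (string) and `is_ne` (boolean).
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply is not a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str) or not isinstance(is_ne, bool):
        raise ValueError("reply must contain a string `phrase` and a boolean `is_ne`")

    # Guard against the model echoing back a different phrase.
    if phrase != expected_phrase:
        raise ValueError("reply refers to a different phrase")
    return is_ne


# Illustrative reply, not the recorded model output.
sample_reply = '{"phrase": "Central-Eastern Malakula languages", "is_ne": true}'
print(parse_ner_response(sample_reply, "Central-Eastern Malakula languages"))
```

Checking the type of `is_ne` explicitly avoids treating a string such as "false" as a truthy value.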
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Central-Eastern Malakula languages
Context triple: [Naman, isPartOf, Central-Eastern Malakula languages]
  • A. Lakkia–Biao languages
    The Lakkia–Biao languages are a small branch of the Tai–Kadai language family spoken by minority communities in southern China, notable for preserving archaic features distinct from the more widespread Tai languages.
  • B. Kuki-Chin languages
    Kuki-Chin languages are a subgroup of the Sino-Tibetan language family spoken primarily in northeastern India, Myanmar, and Bangladesh by various Kuki, Chin, and related ethnic communities.
  • C. Kam–Sui languages
    The Kam–Sui languages are a branch of the Tai–Kadai language family spoken primarily in southern China, including languages such as Kam (Dong) and Sui.
  • D. Chamic languages
    The Chamic languages are a branch of the Austronesian language family spoken primarily in mainland Southeast Asia and parts of Indonesia, notable for heavy contact influence from neighboring Austroasiatic and Tai-Kadai languages.
  • E. Northern Luzon languages
    The Northern Luzon languages are a subgroup of Philippine Austronesian languages spoken primarily in the northern part of Luzon in the Philippines, encompassing several related indigenous languages of the region.
  • F. None of above. (chosen)
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Central-Eastern Malakula languages
Triple: [Naman, isPartOf, Central-Eastern Malakula languages]
Generated description
The Central-Eastern Malakula languages are a subgroup of closely related Oceanic languages spoken on the island of Malakula in Vanuatu.
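The response format above requires the reply to start with "Description:", while the stored description appears without that prefix. One way the prefix might be stripped is sketched below; `extract_description` is a hypothetical name, and the pipeline's real post-processing is not documented on this page.

```python
PREFIX = "Description:"


def extract_description(raw: str) -> str:
    """Strip the required 'Description:' prefix and return the sentence.

    Hypothetical helper, assuming the reply follows the stated format.
    """
    text = raw.strip()
    if not text.startswith(PREFIX):
        raise ValueError("reply does not start with 'Description:'")

    sentence = text[len(PREFIX):].strip()
    if not sentence:
        raise ValueError("reply contains no description text")
    return sentence


reply = (
    "Description: The Central-Eastern Malakula languages are a subgroup of "
    "closely related Oceanic languages spoken on the island of Malakula in Vanuatu."
)
print(extract_description(reply))
```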
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Central-Eastern Malakula languages
Target entity description: The Central-Eastern Malakula languages are a subgroup of closely related Oceanic languages spoken on the island of Malakula in Vanuatu.
  • A. Lakkia–Biao languages
    The Lakkia–Biao languages are a small branch of the Tai–Kadai language family spoken by minority communities in southern China, notable for preserving archaic features distinct from the more widespread Tai languages.
  • B. Kuki-Chin languages
    Kuki-Chin languages are a subgroup of the Sino-Tibetan language family spoken primarily in northeastern India, Myanmar, and Bangladesh by various Kuki, Chin, and related ethnic communities.
  • C. Kam–Sui languages
    The Kam–Sui languages are a branch of the Tai–Kadai language family spoken primarily in southern China, including languages such as Kam (Dong) and Sui.
  • D. Chamic languages
    The Chamic languages are a branch of the Austronesian language family spoken primarily in mainland Southeast Asia and parts of Indonesia, notable for heavy contact influence from neighboring Austroasiatic and Tai-Kadai languages.
  • E. Northern Luzon languages
    The Northern Luzon languages are a subgroup of Philippine Austronesian languages spoken primarily in the northern part of Luzon in the Philippines, encompassing several related indigenous languages of the region.
  • F. None of above. (chosen)
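Both disambiguation steps show the same five candidates followed by a "None of above." option, and only NED1 also offers the "Unsure" option. The lettered menu could be assembled roughly as follows. This is a sketch under assumptions: `build_options`, `is_unlinked`, and the `include_unsure` flag are illustrative names, not the pipeline's actual code.

```python
from string import ascii_uppercase

NONE_OPTION = "None of above."
UNSURE_OPTION = "Unsure - the case is ambiguous/there is not enough information to decide."


def build_options(candidates: list[str], include_unsure: bool) -> dict[str, str]:
    """Map letters A, B, C, ... to candidate names plus the fallback options."""
    labels = list(candidates) + [NONE_OPTION]
    if include_unsure:
        labels.append(UNSURE_OPTION)
    return dict(zip(ascii_uppercase, labels))


def is_unlinked(options: dict[str, str], picked: str) -> bool:
    """True when the picked letter is the 'None of above.' option."""
    return options[picked] == NONE_OPTION


candidates = [
    "Lakkia–Biao languages",
    "Kuki-Chin languages",
    "Kam–Sui languages",
    "Chamic languages",
    "Northern Luzon languages",
]
ned1 = build_options(candidates, include_unsure=True)   # options A to G
ned2 = build_options(candidates, include_unsure=False)  # options A to F
print(is_unlinked(ned1, "F"), is_unlinked(ned2, "F"))
```

Comparing against the option text rather than a fixed letter keeps the check valid if the number of candidates changes.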

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level: stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate and elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c68812ef7c819099369f51febb725c completed March 27, 2026, 1:37 p.m.
NER Named-entity recognition batch_69c6d24ddaf08190baffbff991eeb458 completed March 27, 2026, 6:54 p.m.
NED1 Entity disambiguation (via context triple) batch_69c712ca48d88190b9f47b23264d4264 completed March 27, 2026, 11:29 p.m.
NEDg Description generation batch_69c713d2fad881909ac1b96ba4353bfe completed March 27, 2026, 11:33 p.m.
NED2 Entity disambiguation (via description) batch_69c71478072481909e396a2ac39f0f3a completed March 27, 2026, 11:36 p.m.
Created at: March 27, 2026, 2:13 p.m.
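The ordering claim for the object chain can be checked against the timestamps in the table. The sketch below parses the page's date format and confirms that the four object-chain batches are non-decreasing in time. `parse_when` is a hypothetical helper, and the `%B` and `%p` directives assume an English or C locale.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse timestamps such as 'March 27, 2026, 6:54 p.m.'."""
    # strptime's %p expects AM/PM, so normalize the page's a.m./p.m. style.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# Timestamps copied from the Provenance table.
object_chain = [
    ("NER", "March 27, 2026, 6:54 p.m."),
    ("NED1", "March 27, 2026, 11:29 p.m."),
    ("NEDg", "March 27, 2026, 11:33 p.m."),
    ("NED2", "March 27, 2026, 11:36 p.m."),
]

times = [parse_when(when) for _, when in object_chain]
in_order = all(earlier <= later for earlier, later in zip(times, times[1:]))
print(in_order)
```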