Triple

T17592485
Position Surface form Disambiguated ID Type / Status
Subject Mulam E428480 entity
Predicate languageMacrofamily P1047 FINISHED
Object Kra–Dai languages NE NERFINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Kra–Dai languages | Statement: [Mulam, languageMacrofamily, Kra–Dai languages]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Kra–Dai languages
Context triple: [Mulam, languageMacrofamily, Kra–Dai languages]
  • A. Kam–Sui languages
    The Kam–Sui languages are a branch of the Tai–Kadai language family spoken primarily in southern China, including languages such as Kam (Dong) and Sui.
  • B. Sino-Tibetan languages
    The Sino-Tibetan languages are a major language family of East, Southeast, and South Asia that includes Chinese, Tibetan, Burmese, and numerous related languages spoken by over a billion people.
  • C. Kuki-Chin languages
    Kuki-Chin languages are a subgroup of the Sino-Tibetan language family spoken primarily in northeastern India, Myanmar, and Bangladesh by various Kuki, Chin, and related ethnic communities.
  • D. Tai–Kadai languages
    The Tai–Kadai languages are a major language family of Southeast Asia that includes Thai, Lao, and related languages spoken across mainland and parts of southern China.
  • E. Lakkia–Biao languages
    The Lakkia–Biao languages are a small branch of the Tai–Kadai language family spoken by minority communities in southern China, notable for preserving archaic features distinct from the more widespread Tai languages.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Kra–Dai languages
Target entity description: The Kra–Dai languages are a family of tonal languages of East and Southeast Asia that includes major groups such as Thai, Lao, and Zhuang.
  • A. Kam–Sui languages
    The Kam–Sui languages are a branch of the Tai–Kadai language family spoken primarily in southern China, including languages such as Kam (Dong) and Sui.
  • B. Sino-Tibetan languages
    The Sino-Tibetan languages are a major language family of East, Southeast, and South Asia that includes Chinese, Tibetan, Burmese, and numerous related languages spoken by over a billion people.
  • C. Kuki-Chin languages
    Kuki-Chin languages are a subgroup of the Sino-Tibetan language family spoken primarily in northeastern India, Myanmar, and Bangladesh by various Kuki, Chin, and related ethnic communities.
  • D. Tai–Kadai languages chosen
    The Tai–Kadai languages are a major language family of Southeast Asia that includes Thai, Lao, and related languages spoken across mainland and parts of southern China.
  • E. Lakkia–Biao languages
    The Lakkia–Biao languages are a small branch of the Tai–Kadai language family spoken by minority communities in southern China, notable for preserving archaic features distinct from the more widespread Tai languages.
  • F. None of above.
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: languageMacrofamily
Context triple: [Mulam, languageMacrofamily, Kra–Dai languages]
  • A. languageFamilyBranchOf
    Indicates that one language family branch is a sub-group or subdivision within a larger language family.
  • B. ancientLanguageFamily
    Indicates that one language belongs to or descends from a historically ancient family of related languages.
  • C. languageFamily chosen
    Indicates that two or more languages belong to the same genealogical language family or linguistic lineage.
  • D. studiesLanguageFamily
    Indicates that an entity engages in the academic or systematic study of a particular language family.
  • E. inLanguageFamily
    Indicates that two languages belong to the same linguistic family or classification.
  • F. None of above.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d889e1030481909950e140c63255b9 completed April 10, 2026, 5:25 a.m.
NER Named-entity recognition batch_69e469e79dac8190953a1ce8fc015b20 completed April 19, 2026, 5:36 a.m.
PD Predicate disambiguation batch_69e3b4fff0348190b899a32da537eaca completed April 18, 2026, 4:44 p.m.
Created at: April 10, 2026, 5:51 a.m.