Triple

T10705206
Position   Surface form    Disambiguated ID  Type / Status
Subject    Mixe            E252385           entity
Predicate  language        P15               FINISHED
Object     Mixe languages  E193683           NE FINISHED
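
As an illustration only, the record above could be held in a small data structure like the following. This is a minimal sketch: the class and field names (`Slot`, `Triple`, `type_or_status`, `as_statement`) are hypothetical and do not come from the pipeline itself; only the values are copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Slot:
    """One position of the triple (hypothetical field names)."""
    surface_form: str
    disambiguated_id: str
    type_or_status: str


@dataclass(frozen=True)
class Triple:
    """A triple with its ID and three slots (hypothetical layout)."""
    triple_id: str
    subject: Slot
    predicate: Slot
    obj: Slot

    def as_statement(self) -> list:
        # Same [subject, predicate, object] shape used in the step inputs.
        return [
            self.subject.surface_form,
            self.predicate.surface_form,
            self.obj.surface_form,
        ]


# Values taken from the table above.
triple = Triple(
    triple_id="T10705206",
    subject=Slot("Mixe", "E252385", "entity"),
    predicate=Slot("language", "P15", "FINISHED"),
    obj=Slot("Mixe languages", "E193683", "NE FINISHED"),
)
statement = triple.as_statement()
```

Keeping the surface form and the disambiguated ID together in one slot makes it easy to rebuild the bracketed statement that the later steps receive as context.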

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch and timestamp of each step are in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Mixe languages | Statement: [Mixe, language, Mixe languages]
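
The instruction asks for a JSON object with two fields, `phrase` (string) and `is_ne` (boolean). A minimal sketch of how such a reply might be checked is shown below. The function name `parse_ner_response` and the example reply are assumptions for illustration; the page does not show the model's actual output or the pipeline's real validation code.

```python
import json


def parse_ner_response(raw: str) -> dict:
    """Check a reply against the two fields named in the instruction."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")
    if not isinstance(data.get("phrase"), str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(data.get("is_ne"), bool):
        raise ValueError("`is_ne` must be a boolean")
    return {"phrase": data["phrase"], "is_ne": data["is_ne"]}


# Hypothetical reply, not the recorded model output.
example_reply = '{"phrase": "Mixe languages", "is_ne": true}'
result = parse_ner_response(example_reply)
```

Checking `is_ne` with `isinstance(..., bool)` rejects string values such as `"true"`, which a model may emit in place of a real JSON boolean.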
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Mixe languages
Context triple: [Mixe, language, Mixe languages]
  • A. Mixe languages (chosen)
    Mixe languages are a group of related indigenous Mesoamerican languages spoken primarily by the Mixe people in the highlands of Oaxaca, Mexico.
  • B. Engan languages
    The Engan languages are a group of closely related Papuan languages spoken primarily in the highlands of Papua New Guinea, known for their large speaker populations and significance within the Trans–New Guinea language family.
  • C. Malekula languages
    The Malekula languages are a diverse group of closely related Oceanic languages spoken on Malekula Island in Vanuatu.
  • D. Mishing language
    Mishing language is a Tani (Tibeto-Burman) language spoken by the Mishing people, primarily in the northeastern Indian state of Assam.
  • E. Maipurean languages
    The Maipurean languages are a major branch of the Arawakan language family, historically spoken across large areas of northern South America and the Caribbean.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step      Stage                                       Batch ID                                Status     When
creating  Elicitation                                 batch_69d6aa5cbabc8190973e683950d89faf  completed  April 8, 2026, 7:19 p.m.
NER       Named-entity recognition                    batch_69d6fddeb060819094cd125a68070eb2  completed  April 9, 2026, 1:16 a.m.
NED1      Entity disambiguation (via context triple)  batch_69d998fe56dc8190ae0c987b28ec6206  completed  April 11, 2026, 12:42 a.m.
Created at: April 8, 2026, 9:12 p.m.