Triple

T21145441
Position  | Surface form                  | Disambiguated ID | Type / Status
Subject   | Santa María de Ocotán dialect | E521041          | entity
Predicate | subdivisionOf                 | P258             | FINISHED
Object    | Tepehuán language             |                  | NE / NERFINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity. This step produced the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Tepehuán language | Statement: [Santa María de Ocotán dialect, subdivisionOf, Tepehuán language]
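
The instruction asks for a JSON object with `phrase` and `is_ne`. A minimal sketch of how such a response could be checked is shown here; the function name `parse_ner_response` and the example response string are hypothetical, since the model's raw output is not part of this record.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Validate a response of the shape the instruction describes and return is_ne.

    Hypothetical helper: the pipeline's real validation logic is not shown in this record.
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("response must be a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")

    # The instruction specifies a string `phrase` and a boolean `is_ne`.
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")

    # Guard against the model echoing a different phrase than the one analyzed.
    if phrase != expected_phrase:
        raise ValueError("`phrase` does not match the input phrase")

    return is_ne


# Assumed example response, consistent with the NE type recorded for the object.
example = '{"phrase": "Tepehuán language", "is_ne": true}'
result = parse_ner_response(example, "Tepehuán language")
```

Rejecting a mismatched `phrase` is a design choice made for this sketch, not something the instruction itself requires.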

Disambiguation candidates (2 decisions)

The exact options the model was shown at each disambiguation step, with the option it chose marked as chosen. These decisions are the evidence behind this triple's disambiguated IDs.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Tepehuán language
Context triple: [Santa María de Ocotán dialect, subdivisionOf, Tepehuán language]
  • A. Sipakapense language
    The Sipakapense language is a Mayan language spoken by the Sipakapense people in the western highlands of Guatemala.
  • B. Picurís language
    The Picurís language is a Native American Tanoan language traditionally spoken by the Picurís Pueblo people of northern New Mexico and now considered highly endangered.
  • C. Piapoco language
    The Piapoco language is an indigenous Arawakan language spoken by the Piapoco people of Colombia and Venezuela.
  • D. Huambisa language
    The Huambisa language is an indigenous Jivaroan language spoken by the Huambisa (Wampis) people of the northern Peruvian Amazon.
  • E. Kalapalo language
    The Kalapalo language is an indigenous Cariban language spoken by the Kalapalo people of Brazil’s Upper Xingu region.
  • F. None of above. [chosen]
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Tepehuán language
Target entity description: The Tepehuán language is a Uto-Aztecan indigenous language of northern Mexico spoken by the Tepehuán people in several regional dialects.
  • A. Sipakapense language
    The Sipakapense language is a Mayan language spoken by the Sipakapense people in the western highlands of Guatemala.
  • B. Picurís language
    The Picurís language is a Native American Tanoan language traditionally spoken by the Picurís Pueblo people of northern New Mexico and now considered highly endangered.
  • C. Piapoco language
    The Piapoco language is an indigenous Arawakan language spoken by the Piapoco people of Colombia and Venezuela.
  • D. Huambisa language
    The Huambisa language is an indigenous Jivaroan language spoken by the Huambisa (Wampis) people of the northern Peruvian Amazon.
  • E. Kalapalo language
    The Kalapalo language is an indigenous Cariban language spoken by the Kalapalo people of Brazil’s Upper Xingu region.
  • F. None of above. [chosen]
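
Both decisions above selected "None of above", and the object has no disambiguated ID. One way to model that outcome is sketched below. The `Option` class and `resolve_choice` function are hypothetical names, and treating "None of above" and "Unsure" as "no candidate matched" is an assumption about the pipeline, not something this record confirms.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Option:
    letter: str
    label: str


# Option labels that do not name a candidate entity (copied from the lists above).
NON_CANDIDATE_LABELS = {
    "None of above.",
    "Unsure - the case is ambiguous/there is not enough information to decide.",
}


def resolve_choice(options: List[Option], chosen_letter: str) -> Optional[str]:
    """Return the chosen candidate's label, or None if no candidate was selected."""
    for opt in options:
        if opt.letter == chosen_letter:
            # Assumption: a non-candidate choice leaves the target without a match.
            return None if opt.label in NON_CANDIDATE_LABELS else opt.label
    raise ValueError(f"unknown option letter: {chosen_letter!r}")


# Options as shown in NED1 for the target "Tepehuán language".
ned1_options = [
    Option("A", "Sipakapense language"),
    Option("B", "Picurís language"),
    Option("C", "Piapoco language"),
    Option("D", "Huambisa language"),
    Option("E", "Kalapalo language"),
    Option("F", "None of above."),
    Option("G", "Unsure - the case is ambiguous/there is not enough information to decide."),
]

match = resolve_choice(ned1_options, "F")  # the option marked as chosen in NED1
```

Under this sketch, `match` being `None` corresponds to the target receiving no existing candidate's ID.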

Provenance (2 batches)

Stage    | Batch ID                               | Job type    | Status
creating | batch_69e0b50c6a848190a4e525a77a319b8a | elicitation | completed
NER      | batch_69e723fcdb7c8190ae04d6ad9dff3187 | ner         | completed
Created at: April 16, 2026, 2:58 p.m.