Triple
T21145441
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Santa María de Ocotán dialect |
E521041
|
entity |
| Predicate | subdivisionOf |
P258
|
FINISHED |
| Object | Tepehuán language |
—
|
NE NERFINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Tepehuán language | Statement: [Santa María de Ocotán dialect, subdivisionOf, Tepehuán language]
Disambiguation candidates (2 decisions)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Tepehuán language Context triple: [Santa María de Ocotán dialect, subdivisionOf, Tepehuán language]
-
A.
Sipakapense language
The Sipakapense language is a Mayan language spoken by the Sipakapense people in the western highlands of Guatemala.
-
B.
Picurís language
The Picurís language is a Native American Tanoan language traditionally spoken by the Picurís Pueblo people of northern New Mexico and now considered highly endangered.
-
C.
Piapoco language
The Piapoco language is an indigenous Arawakan language spoken by the Piapoco people of Colombia and Venezuela.
-
D.
Huambisa language
The Huambisa language is an indigenous Jivaroan language spoken by the Huambisa (Wampis) people of the northern Peruvian Amazon.
-
E.
Kalapalo language
The Kalapalo language is an indigenous Cariban language spoken by the Kalapalo people of Brazil’s Upper Xingu region.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Tepehuán language Target entity description: The Tepehuán language is a Uto-Aztecan indigenous language of northern Mexico spoken by the Tepehuán people in several regional dialects.
-
A.
Sipakapense language
The Sipakapense language is a Mayan language spoken by the Sipakapense people in the western highlands of Guatemala.
-
B.
Picurís language
The Picurís language is a Native American Tanoan language traditionally spoken by the Picurís Pueblo people of northern New Mexico and now considered highly endangered.
-
C.
Piapoco language
The Piapoco language is an indigenous Arawakan language spoken by the Piapoco people of Colombia and Venezuela.
-
D.
Huambisa language
The Huambisa language is an indigenous Jivaroan language spoken by the Huambisa (Wampis) people of the northern Peruvian Amazon.
-
E.
Kalapalo language
The Kalapalo language is an indigenous Cariban language spoken by the Kalapalo people of Brazil’s Upper Xingu region.
- F. None of above. chosen
Provenance (2 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69e0b50c6a848190a4e525a77a319b8a |
elicitation | completed |
| NER | batch_69e723fcdb7c8190ae04d6ad9dff3187 |
ner | completed |
Created at: April 16, 2026, 2:58 p.m.