Triple

T15339817
Position Surface form Disambiguated ID Type / Status
Subject Comecrudo language E366759 entity
Predicate alternativeName P39 FINISHED
Object Carrizo language
The Carrizo language, also known as Comecrudo, is an extinct Native American language formerly spoken by the Comecrudo people of southern Texas and northeastern Mexico.
E1152017 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Carrizo language | Statement: [Comecrudo language, alternativeName, Carrizo language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Carrizo language
Context triple: [Comecrudo language, alternativeName, Carrizo language]
  • A. Carijona language
    The Carijona language is an indigenous Cariban language spoken by the Carijona people of Colombia, now highly endangered with only a small number of elderly speakers remaining.
  • B. Cavineña language
    The Cavineña language is an indigenous Tacanan language spoken by the Cavineña people of northern Bolivia.
  • C. Guarijío language
    The Guarijío language is an indigenous Uto-Aztecan language spoken by the Guarijío people of northern Mexico, particularly in the states of Chihuahua and Sonora.
  • D. Cabiyari language
    The Cabiyari language is an indigenous Arawakan language spoken by the Cabiyari people of the northwestern Amazon region.
  • E. Diegueño language
    The Diegueño language is a Yuman language traditionally spoken by the Kumeyaay (Diegueño) people of southern California and northern Baja California.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Carrizo language
Triple: [Comecrudo language, alternativeName, Carrizo language]
Generated description
The Carrizo language, also known as Comecrudo, is an extinct Native American language formerly spoken by the Comecrudo people of southern Texas and northeastern Mexico.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Carrizo language
Target entity description: The Carrizo language, also known as Comecrudo, is an extinct Native American language formerly spoken by the Comecrudo people of southern Texas and northeastern Mexico.
  • A. Carijona language
    The Carijona language is an indigenous Cariban language spoken by the Carijona people of Colombia, now highly endangered with only a small number of elderly speakers remaining.
  • B. Cavineña language
    The Cavineña language is an indigenous Tacanan language spoken by the Cavineña people of northern Bolivia.
  • C. Guarijío language
    The Guarijío language is an indigenous Uto-Aztecan language spoken by the Guarijío people of northern Mexico, particularly in the states of Chihuahua and Sonora.
  • D. Cabiyari language
    The Cabiyari language is an indigenous Arawakan language spoken by the Cabiyari people of the northwestern Amazon region.
  • E. Diegueño language
    The Diegueño language is a Yuman language traditionally spoken by the Kumeyaay (Diegueño) people of southern California and northern Baja California.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d85a1355608190a6673ddb67231d54 completed April 10, 2026, 2:01 a.m.
NER Named-entity recognition batch_69e03e12eb7c8190944a260aa1aa9156 completed April 16, 2026, 1:40 a.m.
NED1 Entity disambiguation (via context triple) batch_69ff01f55e9881908612c4e384836fb5 completed May 9, 2026, 9:44 a.m.
NEDg Description generation batch_69ff030970588190a793cd710e819635 completed May 9, 2026, 9:48 a.m.
NED2 Entity disambiguation (via description) batch_69ff04633de08190867ad3baec6b2b02 completed May 9, 2026, 9:54 a.m.
Created at: April 10, 2026, 3:17 a.m.