Triple

T7455728
Position Surface form Disambiguated ID Type / Status
Subject Tupe District E172116 entity
Predicate languageEndangermentContext P39119 FINISHED
Object Jaqaru language preservation area
The Jaqaru language preservation area is a designated region in Peru’s Tupe District where efforts focus on maintaining and revitalizing the indigenous Jaqaru language and its associated cultural traditions.
E665741 NE FINISHED

How this triple was built (5 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Jaqaru language preservation area | Statement: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Jaqaru language preservation area
Context triple: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
  • A. Talamanca Cabécar Indigenous Reserve
    The Talamanca Cabécar Indigenous Reserve is a protected territory in Costa Rica that serves as a primary homeland for the Cabécar people, preserving their traditional culture, language, and relationship with the surrounding rainforest.
  • B. Hara Protected Area
    Hara Protected Area is a coastal mangrove ecosystem and wildlife sanctuary on Qeshm Island in southern Iran, known for its rich biodiversity and important habitat for migratory birds.
  • C. Talamanca Indigenous Territory
    Talamanca Indigenous Territory is a protected ancestral region in southeastern Costa Rica that serves as a major homeland and cultural center for the Bribri people.
  • D. Salitre Indigenous Territory
    Salitre Indigenous Territory is an autonomous Bribri indigenous reserve in southern Costa Rica, known for its traditional culture, communal land tenure, and ongoing struggles for land rights and self-determination.
  • E. Mocho-Choshuenco National Reserve
    Mocho-Choshuenco National Reserve is a protected natural area in southern Chile centered around the Mocho-Choshuenco volcano, known for its forests, volcanic landscapes, and outdoor recreation opportunities.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Jaqaru language preservation area
Triple: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
Generated description
The Jaqaru language preservation area is a designated region in Peru’s Tupe District where efforts focus on maintaining and revitalizing the indigenous Jaqaru language and its associated cultural traditions.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Jaqaru language preservation area
Target entity description: The Jaqaru language preservation area is a designated region in Peru’s Tupe District where efforts focus on maintaining and revitalizing the indigenous Jaqaru language and its associated cultural traditions.
  • A. Talamanca Cabécar Indigenous Reserve
    The Talamanca Cabécar Indigenous Reserve is a protected territory in Costa Rica that serves as a primary homeland for the Cabécar people, preserving their traditional culture, language, and relationship with the surrounding rainforest.
  • B. Hara Protected Area
    Hara Protected Area is a coastal mangrove ecosystem and wildlife sanctuary on Qeshm Island in southern Iran, known for its rich biodiversity and important habitat for migratory birds.
  • C. Talamanca Indigenous Territory
    Talamanca Indigenous Territory is a protected ancestral region in southeastern Costa Rica that serves as a major homeland and cultural center for the Bribri people.
  • D. Salitre Indigenous Territory
    Salitre Indigenous Territory is an autonomous Bribri indigenous reserve in southern Costa Rica, known for its traditional culture, communal land tenure, and ongoing struggles for land rights and self-determination.
  • E. Mocho-Choshuenco National Reserve
    Mocho-Choshuenco National Reserve is a protected natural area in southern Chile centered around the Mocho-Choshuenco volcano, known for its forests, volcanic landscapes, and outdoor recreation opportunities.
  • F. None of above. chosen
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: languageEndangermentContext
Context triple: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
  • A. languageEndangermentFactors
    Indicates the various social, political, economic, and cultural conditions that contribute to a language becoming vulnerable, endangered, or extinct.
  • B. languageEndangermentStatus
    Indicates the degree to which a language is at risk of falling out of use or becoming extinct.
  • C. ethnicLanguageStatus
    Indicates the status or role of a language in relation to a particular ethnic group (e.g., primary, secondary, heritage, or minority language).
  • D. languageDiversity
    Indicates the degree to which multiple distinct languages are present and used within a given context or population.
  • E. includesEndangeredLanguages chosen
    Indicates that the subject contains, encompasses, or otherwise involves one or more languages classified as endangered.
  • F. None of above.

Provenance (6 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c68a66554c8190add75c65942c0317 completed March 27, 2026, 1:47 p.m.
NER Named-entity recognition batch_69c6f3af58dc819093fb0482482779a3 completed March 27, 2026, 9:16 p.m.
NED1 Entity disambiguation (via context triple) batch_69c827c39a848190bc275468362ce3bc completed March 28, 2026, 7:10 p.m.
NEDg Description generation batch_69c828e1487081908de825d60ea38c9e completed March 28, 2026, 7:15 p.m.
NED2 Entity disambiguation (via description) batch_69c829d8a15c8190911e0d7bda39c280 completed March 28, 2026, 7:19 p.m.
PD Predicate disambiguation batch_69c6f039f7248190bb4183f97b605763 completed March 27, 2026, 9:01 p.m.
Created at: March 27, 2026, 3:15 p.m.