Triple
T7455728
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Tupe District |
E172116
|
entity |
| Predicate | languageEndangermentContext |
P39119
|
FINISHED |
| Object |
Jaqaru language preservation area
The Jaqaru language preservation area is a designated region in Peru’s Tupe District where efforts focus on maintaining and revitalizing the indigenous Jaqaru language and its associated cultural traditions.
|
E665741
|
NE FINISHED |
How this triple was built (5 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Jaqaru language preservation area | Statement: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Jaqaru language preservation area Context triple: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
-
A.
Talamanca Cabécar Indigenous Reserve
The Talamanca Cabécar Indigenous Reserve is a protected territory in Costa Rica that serves as a primary homeland for the Cabécar people, preserving their traditional culture, language, and relationship with the surrounding rainforest.
-
B.
Hara Protected Area
Hara Protected Area is a coastal mangrove ecosystem and wildlife sanctuary on Qeshm Island in southern Iran, known for its rich biodiversity and important habitat for migratory birds.
-
C.
Talamanca Indigenous Territory
Talamanca Indigenous Territory is a protected ancestral region in southeastern Costa Rica that serves as a major homeland and cultural center for the Bribri people.
-
D.
Salitre Indigenous Territory
Salitre Indigenous Territory is an autonomous Bribri indigenous reserve in southern Costa Rica, known for its traditional culture, communal land tenure, and ongoing struggles for land rights and self-determination.
-
E.
Mocho-Choshuenco National Reserve
Mocho-Choshuenco National Reserve is a protected natural area in southern Chile centered around the Mocho-Choshuenco volcano, known for its forests, volcanic landscapes, and outdoor recreation opportunities.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg
Description generation
gpt-5.1
Instruction
Generate a one-sentence description of the target entity. You are given a context triple in the form (subject, predicate, object), where the object is the target entity. # Instructions Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. Avoid repeating the information from the triple, unless really essential. # Response Format Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Jaqaru language preservation area Triple: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
Generated description
The Jaqaru language preservation area is a designated region in Peru’s Tupe District where efforts focus on maintaining and revitalizing the indigenous Jaqaru language and its associated cultural traditions.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Jaqaru language preservation area Target entity description: The Jaqaru language preservation area is a designated region in Peru’s Tupe District where efforts focus on maintaining and revitalizing the indigenous Jaqaru language and its associated cultural traditions.
-
A.
Talamanca Cabécar Indigenous Reserve
The Talamanca Cabécar Indigenous Reserve is a protected territory in Costa Rica that serves as a primary homeland for the Cabécar people, preserving their traditional culture, language, and relationship with the surrounding rainforest.
-
B.
Hara Protected Area
Hara Protected Area is a coastal mangrove ecosystem and wildlife sanctuary on Qeshm Island in southern Iran, known for its rich biodiversity and important habitat for migratory birds.
-
C.
Talamanca Indigenous Territory
Talamanca Indigenous Territory is a protected ancestral region in southeastern Costa Rica that serves as a major homeland and cultural center for the Bribri people.
-
D.
Salitre Indigenous Territory
Salitre Indigenous Territory is an autonomous Bribri indigenous reserve in southern Costa Rica, known for its traditional culture, communal land tenure, and ongoing struggles for land rights and self-determination.
-
E.
Mocho-Choshuenco National Reserve
Mocho-Choshuenco National Reserve is a protected natural area in southern Chile centered around the Mocho-Choshuenco volcano, known for its forests, volcanic landscapes, and outdoor recreation opportunities.
- F. None of above. chosen
PD
Predicate disambiguation
gpt-5-mini-2025-08-07
Target predicate: languageEndangermentContext Context triple: [Tupe District, languageEndangermentContext, Jaqaru language preservation area]
-
A.
languageEndangermentFactors
Indicates the various social, political, economic, and cultural conditions that contribute to a language becoming vulnerable, endangered, or extinct.
-
B.
languageEndangermentStatus
Indicates the degree to which a language is at risk of falling out of use or becoming extinct.
-
C.
ethnicLanguageStatus
Indicates the status or role of a language in relation to a particular ethnic group (e.g., primary, secondary, heritage, or minority language).
-
D.
languageDiversity
Indicates the degree to which multiple distinct languages are present and used within a given context or population.
-
E.
includesEndangeredLanguages
chosen
Indicates that the subject contains, encompasses, or otherwise involves one or more languages classified as endangered.
- F. None of above.
Provenance (6 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69c68a66554c8190add75c65942c0317 |
completed | March 27, 2026, 1:47 p.m. |
| NER | Named-entity recognition | batch_69c6f3af58dc819093fb0482482779a3 |
completed | March 27, 2026, 9:16 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69c827c39a848190bc275468362ce3bc |
completed | March 28, 2026, 7:10 p.m. |
| NEDg | Description generation | batch_69c828e1487081908de825d60ea38c9e |
completed | March 28, 2026, 7:15 p.m. |
| NED2 | Entity disambiguation (via description) | batch_69c829d8a15c8190911e0d7bda39c280 |
completed | March 28, 2026, 7:19 p.m. |
| PD | Predicate disambiguation | batch_69c6f039f7248190bb4183f97b605763 |
completed | March 27, 2026, 9:01 p.m. |
Created at: March 27, 2026, 3:15 p.m.