Triple

T15350477
Position Surface form Disambiguated ID Type / Status
Subject Skarù∙ręʼ E367036 entity
Predicate speaksLanguage P741 FINISHED
Object Tuscarora language E113051 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Tuscarora language | Statement: [Skarù∙ręʼ, speaksLanguage, Tuscarora language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Tuscarora language
Context triple: [Skarù∙ręʼ, speaksLanguage, Tuscarora language]
  • A. Tuscarora language chosen
    The Tuscarora language is an Iroquoian language historically spoken by the Tuscarora people of the Eastern Woodlands, now critically endangered with only a few fluent speakers and ongoing revitalization efforts.
  • B. Catawba language
    Catawba language is an endangered Siouan language historically spoken by the Catawba people of the southeastern United States, primarily in South Carolina.
  • C. Cheroenhaka language
    The Cheroenhaka language, also known as Nottoway, is an Iroquoian language historically spoken by the Nottoway (Cheroenhaka) people of southeastern Virginia.
  • D. Munsee language
    The Munsee language is an Eastern Algonquian Indigenous language traditionally spoken by the Munsee Lenape people of the northeastern United States and adjacent Canada, now critically endangered with only a few fluent speakers.
  • E. Wampanoag language
    The Wampanoag language is an Algonquian Native American language of the northeastern United States that has been the focus of significant revitalization efforts after having no native speakers for many generations.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d85a1355608190a6673ddb67231d54 completed April 10, 2026, 2:01 a.m.
NER Named-entity recognition batch_69e03e290efc8190b22c95dcd3e5f57f completed April 16, 2026, 1:40 a.m.
NED1 Entity disambiguation (via context triple) batch_69ff01fd53688190939787a3d6ff3bb9 completed May 9, 2026, 9:44 a.m.
Created at: April 10, 2026, 3:17 a.m.