Triple

T7408144
Position Surface form Disambiguated ID Type / Status
Subject Sawai language E170931 entity
Predicate neighboringLanguage P16383 FINISHED
Object Taba language E661507 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Taba language | Statement: [Sawai language, neighboringLanguage, Taba language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Taba language
Context triple: [Sawai language, neighboringLanguage, Taba language]
  • A. Taba language chosen
    Taba is an Austronesian language spoken in eastern Indonesia, particularly on the island of Makian in North Maluku.
  • B. Tubatulabal language
    The Tubatulabal language is an endangered Uto-Aztecan language traditionally spoken by the Tubatulabal people of the southern Sierra Nevada region in California.
  • C. Tawala language
    Tawala language is an Austronesian language of the Papuan Tip region of Papua New Guinea, spoken primarily in coastal communities of Milne Bay Province.
  • D. Kaxabu language
    The Kaxabu language is an indigenous Formosan language of Taiwan spoken by the Kaxabu people and considered highly endangered.
  • E. Tawbuid language
    The Tawbuid language is an Austronesian language spoken by the Tawbuid (Batangan) Mangyan people of Mindoro in the Philippines, closely related to other South Mangyan languages.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c68a6010108190925e5284de022660 completed March 27, 2026, 1:47 p.m.
NER Named-entity recognition batch_69c6f29acf588190a7c4056bdc4f3ffc completed March 27, 2026, 9:11 p.m.
NED1 Entity disambiguation (via context triple) batch_69c82775d1188190bcf158da5a02b6e0 completed March 28, 2026, 7:09 p.m.
Created at: March 27, 2026, 3:10 p.m.