Triple

T21815544
Position Surface form Disambiguated ID Type / Status
Subject Toabaita E538596 entity
Predicate hasNeighbouringLanguage P16383 FINISHED
Object Baelelea language NE NERFINISHED

How this triple was built (3 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Baelelea language | Statement: [Toabaita, hasNeighbouringLanguage, Baelelea language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Baelelea language
Context triple: [Toabaita, hasNeighbouringLanguage, Baelelea language]
  • A. Thaua language
    The Thaua language is an Indigenous Australian Aboriginal language traditionally spoken by the Thaua (a group of the Yuin people) of the south coast of New South Wales.
  • B. Kaera language
    The Kaera language is a Papuan language spoken by a small community on Pantar Island in eastern Indonesia.
  • C. Damara language
    The Damara language is a Khoe (Central Khoisan) language spoken primarily by the Damara people of Namibia.
  • D. Itsari language
    The Itsari language is a Northeast Caucasian (Dargin) variety spoken in Dagestan, Russia, closely related to the Kubachi language and used by a small local community.
  • E. Saraveca language
    The Saraveca language is an extinct Arawakan language once spoken in Bolivia, known from very limited historical documentation.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Baelelea language
Target entity description: The Baelelea language is an Oceanic language spoken by the Baelelea people of Malaita in the Solomon Islands.
  • A. Thaua language
    The Thaua language is an Indigenous Australian Aboriginal language traditionally spoken by the Thaua (a group of the Yuin people) of the south coast of New South Wales.
  • B. Kaera language
    The Kaera language is a Papuan language spoken by a small community on Pantar Island in eastern Indonesia.
  • C. Damara language
    The Damara language is a Khoe (Central Khoisan) language spoken primarily by the Damara people of Namibia.
  • D. Itsari language
    The Itsari language is a Northeast Caucasian (Dargin) variety spoken in Dagestan, Russia, closely related to the Kubachi language and used by a small local community.
  • E. Saraveca language
    The Saraveca language is an extinct Arawakan language once spoken in Bolivia, known from very limited historical documentation.
  • F. None of above. chosen

Provenance (2 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69e0c473f0f8819086c9d1b4a143bd67 completed April 16, 2026, 11:13 a.m.
NER Named-entity recognition batch_69f07cc99bbc8190bf074930f361af7d completed April 28, 2026, 9:24 a.m.
Created at: April 16, 2026, 6:54 p.m.