Triple

T21429284
Position Surface form Disambiguated ID Type / Status
Subject Bidayuh language E528641 entity
Predicate spokenIn P2266 FINISHED
Object Sarawak NE NERFINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Sarawak | Statement: [Bidayuh language, spokenIn, Sarawak]

Disambiguation candidates (1 decision)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Sarawak
Context triple: [Bidayuh language, spokenIn, Sarawak]
  • A. Sarawak chosen
    Sarawak is a resource-rich Malaysian state on the island of Borneo, known for its diverse indigenous cultures, extensive rainforests, and long history under the rule of the White Rajahs before joining Malaysia.
  • B. Sabah
    Sabah is a major Turkish daily newspaper known for its wide circulation and coverage of national news, politics, and entertainment.
  • C. Sabah
    Sabah is a Malaysian state on the northern portion of Borneo, known for its rich biodiversity, indigenous cultures, and iconic Mount Kinabalu.
  • D. Brunei-Kedayan
    Brunei-Kedayan is a Malayic language variety spoken primarily by the Kedayan ethnic group in Brunei and surrounding regions of Borneo.
  • E. Perak
    Perak is a Malaysian state on the west coast of the Malay Peninsula, historically known for its rich tin deposits and former status as a key sultanate within British Malaya.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (2 batches)

Stage Batch ID Job type Status
creating batch_69e0c455f3688190810bc96365791b0f elicitation completed
NER batch_69ee813db52c8190ac933bc6ec4dbf77 ner completed
Created at: April 16, 2026, 5:49 p.m.