Triple

T21429298
Position | Surface form | Disambiguated ID | Type / Status
Subject | Bidayuh language | E528641 | entity
Predicate | hasDialect | P4251 | FINISHED
Object | Biatah Bidayuh | | NE NERFINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch and timestamp of each step are listed in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Biatah Bidayuh | Statement: [Bidayuh language, hasDialect, Biatah Bidayuh]
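The exchange above can be sketched in Python. This is a minimal illustration, not the pipeline's actual code: the helper names `build_ner_input` and `parse_ner_response` are hypothetical, and the sample reply is made up for the example, since the record does not show the raw model output. Only the input layout and the `phrase` / `is_ne` fields come from the instruction above.

```python
import json

def build_ner_input(phrase, triple):
    """Format the NER input line in the layout shown in the Input field."""
    subject, predicate, obj = triple
    return f"Phrase: {phrase} | Statement: [{subject}, {predicate}, {obj}]"

def parse_ner_response(raw, expected_phrase):
    """Parse a model reply and check the two fields the instruction asks for."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")
    if not isinstance(data.get("phrase"), str):
        raise ValueError("'phrase' must be a string")
    if not isinstance(data.get("is_ne"), bool):
        raise ValueError("'is_ne' must be a boolean")
    if data["phrase"] != expected_phrase:
        raise ValueError("reply refers to a different phrase")
    return data["is_ne"]

triple = ("Bidayuh language", "hasDialect", "Biatah Bidayuh")
ner_input = build_ner_input("Biatah Bidayuh", triple)

# Hypothetical reply, invented for this example only.
sample_reply = '{"phrase": "Biatah Bidayuh", "is_ne": true}'
is_ne = parse_ner_response(sample_reply, "Biatah Bidayuh")
```

Checking the types explicitly guards against replies such as `"is_ne": "yes"`, which are valid JSON but do not match the requested schema.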
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Biatah Bidayuh
Context triple: [Bidayuh language, hasDialect, Biatah Bidayuh]
  • A. Biatah Bidayuh (chosen)
    Biatah Bidayuh is an Austronesian language spoken by a subgroup of the Bidayuh people in Sarawak, Malaysia.
  • B. Bayan Lepas
    Bayan Lepas is a township in the southwestern part of Penang Island, Malaysia, known for its industrial zone, residential areas, and role as a key transportation and commercial hub.
  • C. Orang Seletar
    Orang Seletar are an indigenous Orang Asli sea people of southern Peninsular Malaysia and nearby Singapore, traditionally living as coastal and riverine fisher-foragers.
  • D. Nusajaya
    Nusajaya is a planned township in Johor, Malaysia, developed as a key administrative, commercial, and residential hub within the Iskandar Malaysia economic corridor.
  • E. Sitiawan
    Sitiawan is a coastal town in the Manjung District of Perak, Malaysia, known for its fishing industry and proximity to the port city of Lumut.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
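A consumer of this step might resolve the picked letter against the option list as follows. This is a minimal Python sketch under stated assumptions: the `resolve_choice` helper and its outcome labels (`matched`, `no_match`, `unsure`) are hypothetical, and only the option labels are copied from the list above.

```python
# Candidate labels copied from the option list above, keyed by letter.
CANDIDATES = {
    "A": "Biatah Bidayuh",
    "B": "Bayan Lepas",
    "C": "Orang Seletar",
    "D": "Nusajaya",
    "E": "Sitiawan",
}
NONE_OF_ABOVE = "F"  # "None of above."
UNSURE = "G"         # "Unsure - the case is ambiguous/..."

def resolve_choice(letter):
    """Map a picked letter to (outcome, candidate label or None)."""
    letter = letter.strip().upper()
    if letter in CANDIDATES:
        return ("matched", CANDIDATES[letter])
    if letter == NONE_OF_ABOVE:
        return ("no_match", None)
    if letter == UNSURE:
        return ("unsure", None)
    raise ValueError(f"unknown option: {letter!r}")

# Option A is the pick recorded for this triple.
outcome, label = resolve_choice("A")
```

Keeping "none of the above" and "unsure" as separate outcomes preserves the distinction the option list makes between a confident rejection and an ambiguous case.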

Provenance (2 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69e0c455f3688190810bc96365791b0f | completed | April 16, 2026, 11:13 a.m.
NER | Named-entity recognition | batch_69ee813ef6a8819089511b8f608c9491 | completed | April 26, 2026, 9:18 p.m.
Created at: April 16, 2026, 5:49 p.m.
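To check the batch ordering described above, the timestamps can be parsed and sorted. This is a minimal Python sketch, assuming every timestamp follows the `Month D, YYYY, H:MM a.m./p.m.` layout seen in the Provenance table and that an English locale is active for month names. `parse_when` is a hypothetical helper, and no time zone is applied because the record gives none.

```python
from datetime import datetime

def parse_when(text):
    """Parse a timestamp such as 'April 16, 2026, 11:13 a.m.'."""
    # strptime's %p expects AM/PM, so normalize the dotted forms first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")

# Rows copied from the Provenance table, deliberately out of order.
batches = [
    ("NER", "batch_69ee813ef6a8819089511b8f608c9491", "April 26, 2026, 9:18 p.m."),
    ("creating", "batch_69e0c455f3688190810bc96365791b0f", "April 16, 2026, 11:13 a.m."),
]

# Sort by parsed batch-level timestamp to recover the processing order.
ordered = sorted(batches, key=lambda row: parse_when(row[2]))
steps_in_order = [step for step, _batch_id, _when in ordered]

# Time elapsed between the elicitation batch and the NER batch.
gap = parse_when(ordered[-1][2]) - parse_when(ordered[0][2])
```

Because the timestamps are batch-level, the gap measures the distance between processing waves, not the time spent on this individual triple.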