Triple

T7355267
Position Surface form Disambiguated ID Type / Status
Subject Ambon E169607 entity
Predicate hasDemonym P191 FINISHED
Object Ambonese E160354 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Ambonese | Statement: [Ambon, hasDemonym, Ambonese]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Ambonese
Context triple: [Ambon, hasDemonym, Ambonese]
  • A. Melayu Ambon
    Melayu Ambon is a regional variety of Malay spoken primarily in and around Ambon Island in eastern Indonesia, known for its distinct phonology and vocabulary influenced by local Austronesian languages and historical contact with European languages.
  • B. Ambon Malay chosen
    Ambon Malay is a regional Malay-based creole spoken primarily in and around Ambon Island in eastern Indonesia, serving as a lingua franca in the Maluku region.
  • C. East Sumbanese
    East Sumbanese is an Austronesian language spoken on the eastern part of Sumba Island in Indonesia.
  • D. Tanimbar languages
    The Tanimbar languages are a subgroup of Austronesian languages spoken primarily in the Tanimbar Islands of eastern Indonesia.
  • E. Flores–Lembata languages
    The Flores–Lembata languages are a subgroup of Austronesian languages spoken on the islands of Flores and Lembata in eastern Indonesia, known for their distinctive phonological and grammatical features within the region.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c68a59f2288190877ca15c19b1e822 completed March 27, 2026, 1:47 p.m.
NER Named-entity recognition batch_69c6f139505c8190a7158cf59a6e089e completed March 27, 2026, 9:06 p.m.
NED1 Entity disambiguation (via context triple) batch_69c802b69cb4819096815b1fac284840 completed March 28, 2026, 4:32 p.m.
Created at: March 27, 2026, 3:05 p.m.