Triple

T7007189
Position Surface form Disambiguated ID Type / Status
Subject Simbo language E162485 entity
Predicate subfamily P4180 FINISHED
Object Northwest Solomonic languages E27244 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Northwest Solomonic languages | Statement: [Simbo language, subfamily, Northwest Solomonic languages]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Northwest Solomonic languages
Context triple: [Simbo language, subfamily, Northwest Solomonic languages]
  • A. Northwest Solomonic languages chosen
    The Northwest Solomonic languages are a subgroup of Oceanic languages spoken primarily in the northwestern Solomon Islands and nearby regions, known for their distinctive phonological and grammatical innovations within the Austronesian family.
  • B. Western Bismarck languages
    Western Bismarck languages are a subgroup of Oceanic Austronesian languages spoken primarily in the Bismarck Archipelago of Papua New Guinea.
  • C. Takic languages
    Takic languages are a branch of the Uto-Aztecan language family spoken historically in Southern California and neighboring regions by various Indigenous peoples.
  • D. Surmic languages
    The Surmic languages are a branch of the Nilo-Saharan language family spoken primarily in southwestern Ethiopia and neighboring regions, known for their complex phonologies and rich systems of noun classification.
  • E. Muskogean languages
    The Muskogean languages are a family of indigenous languages of the Southeastern United States, traditionally spoken by Native American peoples such as the Creek, Choctaw, Chickasaw, and Seminole.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c6885928148190ae31909fbb5e9849 completed March 27, 2026, 1:38 p.m.
NER Named-entity recognition batch_69c6dc35cb848190a839919021efce81 completed March 27, 2026, 7:36 p.m.
NED1 Entity disambiguation (via context triple) batch_69c76a43c3a081909b9150d36ba107f5 completed March 28, 2026, 5:42 a.m.
Created at: March 27, 2026, 2:33 p.m.