Triple

T9212757
Position Surface form Disambiguated ID Type / Status
Subject Aru Islands E221165 entity
Predicate hasLanguage P15 FINISHED
Object Batuley language
The Batuley language is an Austronesian language spoken by a small community in Indonesia’s Aru Islands.
E787286 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Batuley language | Statement: [Aru Islands, hasLanguage, Batuley language]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Batuley language
Context triple: [Aru Islands, hasLanguage, Batuley language]
  • A. Baliledu language
    The Baliledu language is an Austronesian language of the Bima–Sumba subgroup spoken by a local community in eastern Indonesia.
  • B. Baule language
    The Baule language is a Central Tano language of the Akan group spoken primarily by the Baoulé people in Côte d'Ivoire.
  • C. Hoanya language
    The Hoanya language is an extinct Austronesian language once spoken by the Hoanya people of western Taiwan and classified among the indigenous Formosan languages.
  • D. Patamona language
    The Patamona language is an indigenous Cariban language spoken by the Patamona people of the Guiana Highlands in Guyana and northern Brazil.
  • E. Tawbuid language
    The Tawbuid language is an Austronesian language spoken by the Tawbuid (Batangan) Mangyan people of Mindoro in the Philippines, closely related to other South Mangyan languages.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Batuley language
Triple: [Aru Islands, hasLanguage, Batuley language]
Generated description
The Batuley language is an Austronesian language spoken by a small community in Indonesia’s Aru Islands.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Batuley language
Target entity description: The Batuley language is an Austronesian language spoken by a small community in Indonesia’s Aru Islands.
  • A. Baliledu language
    The Baliledu language is an Austronesian language of the Bima–Sumba subgroup spoken by a local community in eastern Indonesia.
  • B. Baule language
    The Baule language is a Central Tano language of the Akan group spoken primarily by the Baoulé people in Côte d'Ivoire.
  • C. Hoanya language
    The Hoanya language is an extinct Austronesian language once spoken by the Hoanya people of western Taiwan and classified among the indigenous Formosan languages.
  • D. Patamona language
    The Patamona language is an indigenous Cariban language spoken by the Patamona people of the Guiana Highlands in Guyana and northern Brazil.
  • E. Tawbuid language
    The Tawbuid language is an Austronesian language spoken by the Tawbuid (Batangan) Mangyan people of Mindoro in the Philippines, closely related to other South Mangyan languages.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69ca83e9d0e081908bdb71097201a06c completed March 30, 2026, 2:08 p.m.
NER Named-entity recognition batch_69ccda05406081909893bec3a092d3ce completed April 1, 2026, 8:40 a.m.
NED1 Entity disambiguation (via context triple) batch_69d0778e8dc48190bbae39137df966e3 completed April 4, 2026, 2:29 a.m.
NEDg Description generation batch_69d07f131494819096d5313b1e42c11e completed April 4, 2026, 3:01 a.m.
NED2 Entity disambiguation (via description) batch_69d07f5f981081908d4c4a392e4c1ffe completed April 4, 2026, 3:02 a.m.
Created at: March 30, 2026, 7:27 p.m.