Triple
T9212757
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Aru Islands |
E221165
|
entity |
| Predicate | hasLanguage |
P15
|
FINISHED |
| Object |
Batuley language
The Batuley language is an Austronesian language spoken by a small community in Indonesia’s Aru Islands.
|
E787286
|
NE FINISHED |
How this triple was built (4 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Batuley language | Statement: [Aru Islands, hasLanguage, Batuley language]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Batuley language Context triple: [Aru Islands, hasLanguage, Batuley language]
-
A.
Baliledu language
The Baliledu language is an Austronesian language of the Bima–Sumba subgroup spoken by a local community in eastern Indonesia.
-
B.
Baule language
The Baule language is a Central Tano language of the Akan group spoken primarily by the Baoulé people in Côte d'Ivoire.
-
C.
Hoanya language
The Hoanya language is an extinct Austronesian language once spoken by the Hoanya people of western Taiwan and classified among the indigenous Formosan languages.
-
D.
Patamona language
The Patamona language is an indigenous Cariban language spoken by the Patamona people of the Guiana Highlands in Guyana and northern Brazil.
-
E.
Tawbuid language
The Tawbuid language is an Austronesian language spoken by the Tawbuid (Batangan) Mangyan people of Mindoro in the Philippines, closely related to other South Mangyan languages.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg
Description generation
gpt-5.1
Instruction
Generate a one-sentence description of the target entity. You are given a context triple in the form (subject, predicate, object), where the object is the target entity. # Instructions Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. Avoid repeating the information from the triple, unless really essential. # Response Format Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Batuley language Triple: [Aru Islands, hasLanguage, Batuley language]
Generated description
The Batuley language is an Austronesian language spoken by a small community in Indonesia’s Aru Islands.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Batuley language Target entity description: The Batuley language is an Austronesian language spoken by a small community in Indonesia’s Aru Islands.
-
A.
Baliledu language
The Baliledu language is an Austronesian language of the Bima–Sumba subgroup spoken by a local community in eastern Indonesia.
-
B.
Baule language
The Baule language is a Central Tano language of the Akan group spoken primarily by the Baoulé people in Côte d'Ivoire.
-
C.
Hoanya language
The Hoanya language is an extinct Austronesian language once spoken by the Hoanya people of western Taiwan and classified among the indigenous Formosan languages.
-
D.
Patamona language
The Patamona language is an indigenous Cariban language spoken by the Patamona people of the Guiana Highlands in Guyana and northern Brazil.
-
E.
Tawbuid language
The Tawbuid language is an Austronesian language spoken by the Tawbuid (Batangan) Mangyan people of Mindoro in the Philippines, closely related to other South Mangyan languages.
- F. None of above. chosen
Provenance (5 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69ca83e9d0e081908bdb71097201a06c |
completed | March 30, 2026, 2:08 p.m. |
| NER | Named-entity recognition | batch_69ccda05406081909893bec3a092d3ce |
completed | April 1, 2026, 8:40 a.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69d0778e8dc48190bbae39137df966e3 |
completed | April 4, 2026, 2:29 a.m. |
| NEDg | Description generation | batch_69d07f131494819096d5313b1e42c11e |
completed | April 4, 2026, 3:01 a.m. |
| NED2 | Entity disambiguation (via description) | batch_69d07f5f981081908d4c4a392e4c1ffe |
completed | April 4, 2026, 3:02 a.m. |
Created at: March 30, 2026, 7:27 p.m.