Triple
T21815544
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Toabaita |
E538596
|
entity |
| Predicate | hasNeighbouringLanguage |
P16383
|
FINISHED |
| Object | Baelelea language |
—
|
NE NERFINISHED |
How this triple was built (3 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Baelelea language | Statement: [Toabaita, hasNeighbouringLanguage, Baelelea language]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Baelelea language Context triple: [Toabaita, hasNeighbouringLanguage, Baelelea language]
-
A.
Thaua language
The Thaua language is an Indigenous Australian Aboriginal language traditionally spoken by the Thaua (a group of the Yuin people) of the south coast of New South Wales.
-
B.
Kaera language
The Kaera language is a Papuan language spoken by a small community on Pantar Island in eastern Indonesia.
-
C.
Damara language
The Damara language is a Khoe (Central Khoisan) language spoken primarily by the Damara people of Namibia.
-
D.
Itsari language
The Itsari language is a Northeast Caucasian (Dargin) variety spoken in Dagestan, Russia, closely related to the Kubachi language and used by a small local community.
-
E.
Saraveca language
The Saraveca language is an extinct Arawakan language once spoken in Bolivia, known from very limited historical documentation.
- F. None of above. chosen
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Baelelea language Target entity description: The Baelelea language is an Oceanic language spoken by the Baelelea people of Malaita in the Solomon Islands.
-
A.
Thaua language
The Thaua language is an Indigenous Australian Aboriginal language traditionally spoken by the Thaua (a group of the Yuin people) of the south coast of New South Wales.
-
B.
Kaera language
The Kaera language is a Papuan language spoken by a small community on Pantar Island in eastern Indonesia.
-
C.
Damara language
The Damara language is a Khoe (Central Khoisan) language spoken primarily by the Damara people of Namibia.
-
D.
Itsari language
The Itsari language is a Northeast Caucasian (Dargin) variety spoken in Dagestan, Russia, closely related to the Kubachi language and used by a small local community.
-
E.
Saraveca language
The Saraveca language is an extinct Arawakan language once spoken in Bolivia, known from very limited historical documentation.
- F. None of above. chosen
Provenance (2 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69e0c473f0f8819086c9d1b4a143bd67 |
completed | April 16, 2026, 11:13 a.m. |
| NER | Named-entity recognition | batch_69f07cc99bbc8190bf074930f361af7d |
completed | April 28, 2026, 9:24 a.m. |
Created at: April 16, 2026, 6:54 p.m.