Triple
T21429303
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Bidayuh language | E528641 | entity |
| Predicate | isPartOf | P10 | FINISHED |
| Object | Borneo linguistic area | — | NE NERFINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Borneo linguistic area | Statement: [Bidayuh language, isPartOf, Borneo linguistic area]
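The instruction asks for a JSON object with a string `phrase` and a boolean `is_ne`. As a minimal sketch of how a consumer might check a response of that shape, the following uses only Python's standard `json` module. The function name `parse_ner_response` and the example response string are hypothetical; the page does not show the model's actual output or the pipeline's own validation code.

```python
import json


def parse_ner_response(raw: str) -> dict:
    """Parse a response and check it has the two fields the instruction names.

    Hypothetical helper: not taken from the pipeline that produced this page.
    """
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("response must be a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")

    # `phrase` must be a string and `is_ne` a real boolean (not "true" or 1).
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")

    return {"phrase": phrase, "is_ne": is_ne}


# Assumed example response, consistent with the object's NE type in the table.
example = '{"phrase": "Borneo linguistic area", "is_ne": true}'
result = parse_ner_response(example)
print(result)
```

Checking `is_ne` with `isinstance(..., bool)` rejects string values such as `"yes"`, which would otherwise pass a looser truthiness test.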
Disambiguation candidates (2 decisions)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Borneo linguistic area
Context triple: [Bidayuh language, isPartOf, Borneo linguistic area]
- A. Sulawesi linguistic area: The Sulawesi linguistic area is a region of the Indonesian island of Sulawesi characterized by intense contact among diverse Austronesian and Papuan languages, leading to shared structural features across otherwise unrelated language groups.
- B. Bornean languages: Bornean languages are a subgroup of the Malayo-Polynesian language family spoken on the island of Borneo, encompassing diverse indigenous languages of Brunei, Indonesia, and Malaysia.
- C. Southeast Asia linguistic area: The Southeast Asia linguistic area is a region where languages from diverse families have converged to share common structural features such as tonal systems, analytic grammar, and similar word order due to long-term contact and diffusion.
- D. Barito–Mahakam languages: The Barito–Mahakam languages are a subgroup of Austronesian languages spoken in parts of Borneo, particularly along the Barito and Mahakam river regions.
- E. Indo-Burma linguistic area: The Indo-Burma linguistic area is a geographically defined region of South and Southeast Asia where diverse languages, especially from Tibeto-Burman and related families, have converged and shared structural features through long-term contact.
- F. None of above. (chosen)
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: Borneo linguistic area
Target entity description: The Borneo linguistic area is a region of the island of Borneo characterized by extensive contact and shared structural features among its diverse Austronesian and non-Austronesian languages.
- A. Sulawesi linguistic area: The Sulawesi linguistic area is a region of the Indonesian island of Sulawesi characterized by intense contact among diverse Austronesian and Papuan languages, leading to shared structural features across otherwise unrelated language groups.
- B. Bornean languages: Bornean languages are a subgroup of the Malayo-Polynesian language family spoken on the island of Borneo, encompassing diverse indigenous languages of Brunei, Indonesia, and Malaysia.
- C. Southeast Asia linguistic area: The Southeast Asia linguistic area is a region where languages from diverse families have converged to share common structural features such as tonal systems, analytic grammar, and similar word order due to long-term contact and diffusion.
- D. Barito–Mahakam languages: The Barito–Mahakam languages are a subgroup of Austronesian languages spoken in parts of Borneo, particularly along the Barito and Mahakam river regions.
- E. Indo-Burma linguistic area: The Indo-Burma linguistic area is a geographically defined region of South and Southeast Asia where diverse languages, especially from Tibeto-Burman and related families, have converged and shared structural features through long-term contact.
- F. None of above. (chosen)
Provenance (2 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69e0c455f3688190810bc96365791b0f | elicitation | completed |
| NER | batch_69ee813ef6a8819089511b8f608c9491 | ner | completed |
Created at: April 16, 2026, 5:49 p.m.