Triple

T21429303
Position Surface form Disambiguated ID Type / Status
Subject Bidayuh language E528641 entity
Predicate isPartOf P10 FINISHED
Object Borneo linguistic area NE NERFINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Borneo linguistic area | Statement: [Bidayuh language, isPartOf, Borneo linguistic area]

Disambiguation candidates (2 decisions)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Borneo linguistic area
Context triple: [Bidayuh language, isPartOf, Borneo linguistic area]
  • A. Sulawesi linguistic area
    The Sulawesi linguistic area is a region of the Indonesian island of Sulawesi characterized by intense contact among diverse Austronesian and Papuan languages, leading to shared structural features across otherwise unrelated language groups.
  • B. Bornean languages
    Bornean languages are a subgroup of the Malayo-Polynesian language family spoken on the island of Borneo, encompassing diverse indigenous languages of Brunei, Indonesia, and Malaysia.
  • C. Southeast Asia linguistic area
    The Southeast Asia linguistic area is a region where languages from diverse families have converged to share common structural features such as tonal systems, analytic grammar, and similar word order due to long-term contact and diffusion.
  • D. Barito–Mahakam languages
    The Barito–Mahakam languages are a subgroup of Austronesian languages spoken in parts of Borneo, particularly along the Barito and Mahakam river regions.
  • E. Indo-Burma linguistic area
    The Indo-Burma linguistic area is a geographically defined region of South and Southeast Asia where diverse languages, especially from Tibeto-Burman and related families, have converged and shared structural features through long-term contact.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Borneo linguistic area
Target entity description: The Borneo linguistic area is a region of the island of Borneo characterized by extensive contact and shared structural features among its diverse Austronesian and non-Austronesian languages.
  • A. Sulawesi linguistic area
    The Sulawesi linguistic area is a region of the Indonesian island of Sulawesi characterized by intense contact among diverse Austronesian and Papuan languages, leading to shared structural features across otherwise unrelated language groups.
  • B. Bornean languages
    Bornean languages are a subgroup of the Malayo-Polynesian language family spoken on the island of Borneo, encompassing diverse indigenous languages of Brunei, Indonesia, and Malaysia.
  • C. Southeast Asia linguistic area
    The Southeast Asia linguistic area is a region where languages from diverse families have converged to share common structural features such as tonal systems, analytic grammar, and similar word order due to long-term contact and diffusion.
  • D. Barito–Mahakam languages
    The Barito–Mahakam languages are a subgroup of Austronesian languages spoken in parts of Borneo, particularly along the Barito and Mahakam river regions.
  • E. Indo-Burma linguistic area
    The Indo-Burma linguistic area is a geographically defined region of South and Southeast Asia where diverse languages, especially from Tibeto-Burman and related families, have converged and shared structural features through long-term contact.
  • F. None of above. chosen

Provenance (2 batches)

Stage Batch ID Job type Status
creating batch_69e0c455f3688190810bc96365791b0f elicitation completed
NER batch_69ee813ef6a8819089511b8f608c9491 ner completed
Created at: April 16, 2026, 5:49 p.m.