Triple

T6496088
Position | Surface form | Disambiguated ID | Type / Status
Subject | Sunda-Sulawesi languages | E148163 | entity
Predicate | hasPart | P35 | FINISHED
Object | Bali-Sasak-Sumbawa languages | E597760 | NE FINISHED
Object description: The Bali-Sasak-Sumbawa languages are a subgroup of the Austronesian language family spoken primarily on the islands of Bali, Lombok, and Sumbawa in Indonesia.

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick marked), and the generated description. The batch and timestamp of each step are in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Bali-Sasak-Sumbawa languages | Statement: [Sunda-Sulawesi languages, hasPart, Bali-Sasak-Sumbawa languages]
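The NER instruction above asks for a JSON object with a string `phrase` and a boolean `is_ne`. A minimal sketch of how such a reply could be checked before use is shown here. The function name `parse_ner_response` and the sample reply are hypothetical; the page does not show the model's raw output or how the pipeline validates it.

```python
import json


def parse_ner_response(raw: str, expected_phrase: str) -> bool:
    """Parse a NER reply and return its is_ne flag.

    Raises ValueError if the reply does not have the shape the
    instruction asks for. (Hypothetical helper, not the pipeline's code.)
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")

    phrase = data.get("phrase")
    is_ne = data.get("is_ne")
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    # Strict check: values such as 1 or "true" are rejected.
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    if phrase != expected_phrase:
        raise ValueError("reply echoes a different phrase")
    return is_ne


# Assumed reply, consistent with the "NE" status recorded for the object.
reply = '{"phrase": "Bali-Sasak-Sumbawa languages", "is_ne": true}'
print(parse_ner_response(reply, "Bali-Sasak-Sumbawa languages"))
```

Checking that the echoed phrase matches the input is one cheap way to catch replies that were paired with the wrong request in a batch.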
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Bali-Sasak-Sumbawa languages
Context triple: [Sunda-Sulawesi languages, hasPart, Bali-Sasak-Sumbawa languages]
  • A. Bima–Sumba languages
    The Bima–Sumba languages are a subgroup of Austronesian languages spoken primarily on the islands of Sumbawa and Sumba in eastern Indonesia.
  • B. Sunda-Sulawesi languages
    The Sunda-Sulawesi languages are a proposed group of Austronesian languages spoken primarily in western and central Indonesia, including parts of Java, Sulawesi, and nearby islands.
  • C. Sulawesi languages
    The Sulawesi languages are a diverse group of Austronesian languages spoken on the Indonesian island of Sulawesi, known for their complex typological variation and significant internal linguistic diversity.
  • D. Timor–Babar languages
    The Timor–Babar languages are a subgroup of Austronesian languages spoken primarily on Timor and nearby islands in eastern Indonesia, noted for their complex phonologies and diverse grammatical structures.
  • E. Flores–Lembata languages
    The Flores–Lembata languages are a subgroup of Austronesian languages spoken on the islands of Flores and Lembata in eastern Indonesia, known for their distinctive phonological and grammatical features within the region.
  • F. None of above. (chosen)
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Bali-Sasak-Sumbawa languages
Triple: [Sunda-Sulawesi languages, hasPart, Bali-Sasak-Sumbawa languages]
Generated description
The Bali-Sasak-Sumbawa languages are a subgroup of the Austronesian language family spoken primarily on the islands of Bali, Lombok, and Sumbawa in Indonesia.
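The response format above requires the reply to be a single sentence prefixed with "Description:". As a rough sketch, the prefix could be stripped as follows before the sentence is stored. The helper name `extract_description` is hypothetical, and the sample reply is assembled from the generated description shown on this page rather than copied from a raw model response.

```python
PREFIX = "Description:"


def extract_description(raw: str) -> str:
    """Return the sentence that follows the 'Description:' prefix.

    Hypothetical helper: raises ValueError when the prefix is missing
    or nothing follows it.
    """
    text = raw.strip()
    if not text.startswith(PREFIX):
        raise ValueError("reply does not start with 'Description:'")
    description = text[len(PREFIX):].strip()
    if not description:
        raise ValueError("empty description")
    return description


sample = (
    "Description: The Bali-Sasak-Sumbawa languages are a subgroup of the "
    "Austronesian language family spoken primarily on the islands of Bali, "
    "Lombok, and Sumbawa in Indonesia."
)
print(extract_description(sample))
```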
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Bali-Sasak-Sumbawa languages
Target entity description: The Bali-Sasak-Sumbawa languages are a subgroup of the Austronesian language family spoken primarily on the islands of Bali, Lombok, and Sumbawa in Indonesia.
  • A. Bima–Sumba languages
    The Bima–Sumba languages are a subgroup of Austronesian languages spoken primarily on the islands of Sumbawa and Sumba in eastern Indonesia.
  • B. Sunda-Sulawesi languages
    The Sunda-Sulawesi languages are a proposed group of Austronesian languages spoken primarily in western and central Indonesia, including parts of Java, Sulawesi, and nearby islands.
  • C. Sulawesi languages
    The Sulawesi languages are a diverse group of Austronesian languages spoken on the Indonesian island of Sulawesi, known for their complex typological variation and significant internal linguistic diversity.
  • D. Timor–Babar languages
    The Timor–Babar languages are a subgroup of Austronesian languages spoken primarily on Timor and nearby islands in eastern Indonesia, noted for their complex phonologies and diverse grammatical structures.
  • E. Flores–Lembata languages
    The Flores–Lembata languages are a subgroup of Austronesian languages spoken on the islands of Flores and Lembata in eastern Indonesia, known for their distinctive phonological and grammatical features within the region.
  • F. None of above. (chosen)
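Both disambiguation steps present lettered options and record one pick. The sketch below shows one way a picked letter could be mapped back to a candidate, treating "None of above." and the "Unsure" option as no match. The function `resolve_choice` is hypothetical, and what the pipeline actually does when no candidate matches is not stated on this page.

```python
from typing import Optional

# Option labels as listed in the NED1 step on this page.
OPTIONS = {
    "A": "Bima–Sumba languages",
    "B": "Sunda-Sulawesi languages",
    "C": "Sulawesi languages",
    "D": "Timor–Babar languages",
    "E": "Flores–Lembata languages",
    "F": "None of above.",
    "G": "Unsure - the case is ambiguous/there is not enough information to decide.",
}

# Labels that mean "no existing candidate was selected".
NO_MATCH = {OPTIONS["F"], OPTIONS["G"]}


def resolve_choice(options: dict[str, str], picked: str) -> Optional[str]:
    """Return the chosen candidate label, or None when nothing matched."""
    label = options[picked]  # KeyError for a letter that was never offered
    return None if label in NO_MATCH else label


print(resolve_choice(OPTIONS, "F"))  # the pick recorded in both NED steps
```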

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level. Stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate and elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69c009088f3081909cd467b05919de30 | completed | March 22, 2026, 3:21 p.m.
NER | Named-entity recognition | batch_69c06ab958808190bd85e007e925ffc4 | completed | March 22, 2026, 10:18 p.m.
NED1 | Entity disambiguation (via context triple) | batch_69c65fdf59548190b4ba4f44716e6b3c | completed | March 27, 2026, 10:45 a.m.
NEDg | Description generation | batch_69c660ae1ae48190941b4e0fc1fa6bea | completed | March 27, 2026, 10:49 a.m.
NED2 | Entity disambiguation (via description) | batch_69c6615b639c8190af0073368e55ab8d | completed | March 27, 2026, 10:52 a.m.
Created at: March 22, 2026, 4:53 p.m.
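The claim that the object chain reads in chronological order can be checked against the timestamps in the Provenance table. This is a small sketch that assumes an English locale and only the "Month D, YYYY, H:MM a.m./p.m." form seen on this page; `parse_when` is a hypothetical helper.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse timestamps such as 'March 22, 2026, 3:21 p.m.'."""
    # strptime's %p expects AM/PM, so normalize the dotted form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# Object-chain rows copied from the Provenance table.
object_chain = [
    ("NER", "March 22, 2026, 10:18 p.m."),
    ("NED1", "March 27, 2026, 10:45 a.m."),
    ("NEDg", "March 27, 2026, 10:49 a.m."),
    ("NED2", "March 27, 2026, 10:52 a.m."),
]

times = [parse_when(when) for _, when in object_chain]
# True when every step ran strictly after the one before it.
in_order = all(earlier < later for earlier, later in zip(times, times[1:]))
print(in_order)
```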