Triple T16615898
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Lampung people | E403693 | entity |
| Predicate | partOf | P40 | FINISHED |
| Object | Malay cultural sphere | E151109 | NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Malay cultural sphere | Statement: [Lampung people, partOf, Malay cultural sphere]
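The instruction above asks for a JSON object with two fields, `phrase` (string) and `is_ne` (boolean). A minimal sketch of how a reply of that shape could be parsed and checked is shown below. The function name `parse_ner_response` and the sample reply are hypothetical and used only for illustration; this record does not show the model's actual output.

```python
import json


def parse_ner_response(raw: str) -> tuple[str, bool]:
    """Parse a reply shaped as the NER instruction requests and check field types."""
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    if not isinstance(phrase, str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("`is_ne` must be a boolean")
    return phrase, is_ne


# Hypothetical reply for illustration only; not taken from the pipeline logs.
example_reply = '{"phrase": "Malay cultural sphere", "is_ne": true}'
phrase, is_ne = parse_ner_response(example_reply)
```

Checking the types explicitly matters here because a string such as `"true"` in place of a JSON boolean would otherwise pass through unnoticed.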
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Malay cultural sphere | Context triple: [Lampung people, partOf, Malay cultural sphere]
- A. Malay world: The Malay world is a cultural and historical region of Maritime Southeast Asia encompassing areas where Malay peoples and related Austronesian groups have traditionally lived and shared linguistic, religious, and social traditions.
- B. Austronesian cultural sphere: The Austronesian cultural sphere is a broad maritime-based cultural and linguistic region spanning Island Southeast Asia and the Pacific, characterized by related Austronesian languages, seafaring traditions, and shared ancestral cultural practices.
- C. Southeast Asia linguistic area: The Southeast Asia linguistic area is a region where languages from diverse families have converged to share common structural features such as tonal systems, analytic grammar, and similar word order due to long-term contact and diffusion.
- D. Kedahan Malay: Kedahan Malay is a regional variety of the Malay language spoken primarily in the Malaysian state of Kedah and surrounding areas, distinguished by its own phonological, lexical, and orthographic features.
- E. Alam Melayu (chosen): Alam Melayu refers to the cultural and historical Malay world encompassing the Malay-speaking peoples and regions of Maritime Southeast Asia.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69d883897eb481909eaaa088ba9918d9 | completed | April 10, 2026, 4:58 a.m. |
| NER | Named-entity recognition | batch_69e375494260819099b6988857c52dde | completed | April 18, 2026, 12:12 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_6a007daef18481908c3628a3466300ce | completed | May 10, 2026, 12:44 p.m. |
Created at: April 10, 2026, 5:17 a.m.