Triple
T8527190
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | North Jakarta |
E201845
|
entity |
| Predicate | commonLocalLanguage |
P10892
|
FINISHED |
| Object | Betawi Malay |
E141259
|
NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Betawi Malay | Statement: [North Jakarta, commonLocalLanguage, Betawi Malay]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Betawi Malay Context triple: [North Jakarta, commonLocalLanguage, Betawi Malay]
-
A.
Betawi Malay
chosen
Betawi Malay is a Malay-based creole language spoken primarily in Jakarta, Indonesia, serving as the traditional language of the Betawi ethnic community.
-
B.
Banjarese Malay
Banjarese Malay is an Austronesian language spoken primarily by the Banjar people of South Kalimantan in Indonesia, known for its distinct phonology and vocabulary within the Malayic language family.
-
C.
Bandanese Malay
Bandanese Malay is a regional variety of Malay historically spoken by the Banda Islands community in Indonesia, shaped by centuries of spice trade and cultural contact.
-
D.
Palembang Malay
Palembang Malay is a regional variety of the Malay language spoken primarily in and around the city of Palembang in South Sumatra, Indonesia, known for its distinct phonology and vocabulary.
-
E.
Bantenese Malay
Bantenese Malay is a regional variety of the Malay language spoken primarily in the Banten province of western Java, Indonesia, with distinctive phonological and lexical features influenced by Sundanese and Javanese.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69ca83228b24819085d22e7dc99f5d94 |
completed | March 30, 2026, 2:05 p.m. |
| NER | Named-entity recognition | batch_69cbe6477100819081fa20cb6b8ea3d7 |
completed | March 31, 2026, 3:20 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69ce6d49c1408190b7c23739409d1e3d |
completed | April 2, 2026, 1:21 p.m. |
Created at: March 30, 2026, 6:16 p.m.