Triple
T15291255
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Meriam Mir |
E365531
|
entity |
| Predicate | hasLoanwordsFrom |
P506
|
FINISHED |
| Object | Tok Pisin |
E49351
|
NE FINISHED |
Named-entity recognition
Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Tok Pisin | Statement: [Meriam Mir, hasLoanwordsFrom, Tok Pisin]
Disambiguation candidates (1 decision)
The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Tok Pisin Context triple: [Meriam Mir, hasLoanwordsFrom, Tok Pisin]
-
A.
Tok Pisin
chosen
Tok Pisin is an English-based creole language widely spoken in Papua New Guinea, where it serves as a major lingua franca and one of the country’s primary official languages.
-
B.
Solomon Islands Pijin
Solomon Islands Pijin is an English-based creole language widely used as a lingua franca across the Solomon Islands.
-
C.
Nauruan
Nauruan is an Austronesian language spoken primarily on the Pacific island nation of Nauru.
-
D.
Melanesian Pidgin
Melanesian Pidgin is an English-based creole language widely used as a lingua franca in parts of Melanesia, particularly Papua New Guinea.
-
E.
Papua New Guinean Hiri Motu
Papua New Guinean Hiri Motu is an Austronesian-based lingua franca and simplified form of Motu historically used for interethnic communication in Papua New Guinea.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69d85a103d9081908c1ea6c4c73ac8e3 |
elicitation | completed |
| NER | batch_69e03680b60c8190a3ea54a9d34c8105 |
ner | completed |
| NED1 | batch_69feef7d4da4819080f101c3a525ea11 |
ned_source_triple | completed |
Created at: April 10, 2026, 3:15 a.m.