Triple
T6037458
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | São Tomé | E134456 | entity |
| Predicate | languageCommon | P741 | FINISHED |
| Object | Forro creole | E94968 | NE FINISHED |
How this triple was built (2 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Forro creole | Statement: [São Tomé, languageCommon, Forro creole]
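The instruction above asks for a JSON object with a string `phrase` and a boolean `is_ne`. A minimal sketch of how such a reply could be parsed and validated follows. The function name `parse_ner_response` and the example reply are hypothetical: this page does not show the raw model output, only that the object ended up with status NE FINISHED.

```python
import json


def parse_ner_response(raw: str) -> dict:
    """Parse a reply and check it has the two fields the instruction names."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    if not isinstance(data.get("phrase"), str):
        raise ValueError("`phrase` must be a string")
    if not isinstance(data.get("is_ne"), bool):
        raise ValueError("`is_ne` must be a boolean")
    return {"phrase": data["phrase"], "is_ne": data["is_ne"]}


# Hypothetical reply for the input above (assumed, not taken from the page).
example_reply = '{"phrase": "Forro creole", "is_ne": true}'
result = parse_ner_response(example_reply)
print(result)
```

Checking the type of `is_ne` strictly means a reply such as `"is_ne": "yes"` is rejected instead of being silently treated as true.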
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Forro creole | Context triple: [São Tomé, languageCommon, Forro creole]
- A. Forro (chosen): Forro is a Portuguese-based creole language spoken primarily on the islands of São Tomé and Príncipe in the Gulf of Guinea.
- B. Batuque: Batuque is an Afro-Brazilian religious tradition that blends West African (especially Yoruba) spiritual practices with elements of Catholicism and Indigenous beliefs, centered on the worship of orixás through music, dance, and ritual.
- C. Rumba: Rumba is one of the spacecraft in the Cluster mission, a European Space Agency project studying Earth's magnetosphere in three dimensions.
- D. Cajiqueño: Cajiqueño is the Spanish demonym for a person from the Colombian municipality of Cajicá.
- E. Maio Creole: Maio Creole is a regional variety of Cape Verdean Creole spoken primarily on the island of Maio, characterized by its own distinct phonetic and lexical features.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
Provenance (3 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69c00875db5c819099dd5bb833ec43c2 | completed | March 22, 2026, 3:19 p.m. |
| NER | Named-entity recognition | batch_69c056cb06508190a90beb4d9d083835 | completed | March 22, 2026, 8:53 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69c1139031248190b796a655bf07a4bc | completed | March 23, 2026, 10:18 a.m. |
Created at: March 22, 2026, 4:08 p.m.
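The batch order described under Provenance can be checked by parsing the "When" values and sorting on them. The sketch below is illustrative only: the helper `parse_when` is hypothetical, the timestamps are copied from the table, and the parsing assumes an English (or C) locale for month names and AM/PM markers.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse a timestamp written like 'March 22, 2026, 3:19 p.m.'."""
    # strptime expects AM/PM, so normalize the dotted form first.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# (step, when) pairs from the Provenance table, deliberately shuffled.
batches = [
    ("NED1", "March 23, 2026, 10:18 a.m."),
    ("creating", "March 22, 2026, 3:19 p.m."),
    ("NER", "March 22, 2026, 8:53 p.m."),
]

ordered = [step for step, when in sorted(batches, key=lambda b: parse_when(b[1]))]
print(ordered)
```

Because the timestamps are batch-level, the sorted order reflects when each batch ran, not when this individual triple was processed within it.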