Triple

T6037458
Position Surface form Disambiguated ID Type / Status
Subject São Tomé E134456 entity
Predicate languageCommon P741 FINISHED
Object Forro creole E94968 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Forro creole | Statement: [São Tomé, languageCommon, Forro creole]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Forro creole
Context triple: [São Tomé, languageCommon, Forro creole]
  • A. Forro chosen
    Forro is a Portuguese-based creole language spoken primarily on the islands of São Tomé and Príncipe in the Gulf of Guinea.
  • B. Batuque
    Batuque is an Afro-Brazilian religious tradition that blends West African (especially Yoruba) spiritual practices with elements of Catholicism and Indigenous beliefs, centered on the worship of orixás through music, dance, and ritual.
  • C. Rumba
    Rumba is one of the spacecraft in the Cluster mission, a European Space Agency project studying Earth's magnetosphere in three dimensions.
  • D. Cajiqueño
    Cajiqueño is the Spanish demonym for a person from the Colombian municipality of Cajicá.
  • E. Maio Creole
    Maio Creole is a regional variety of Cape Verdean Creole spoken primarily on the island of Maio, characterized by its own distinct phonetic and lexical features.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c00875db5c819099dd5bb833ec43c2 completed March 22, 2026, 3:19 p.m.
NER Named-entity recognition batch_69c056cb06508190a90beb4d9d083835 completed March 22, 2026, 8:53 p.m.
NED1 Entity disambiguation (via context triple) batch_69c1139031248190b796a655bf07a4bc completed March 23, 2026, 10:18 a.m.
Created at: March 22, 2026, 4:08 p.m.