Triple

T11939874
Position | Surface form | Disambiguated ID | Type / Status
Subject | La Guajira Department | E284146 | entity
Predicate | hasMunicipality | P847 | FINISHED
Object | Maicao | E955075 | NE FINISHED
Object description: Maicao is a Colombian border city in the northeastern La Guajira region, known as a major commercial hub with significant Arab and Wayuu indigenous communities.

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER: Named-entity recognition (gpt-5-mini)
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Maicao | Statement: [La Guajira Department, hasMunicipality, Maicao]
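The NER instruction asks for a JSON object with `phrase` and `is_ne`. A minimal sketch of how such a reply could be validated follows. The function name `parse_ner_response` and the example reply are hypothetical: the record shows only the resulting type (NE), not the raw model output.

```python
import json


def parse_ner_response(raw: str) -> tuple[str, bool]:
    """Parse a reply of the shape the NER instruction requests.

    Hypothetical helper, not part of the recorded pipeline.
    """
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    # The instruction specifies a string and a boolean; reject anything else.
    if not isinstance(phrase, str) or not isinstance(is_ne, bool):
        raise ValueError("expected `phrase` as string and `is_ne` as boolean")
    return phrase, is_ne


# Assumed reply, chosen to agree with the "NE" type recorded for the object.
example_reply = '{"phrase": "Maicao", "is_ne": true}'
phrase, is_ne = parse_ner_response(example_reply)
```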
NED1: Entity disambiguation via context triple (gpt-5-mini-2025-08-07)
Target entity: Maicao
Context triple: [La Guajira Department, hasMunicipality, Maicao]
  • A. Neiva
    Neiva is a major city in southwestern Colombia known as the economic and cultural center of the upper Magdalena River valley.
  • B. Santena
    Santena is a small town in the Piedmont region of northern Italy, known for its historical association with statesman Camillo Benso, Count of Cavour.
  • C. Mochó
    Mochó is an indigenous Mayan language spoken by a small community in Chiapas, Mexico, and considered highly endangered.
  • D. Luena
    Luena is a city in eastern Angola that served as a significant site during the Angolan Civil War and later became known as the burial place of UNITA leader Jonas Savimbi.
  • E. Arruda
    Arruda is a neighborhood in Recife, Brazil, best known for housing the Estádio do Arruda, home stadium of the Santa Cruz Futebol Clube.
  • F. None of above. [chosen]
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
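The lettered option list above could be assembled roughly as follows. This is a sketch under assumptions: `format_options` and its `include_unsure` flag are hypothetical names, and the record does not show how the pipeline actually builds its prompts.

```python
from string import ascii_uppercase


def format_options(candidates: list[tuple[str, str]], include_unsure: bool = True) -> str:
    """Render (name, description) candidates as lettered options.

    Hypothetical helper; the fallback options mirror the wording in the record.
    """
    labels = iter(ascii_uppercase)
    lines = []
    for name, description in candidates:
        lines.append(f"  • {next(labels)}. {name}")
        lines.append(f"    {description}")
    # The fallback choices always follow the real candidates.
    lines.append(f"  • {next(labels)}. None of above.")
    if include_unsure:
        lines.append(
            f"  • {next(labels)}. Unsure - the case is ambiguous/"
            "there is not enough information to decide."
        )
    return "\n".join(lines)
```

With five candidates, "None of above." lands on F and the unsure option on G, matching the labels shown in this step.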
NEDg: Description generation (gpt-5.1)
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Maicao
Triple: [La Guajira Department, hasMunicipality, Maicao]
Generated description
Maicao is a Colombian border city in the northeastern La Guajira region, known as a major commercial hub with significant Arab and Wayuu indigenous communities.
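The response format requires the reply to be a single line starting with "Description:". A minimal sketch of stripping that prefix to obtain the stored description follows. The helper name `extract_description` is hypothetical, and the raw reply is reconstructed from the stored description rather than taken from a log.

```python
def extract_description(reply: str) -> str:
    """Return the sentence that follows the required 'Description:' prefix.

    Hypothetical helper, not part of the recorded pipeline.
    """
    prefix = "Description:"
    text = reply.strip()
    if not text.startswith(prefix):
        raise ValueError("reply does not follow the 'Description: ...' format")
    return text[len(prefix):].strip()


# Assumed raw reply, rebuilt from the description stored in this record.
raw_reply = (
    "Description: Maicao is a Colombian border city in the northeastern "
    "La Guajira region, known as a major commercial hub with significant "
    "Arab and Wayuu indigenous communities."
)
description = extract_description(raw_reply)
```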
NED2: Entity disambiguation via description (gpt-5-mini-2025-08-07)
Target entity: Maicao
Target entity description: Maicao is a Colombian border city in the northeastern La Guajira region, known as a major commercial hub with significant Arab and Wayuu indigenous communities.
  • A. Neiva
    Neiva is a major city in southwestern Colombia known as the economic and cultural center of the upper Magdalena River valley.
  • B. Santena
    Santena is a small town in the Piedmont region of northern Italy, known for its historical association with statesman Camillo Benso, Count of Cavour.
  • C. Mochó
    Mochó is an indigenous Mayan language spoken by a small community in Chiapas, Mexico, and considered highly endangered.
  • D. Luena
    Luena is a city in eastern Angola that served as a significant site during the Angolan Civil War and later became known as the burial place of UNITA leader Jonas Savimbi.
  • E. Arruda
    Arruda is a neighborhood in Recife, Brazil, best known for housing the Estádio do Arruda, home stadium of the Santa Cruz Futebol Clube.
  • F. None of above. [chosen]

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step | Stage | Batch ID | Status | When
creating | Elicitation | batch_69d6ab2ce9c48190b5d39511b524f666 | completed | April 8, 2026, 7:23 p.m.
NER | Named-entity recognition | batch_69d903415d2481909d84e6727454b9fe | completed | April 10, 2026, 2:03 p.m.
NED1 | Entity disambiguation (via context triple) | batch_69f4409a40dc81909d87c50601b98b78 | completed | May 1, 2026, 5:56 a.m.
NEDg | Description generation | batch_69f448fc874081908fe05f9d8aff11a3 | completed | May 1, 2026, 6:32 a.m.
NED2 | Entity disambiguation (via description) | batch_69f44afdc7b08190bdf47cfcb94c34c8 | completed | May 1, 2026, 6:41 a.m.
Created at: April 8, 2026, 9:45 p.m.
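
The claim that the object chain (NER → NED1 → NEDg → NED2) reads in order can be checked against the batch timestamps. The sketch below assumes the timestamps use the "Month D, YYYY, H:MM a.m./p.m." style shown in this record and an English locale for `%p`. The helper name `parse_when` is hypothetical.

```python
from datetime import datetime


def parse_when(text: str) -> datetime:
    """Parse a timestamp such as 'May 1, 2026, 5:56 a.m.'."""
    # strptime expects AM/PM rather than the a.m./p.m. spelling used here.
    normalized = text.replace("a.m.", "AM").replace("p.m.", "PM")
    return datetime.strptime(normalized, "%B %d, %Y, %I:%M %p")


# Batch-level timestamps of the object chain, copied from the Provenance table.
object_chain = [
    ("NER", "April 10, 2026, 2:03 p.m."),
    ("NED1", "May 1, 2026, 5:56 a.m."),
    ("NEDg", "May 1, 2026, 6:32 a.m."),
    ("NED2", "May 1, 2026, 6:41 a.m."),
]

times = [parse_when(when) for _, when in object_chain]
# True when each stage's batch ran no earlier than the stage before it.
in_order = all(earlier <= later for earlier, later in zip(times, times[1:]))
```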