Triple

T16203855
Position Surface form Disambiguated ID Type / Status
Subject arrondissement of Bastia E393269 entity
Predicate contains P35 FINISHED
Object Sorio
Sorio is a small commune in the Haute-Corse department on the island of Corsica in France.
E1205628 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Sorio | Statement: [arrondissement of Bastia, contains, Sorio]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Sorio
Context triple: [arrondissement of Bastia, contains, Sorio]
  • A. Sarzedo
    Sarzedo is a small parish (freguesia) in the municipality of Arganil, located in the Coimbra District of central Portugal.
  • B. Osuna
    Osuna is a historic town in the province of Seville, Spain, known for its rich archaeological heritage, including notable ancient reliefs and other Roman-era remains.
  • C. Requena
    Requena is a small Peruvian city in the Loreto region, known as a remote Amazonian river port and gateway to surrounding rainforest communities.
  • D. Requena
    Requena is a historic inland town in Spain’s Valencian Community, known for its wine production and well-preserved medieval quarter.
  • E. Caleruega
    Caleruega is a small town in the province of Burgos, Spain, best known as the birthplace of Saint Dominic, founder of the Dominican Order.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Sorio
Triple: [arrondissement of Bastia, contains, Sorio]
Generated description
Sorio is a small commune in the Haute-Corse department on the island of Corsica in France.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Sorio
Target entity description: Sorio is a small commune in the Haute-Corse department on the island of Corsica in France.
  • A. Sarzedo
    Sarzedo is a small parish (freguesia) in the municipality of Arganil, located in the Coimbra District of central Portugal.
  • B. Osuna
    Osuna is a historic town in the province of Seville, Spain, known for its rich archaeological heritage, including notable ancient reliefs and other Roman-era remains.
  • C. Requena
    Requena is a small Peruvian city in the Loreto region, known as a remote Amazonian river port and gateway to surrounding rainforest communities.
  • D. Requena
    Requena is a historic inland town in Spain’s Valencian Community, known for its wine production and well-preserved medieval quarter.
  • E. Caleruega
    Caleruega is a small town in the province of Burgos, Spain, best known as the birthplace of Saint Dominic, founder of the Dominican Order.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d87f1f5bd08190bd01cac0d5b9d2ef completed April 10, 2026, 4:39 a.m.
NER Named-entity recognition batch_69e2270ca18c8190a259992aed4ec072 completed April 17, 2026, 12:26 p.m.
NED1 Entity disambiguation (via context triple) batch_6a001f860ecc8190be904fa793968d89 completed May 10, 2026, 6:02 a.m.
NEDg Description generation batch_6a00211bb2bc8190bc32492fd6de3bc3 completed May 10, 2026, 6:09 a.m.
NED2 Entity disambiguation (via description) batch_6a00221262288190b154d2e2c318d162 completed May 10, 2026, 6:13 a.m.
Created at: April 10, 2026, 5:03 a.m.