Triple

T14855192
Position Surface form Disambiguated ID Type / Status
Subject Mongol-ruled Central Asia E349332 entity
Predicate includes P1393 FINISHED
Object Mawarannahr
Mawarannahr is the historical region of Transoxiana in Central Asia, centered between the Amu Darya and Syr Darya rivers and encompassing key Silk Road cities such as Samarkand and Bukhara.
E1123601 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Mawarannahr | Statement: [Mongol-ruled Central Asia, includes, Mawarannahr]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Mawarannahr
Context triple: [Mongol-ruled Central Asia, includes, Mawarannahr]
  • A. Azania
    Azania is a name used by some African liberation movements and activists to refer to a decolonized, non-apartheid South Africa.
  • B. Madakiya
    Madakiya is a town in southern Kaduna State, Nigeria, situated within the Zangon Kataf Local Government Area.
  • C. Ridaniya
    Ridaniya is a historical locality near Cairo, Egypt, known primarily as the site of a pivotal early 16th-century battle between the Ottoman Empire and the Mamluk Sultanate.
  • D. Magland
    Magland is a commune in the Haute-Savoie department of southeastern France, situated in the Alps near the town of Cluses.
  • E. Oriente
    Oriente is a Bolivian professional football club based in Santa Cruz de la Sierra, known for competing in the country’s top division.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Mawarannahr
Triple: [Mongol-ruled Central Asia, includes, Mawarannahr]
Generated description
Mawarannahr is the historical region of Transoxiana in Central Asia, centered between the Amu Darya and Syr Darya rivers and encompassing key Silk Road cities such as Samarkand and Bukhara.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Mawarannahr
Target entity description: Mawarannahr is the historical region of Transoxiana in Central Asia, centered between the Amu Darya and Syr Darya rivers and encompassing key Silk Road cities such as Samarkand and Bukhara.
  • A. Azania
    Azania is a name used by some African liberation movements and activists to refer to a decolonized, non-apartheid South Africa.
  • B. Madakiya
    Madakiya is a town in southern Kaduna State, Nigeria, situated within the Zangon Kataf Local Government Area.
  • C. Ridaniya
    Ridaniya is a historical locality near Cairo, Egypt, known primarily as the site of a pivotal early 16th-century battle between the Ottoman Empire and the Mamluk Sultanate.
  • D. Magland
    Magland is a commune in the Haute-Savoie department of southeastern France, situated in the Alps near the town of Cluses.
  • E. Oriente
    Oriente is a Bolivian professional football club based in Santa Cruz de la Sierra, known for competing in the country’s top division.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d822ed7e1881909b90fca143ad7e34 completed April 9, 2026, 10:06 p.m.
NER Named-entity recognition batch_69ded44318f0819080b6c599f2d3474f completed April 14, 2026, 11:56 p.m.
NED1 Entity disambiguation (via context triple) batch_69fe65087708819084f51a043e5361e9 completed May 8, 2026, 10:34 p.m.
NEDg Description generation batch_69fe66218cb88190b8c86b359abaa14c completed May 8, 2026, 10:39 p.m.
NED2 Entity disambiguation (via description) batch_69fe66de57cc8190935d764d399f56f5 completed May 8, 2026, 10:42 p.m.
Created at: April 10, 2026, 1:54 a.m.