Triple

T16594818
Position Surface form Disambiguated ID Type / Status
Subject Rector Street station (IRT Broadway–Seventh Avenue Line) E403178 entity
Predicate hasMTAStationId P49108 FINISHED
Object R20
R20 is the internal MTA station identifier code assigned to the Rector Street station on the IRT Broadway–Seventh Avenue Line in the New York City Subway system.
E1222810 NE FINISHED

How this triple was built (4 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: R20 | Statement: [Rector Street station (IRT Broadway–Seventh Avenue Line), hasMTAStationId, R20]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: R20
Context triple: [Rector Street station (IRT Broadway–Seventh Avenue Line), hasMTAStationId, R20]
  • A. R20
    R20 is a regional commuter rail line in Catalonia that forms part of the Rodalies de Catalunya network.
  • B. R2000
    The R2000 is a 32-bit MIPS RISC microprocessor that became one of the earliest and most influential commercial implementations of the MIPS architecture in the mid-1980s.
  • C. R24
    R24 is a regional commuter rail line in Catalonia, Spain, operating within the Rodalies de Catalunya network to connect towns and cities in the area.
  • D. R2
    R2 is a classification for U.S. doctoral universities characterized by high levels of research activity, as defined by the Carnegie Classification system.
  • E. R2
    R2 is the MBTA station code used to identify Ashmont station on Boston's Red Line transit system.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NEDg Description generation gpt-5.1
Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: R20
Triple: [Rector Street station (IRT Broadway–Seventh Avenue Line), hasMTAStationId, R20]
Generated description
R20 is the internal MTA station identifier code assigned to the Rector Street station on the IRT Broadway–Seventh Avenue Line in the New York City Subway system.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: R20
Target entity description: R20 is the internal MTA station identifier code assigned to the Rector Street station on the IRT Broadway–Seventh Avenue Line in the New York City Subway system.
  • A. R20
    R20 is a regional commuter rail line in Catalonia that forms part of the Rodalies de Catalunya network.
  • B. R2000
    The R2000 is a 32-bit MIPS RISC microprocessor that became one of the earliest and most influential commercial implementations of the MIPS architecture in the mid-1980s.
  • C. R24
    R24 is a regional commuter rail line in Catalonia, Spain, operating within the Rodalies de Catalunya network to connect towns and cities in the area.
  • D. R2
    R2 is the MBTA station code used to identify Ashmont station on Boston's Red Line transit system.
  • E. R2
    R2 is a classification for U.S. doctoral universities characterized by high levels of research activity, as defined by the Carnegie Classification system.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d883880d0c81908b5fcd454e767b60 completed April 10, 2026, 4:58 a.m.
NER Named-entity recognition batch_69e35d70ad008190bba0901bf3c7089f completed April 18, 2026, 10:31 a.m.
NED1 Entity disambiguation (via context triple) batch_6a00759fe6ec81908c5321dcba558269 completed May 10, 2026, 12:10 p.m.
NEDg Description generation batch_6a007825547c81909230ac39761fba96 completed May 10, 2026, 12:20 p.m.
NED2 Entity disambiguation (via description) batch_6a00788e103481908d40bbe4a3cb1695 completed May 10, 2026, 12:22 p.m.
Created at: April 10, 2026, 5:16 a.m.