Triple

T14383544
Position Surface form Disambiguated ID Type / Status
Subject Simon Peyton Jones E356667 entity
Predicate employer P7 FINISHED
Object Microsoft Research Cambridge E100209 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Microsoft Research Cambridge | Statement: [Simon Peyton Jones, employer, Microsoft Research Cambridge]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Microsoft Research Cambridge
Context triple: [Simon Peyton Jones, employer, Microsoft Research Cambridge]
  • A. Microsoft Research Cambridge chosen
    Microsoft Research Cambridge is a leading European research laboratory of Microsoft focused on advancing computer science and related fields through fundamental and applied research.
  • B. Microsoft Research
    Microsoft Research is the research division of Microsoft, dedicated to advancing computer science and related fields through fundamental and applied research.
  • C. Adobe Research
    Adobe Research is the research arm of Adobe that advances cutting-edge technologies in areas such as computer graphics, computer vision, machine learning, and digital media.
  • D. IBM Austin Research Laboratory
    IBM Austin Research Laboratory is a research facility of IBM located in Austin, Texas, known for advanced work in computer architecture, microprocessors, and high-performance computing technologies.
  • E. Intel Labs
    Intel Labs is the research and innovation arm of Intel, focused on advancing cutting-edge technologies in areas such as computing, artificial intelligence, and semiconductor design.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d827927c988190ad98bb0360981783 completed April 9, 2026, 10:26 p.m.
NER Named-entity recognition batch_69de900d28c88190a37feee4743563de completed April 14, 2026, 7:05 p.m.
NED1 Entity disambiguation (via context triple) batch_69fd5511c9e4819089dcbf089ca0dc6a completed May 8, 2026, 3:14 a.m.
Created at: April 10, 2026, 1:16 a.m.