Triple

T14417778
Position Surface form Disambiguated ID Type / Status
Subject Dorothy Mary Crowfoot Hodgkin E357500 entity
Predicate familyName P18 FINISHED
Object Hodgkin E70299 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Hodgkin | Statement: [Dorothy Mary Crowfoot Hodgkin, familyName, Hodgkin]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Hodgkin
Context triple: [Dorothy Mary Crowfoot Hodgkin, familyName, Hodgkin]
  • A. Hodgkin chosen
    Hodgkin is a surname most famously associated with Dorothy Hodgkin, the Nobel Prize–winning British chemist who advanced the field of X-ray crystallography.
  • B. Hodgkin lymphoma
    Hodgkin lymphoma is a type of cancer that originates in the lymphatic system, characterized by the presence of abnormal Reed–Sternberg cells and often affecting lymph nodes.
  • C. non-Hodgkin lymphoma
    Non-Hodgkin lymphoma is a diverse group of blood cancers that originate in the lymphatic system from abnormal lymphocytes and can vary widely in aggressiveness and prognosis.
  • D. CLL
    CLL is the IATA airport code for Easterwood Airport, a regional airport serving College Station, Texas.
  • E. chronic lymphocytic leukemia
    Chronic lymphocytic leukemia is a slow-growing cancer of the blood and bone marrow characterized by an overproduction of abnormal lymphocytes, most commonly affecting older adults.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69d82793421c8190861eb0e673b085de completed April 9, 2026, 10:26 p.m.
NER Named-entity recognition batch_69de90cec8e4819087c72d82f9caacda completed April 14, 2026, 7:09 p.m.
NED1 Entity disambiguation (via context triple) batch_69fd5bc79c088190b6fd2984515976d7 completed May 8, 2026, 3:43 a.m.
Created at: April 10, 2026, 1:17 a.m.