Triple

T10837764
Position Surface form Disambiguated ID Type / Status
Subject Bruton tyrosine kinase E255805 entity
Predicate associatedWithDisease P37 FINISHED
Object Waldenström macroglobulinemia
Waldenström macroglobulinemia is a rare, indolent B-cell non-Hodgkin lymphoma characterized by bone marrow infiltration and excess IgM production, leading to symptoms such as anemia, hyperviscosity, and neuropathy.
E99398 NE FINISHED

Named-entity recognition

Before disambiguation, gpt-5-mini classified whether the object phrase is a named entity — the step behind the object's NE type shown above.

Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Waldenström macroglobulinemia | Statement: [Bruton tyrosine kinase, associatedWithDisease, Waldenström macroglobulinemia]

Disambiguation candidates (2 decisions)

The exact options the model was shown at each disambiguation step, with the option it chose highlighted — the evidence behind this triple's disambiguated ids.

NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: Waldenström macroglobulinemia
Context triple: [Bruton tyrosine kinase, associatedWithDisease, Waldenström macroglobulinemia]
  • A. CLL
    CLL is the IATA airport code for Easterwood Airport, a regional airport serving College Station, Texas.
  • B. chronic lymphocytic leukemia
    Chronic lymphocytic leukemia is a slow-growing cancer of the blood and bone marrow characterized by an overproduction of abnormal lymphocytes, most commonly affecting older adults.
  • C. Hodgkin
    Hodgkin is a surname most famously associated with Dorothy Hodgkin, the Nobel Prize–winning British chemist who advanced the field of X-ray crystallography.
  • D. Hodgkin lymphoma
    Hodgkin lymphoma is a type of cancer that originates in the lymphatic system, characterized by the presence of abnormal Reed–Sternberg cells and often affecting lymph nodes.
  • E. non-Hodgkin lymphoma
    Non-Hodgkin lymphoma is a diverse group of blood cancers that originate in the lymphatic system from abnormal lymphocytes and can vary widely in aggressiveness and prognosis.
  • F. None of above. chosen
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2 Entity disambiguation (via description) gpt-5-mini-2025-08-07
Target entity: Waldenström macroglobulinemia
Target entity description: Waldenström macroglobulinemia is a rare, indolent B-cell non-Hodgkin lymphoma characterized by bone marrow infiltration and excess IgM production, leading to symptoms such as anemia, hyperviscosity, and neuropathy.
  • A. CLL
    CLL is the IATA airport code for Easterwood Airport, a regional airport serving College Station, Texas.
  • B. chronic lymphocytic leukemia
    Chronic lymphocytic leukemia is a slow-growing cancer of the blood and bone marrow characterized by an overproduction of abnormal lymphocytes, most commonly affecting older adults.
  • C. Hodgkin
    Hodgkin is a surname most famously associated with Dorothy Hodgkin, the Nobel Prize–winning British chemist who advanced the field of X-ray crystallography.
  • D. Hodgkin lymphoma
    Hodgkin lymphoma is a type of cancer that originates in the lymphatic system, characterized by the presence of abnormal Reed–Sternberg cells and often affecting lymph nodes.
  • E. non-Hodgkin lymphoma chosen
    Non-Hodgkin lymphoma is a diverse group of blood cancers that originate in the lymphatic system from abnormal lymphocytes and can vary widely in aggressiveness and prognosis.
  • F. None of above.

How the object was described

The object's one-sentence description was generated by prompting gpt-5.1 with the object name and this triple as context.

Instruction
Generate a one-sentence description of the target entity. 
You are given a context triple in the form (subject, predicate, object), where the object is the target entity. 
# Instructions
Use the triple to infer relevant information about the entity. Describe the entity based on what is most defining, well-known. 
Avoid repeating the information from the triple, unless really essential.
# Response Format
Return only the sentence: "Description: [one-sentence description of the target entity]"
Input
Entity: Waldenström macroglobulinemia
Triple: [Bruton tyrosine kinase, associatedWithDisease, Waldenström macroglobulinemia]
Generated description
Waldenström macroglobulinemia is a rare, indolent B-cell non-Hodgkin lymphoma characterized by bone marrow infiltration and excess IgM production, leading to symptoms such as anemia, hyperviscosity, and neuropathy.

Provenance (5 batches)

Stage Batch ID Job type Status
creating batch_69d6aa81a5d08190aa86689061d1ddd2 elicitation completed
NER batch_69d747002b3081908726901ee83d8f38 ner completed
NED1 batch_69deb139072081908a67e76a83575c32 ned_source_triple completed
NED2 batch_69deb4e471648190a00f3a921b5fb657 ned_description completed
NEDg batch_69deb41c9ae881909a3dab1292d6ddd7 nedg completed
Created at: April 8, 2026, 9:19 p.m.