Triple

T6065057
Position Surface form Disambiguated ID Type / Status
Subject SMN2 E135135 entity
Predicate fullName P16 FINISHED
Object survival of motor neuron 2, centromeric E135135 NE FINISHED

How this triple was built (2 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: survival of motor neuron 2, centromeric | Statement: [SMN2, fullName, survival of motor neuron 2, centromeric]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: survival of motor neuron 2, centromeric
Context triple: [SMN2, fullName, survival of motor neuron 2, centromeric]
  • A. survival motor neuron protein
    Survival motor neuron protein is an essential cellular protein required for the maintenance and function of motor neurons, whose deficiency leads to spinal muscular atrophy.
  • B. SMN2 chosen
    SMN2 is a human gene that produces a backup form of survival motor neuron protein and is a key therapeutic target in spinal muscular atrophy.
  • C. 5q13 (SMN1)
    5q13 (SMN1) is a gene located on chromosome 5 whose proper function is critical for motor neuron survival and whose mutations are the primary cause of spinal muscular atrophy.
  • D. SMN complex
    The SMN complex is a multiprotein assembly crucial for the biogenesis of small nuclear ribonucleoproteins (snRNPs) and proper pre-mRNA splicing, with key roles in motor neuron survival.
  • E. SOD1
    SOD1 is a gene encoding the antioxidant enzyme superoxide dismutase 1, whose mutations are a major known cause of familial amyotrophic lateral sclerosis (ALS).
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.

Provenance (3 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c00878d06881909ee78e88913bf890 completed March 22, 2026, 3:19 p.m.
NER Named-entity recognition batch_69c05723c91c819090b4d4672e72f9f3 completed March 22, 2026, 8:54 p.m.
NED1 Entity disambiguation (via context triple) batch_69c11d23dca8819080702ca0f05df5dd completed March 23, 2026, 10:59 a.m.
Created at: March 22, 2026, 4:10 p.m.