Triple

T6065067
Position Surface form Disambiguated ID Type / Status
Subject SMN2 E135135 entity
Predicate primaryTranscript P68440 FINISHED
Object SMN2 pre-mRNA E135135 NE FINISHED

How this triple was built (3 steps)

Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.

NER Named-entity recognition gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: SMN2 pre-mRNA | Statement: [SMN2, primaryTranscript, SMN2 pre-mRNA]
NED1 Entity disambiguation (via context triple) gpt-5-mini-2025-08-07
Target entity: SMN2 pre-mRNA
Context triple: [SMN2, primaryTranscript, SMN2 pre-mRNA]
  • A. SMN2 chosen
    SMN2 is a human gene that produces a backup form of survival motor neuron protein and is a key therapeutic target in spinal muscular atrophy.
  • B. SMN complex
    The SMN complex is a multiprotein assembly crucial for the biogenesis of small nuclear ribonucleoproteins (snRNPs) and proper pre-mRNA splicing, with key roles in motor neuron survival.
  • C. SMN1 gene
    The SMN1 gene is a human gene whose proper function is critical for motor neuron survival, and mutations in it are the primary cause of spinal muscular atrophy.
  • D. 5q13 (SMN1)
    5q13 (SMN1) is a gene located on chromosome 5 whose proper function is critical for motor neuron survival and whose mutations are the primary cause of spinal muscular atrophy.
  • E. survival motor neuron protein
    Survival motor neuron protein is an essential cellular protein required for the maintenance and function of motor neurons, whose deficiency leads to spinal muscular atrophy.
  • F. None of above.
  • G. Unsure - the case is ambiguous/there is not enough information to decide.
PD Predicate disambiguation gpt-5-mini-2025-08-07
Target predicate: primaryTranscript
Context triple: [SMN2, primaryTranscript, SMN2 pre-mRNA]
  • A. primaryStructure
    Indicates that one entity serves as the main or principal structural component or framework in relation to another entity.
  • B. transcription
    Indicates the process by which genetic information in DNA is copied into RNA by RNA polymerase.
  • C. primaryReference
    Indicates that one entity serves as the main or authoritative source of information or citation for another entity.
  • D. primaryFor
    Indicates that one entity serves as the main or principal option, resource, or association for another entity among possible alternatives.
  • E. primaryType
    Indicates the main or most fundamental category or classification assigned to an entity, distinguishing it from any secondary or auxiliary types.
  • F. None of above. chosen

Provenance (5 batches)

The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.

Step Stage Batch ID Status When
creating Elicitation batch_69c00878d06881909ee78e88913bf890 completed March 22, 2026, 3:19 p.m.
NER Named-entity recognition batch_69c05723c91c819090b4d4672e72f9f3 completed March 22, 2026, 8:54 p.m.
NED1 Entity disambiguation (via context triple) batch_69c11d23dca8819080702ca0f05df5dd completed March 23, 2026, 10:59 a.m.
PD Predicate disambiguation batch_69c049f031408190b08b2766237c5dd0 completed March 22, 2026, 7:58 p.m.
PDg Predicate description generation batch_69c04e8d4a148190bd8f95caae978e1b completed March 22, 2026, 8:18 p.m.
Created at: March 22, 2026, 4:10 p.m.