Triple
T6065067
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | SMN2 |
E135135
|
entity |
| Predicate | primaryTranscript |
P68440
|
FINISHED |
| Object | SMN2 pre-mRNA |
E135135
|
NE FINISHED |
How this triple was built (3 steps)
Every LLM step that produced this triple, in pipeline order — named-entity classification, the disambiguation choices (the exact options shown, with the pick highlighted), and the generated description. The batch + timestamp of each is in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: SMN2 pre-mRNA | Statement: [SMN2, primaryTranscript, SMN2 pre-mRNA]
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: SMN2 pre-mRNA Context triple: [SMN2, primaryTranscript, SMN2 pre-mRNA]
-
A.
SMN2
chosen
SMN2 is a human gene that produces a backup form of survival motor neuron protein and is a key therapeutic target in spinal muscular atrophy.
-
B.
SMN complex
The SMN complex is a multiprotein assembly crucial for the biogenesis of small nuclear ribonucleoproteins (snRNPs) and proper pre-mRNA splicing, with key roles in motor neuron survival.
-
C.
SMN1 gene
The SMN1 gene is a human gene whose proper function is critical for motor neuron survival, and mutations in it are the primary cause of spinal muscular atrophy.
-
D.
5q13 (SMN1)
5q13 (SMN1) is a gene located on chromosome 5 whose proper function is critical for motor neuron survival and whose mutations are the primary cause of spinal muscular atrophy.
-
E.
survival motor neuron protein
Survival motor neuron protein is an essential cellular protein required for the maintenance and function of motor neurons, whose deficiency leads to spinal muscular atrophy.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
PD
Predicate disambiguation
gpt-5-mini-2025-08-07
Target predicate: primaryTranscript Context triple: [SMN2, primaryTranscript, SMN2 pre-mRNA]
-
A.
primaryStructure
Indicates that one entity serves as the main or principal structural component or framework in relation to another entity.
-
B.
transcription
Indicates the process by which genetic information in DNA is copied into RNA by RNA polymerase.
-
C.
primaryReference
Indicates that one entity serves as the main or authoritative source of information or citation for another entity.
-
D.
primaryFor
Indicates that one entity serves as the main or principal option, resource, or association for another entity among possible alternatives.
-
E.
primaryType
Indicates the main or most fundamental category or classification assigned to an entity, distinguishing it from any secondary or auxiliary types.
- F. None of above. chosen
Provenance (5 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69c00878d06881909ee78e88913bf890 |
completed | March 22, 2026, 3:19 p.m. |
| NER | Named-entity recognition | batch_69c05723c91c819090b4d4672e72f9f3 |
completed | March 22, 2026, 8:54 p.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69c11d23dca8819080702ca0f05df5dd |
completed | March 23, 2026, 10:59 a.m. |
| PD | Predicate disambiguation | batch_69c049f031408190b08b2766237c5dd0 |
completed | March 22, 2026, 7:58 p.m. |
| PDg | Predicate description generation | batch_69c04e8d4a148190bd8f95caae978e1b |
completed | March 22, 2026, 8:18 p.m. |
Created at: March 22, 2026, 4:10 p.m.