Triple
T8357456
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Whittaker five-kingdom system | E196714 | entity |
| Predicate | groupsUnicellularEukaryotesIn | P37835 | FINISHED |
| Object | Protista | E6162 | NE FINISHED |
How this triple was built (3 steps)
Every LLM step that produced this triple, in pipeline order: named-entity classification, the disambiguation choices (the exact options shown, with the pick marked), and the generated description. The batch and timestamp of each step are in the Provenance table below.
NER
Named-entity recognition
gpt-5-mini
Instruction
Given a phrase, classify it is english named entity (e.g., persons, organizations, works of art) in Latin script, or not (e.g., literals, dates, URLs, verbose phrases). For disambiguation, the statement where the phrase occurs as object is also given. Please return a JSON object with `phrase` (string, the phrase being analyzed) and `is_ne` (boolean, indicating whether the phrase is a Named Entity).
Input
Phrase: Protista | Statement: [Whittaker five-kingdom system, groupsUnicellularEukaryotesIn, Protista]
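The instruction above asks for a JSON object with a `phrase` string and an `is_ne` boolean. As a minimal sketch, a reply of that shape could be validated as follows. The helper name `parse_ner_response` and the sample reply are hypothetical, and the `is_ne` value shown is assumed for illustration rather than taken from the recorded model output.

```python
import json


def parse_ner_response(raw: str) -> tuple[str, bool]:
    """Validate a NER reply shaped as {"phrase": str, "is_ne": bool}."""
    data = json.loads(raw)
    phrase = data["phrase"]
    is_ne = data["is_ne"]
    if not isinstance(phrase, str):
        raise ValueError("phrase must be a string")
    if not isinstance(is_ne, bool):
        raise ValueError("is_ne must be a boolean")
    return phrase, is_ne


# Hypothetical reply; the is_ne value is assumed, not the recorded output.
sample = '{"phrase": "Protista", "is_ne": true}'
phrase, is_ne = parse_ner_response(sample)
```

Checking the types explicitly guards against replies such as `"is_ne": "true"`, which parse as valid JSON but do not match the requested schema.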
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: Protista
Context triple: [Whittaker five-kingdom system, groupsUnicellularEukaryotesIn, Protista]
- A. Protista (chosen): Protista is a diverse kingdom of mostly single-celled eukaryotic organisms that are neither animals, plants, nor fungi.
- B. Chromista: Chromista is a diverse eukaryotic kingdom that includes many algae and protist groups, such as brown algae and diatoms, characterized by complex plastids and often aquatic lifestyles.
- C. Eukarya: Eukarya is the domain of life comprising all organisms with complex eukaryotic cells containing membrane-bound organelles and a true nucleus.
- D. Opisthokonta: Opisthokonta is a major clade of eukaryotes that includes animals, fungi, and their closest unicellular relatives, united by shared cellular and molecular features.
- E. Excavata: Excavata is a major supergroup of unicellular eukaryotes characterized by a ventral feeding groove and often modified mitochondria, including many free-living flagellates and important parasites.
- F. None of above.
- G. Unsure - the case is ambiguous/there is not enough information to decide.
PD
Predicate disambiguation
gpt-5-mini-2025-08-07
Target predicate: groupsUnicellularEukaryotesIn
Context triple: [Whittaker five-kingdom system, groupsUnicellularEukaryotesIn, Protista]
- A. isEukaryote: Indicates that an organism belongs to the domain of life characterized by cells with a true nucleus and membrane-bound organelles.
- B. isMulticellular: Indicates that an organism consists of multiple cells organized into a single functional individual.
- C. cellStructure: Indicates the structural organization, components, and physical arrangement that make up a cell.
- D. organismType (chosen): Indicates the biological classification or kind of organism that an entity is.
- E. cellularOrganization: Indicates how the components within a biological cell are structured, arranged, and functionally organized in relation to one another.
- F. None of above.
Provenance (4 batches)
The batch behind each pipeline step, in order, with when it ran. Timestamps are batch-level — stages were processed in waves, so the object chain (NER → NED1 → NEDg → NED2) reads in order, but predicate / elicitation batches can sit in a different wave.
| Step | Stage | Batch ID | Status | When |
|---|---|---|---|---|
| creating | Elicitation | batch_69ca82f08b348190bfb7881944bbff6f | completed | March 30, 2026, 2:04 p.m. |
| NER | Named-entity recognition | batch_69cb804b57f88190907a4e4e389caf5f | completed | March 31, 2026, 8:05 a.m. |
| NED1 | Entity disambiguation (via context triple) | batch_69cde7bd74fc8190abdb2813d51d79f8 | completed | April 2, 2026, 3:51 a.m. |
| PD | Predicate disambiguation | batch_69cb70ca25548190b0f90c5384e3fb3c | completed | March 31, 2026, 6:59 a.m. |
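The note on waves can be checked against the timestamps in the table. The sketch below sorts the four batches chronologically; the dictionary keys reuse the Step labels, and the variable names are illustrative rather than part of the pipeline.

```python
from datetime import datetime

# Batch-level timestamps copied from the Provenance table.
batches = {
    "creating": datetime(2026, 3, 30, 14, 4),
    "NER": datetime(2026, 3, 31, 8, 5),
    "NED1": datetime(2026, 4, 2, 3, 51),
    "PD": datetime(2026, 3, 31, 6, 59),
}

# Order the steps by when their batch ran.
chronological = sorted(batches, key=batches.get)
print(chronological)  # ['creating', 'PD', 'NER', 'NED1']
```

The object chain stays in order (NER before NED1), while the PD batch ran before both of them, so it sits in a different wave from its position in the table.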
Created at: March 30, 2026, 5:59 p.m.