Triple
T18204749
| Position | Surface form | Disambiguated ID | Type / Status |
|---|---|---|---|
| Subject | Bloom | E435874 | entity |
| Predicate | tokenizerType | P21075 | FINISHED |
| Object | SentencePiece | — | NE NERFINISHED |
Disambiguation candidates (2 decisions)
The exact options the model was shown at each disambiguation step, with the option it chose marked as chosen. These decisions are the evidence behind this triple's disambiguated IDs.
NED1
Entity disambiguation (via context triple)
gpt-5-mini-2025-08-07
Target entity: SentencePiece
Context triple: [Bloom, tokenizerType, SentencePiece]
- A. TensorFlow Text: TensorFlow Text is a library of text-related ops and utilities that extends TensorFlow for building, training, and serving natural language processing models.
- B. Hugging Face Transformers: Hugging Face Transformers is a widely used open-source library that provides state-of-the-art transformer-based models and tools for natural language processing and related machine learning tasks.
- C. DistilBERT: DistilBERT is a smaller, faster, and lighter-weight distilled version of the BERT language model designed to retain most of its performance while being more efficient for practical NLP applications.
- D. AllenNLP: AllenNLP is an open-source natural language processing research library built on PyTorch, designed to facilitate the development and evaluation of state-of-the-art NLP models.
- E. ELPI: ELPI is an implementation of the λProlog logic programming language, designed for higher-order abstract syntax and interactive theorem proving applications.
- F. None of above. (chosen)
- G. Unsure - the case is ambiguous/there is not enough information to decide.
NED2
Entity disambiguation (via description)
gpt-5-mini-2025-08-07
Target entity: SentencePiece
Target entity description: SentencePiece is an unsupervised text tokenizer and detokenizer library, widely used in modern NLP models to perform subword segmentation independent of language- or whitespace-specific rules.
- A. TensorFlow Text: TensorFlow Text is a library of text-related ops and utilities that extends TensorFlow for building, training, and serving natural language processing models.
- B. Hugging Face Transformers: Hugging Face Transformers is a widely used open-source library that provides state-of-the-art transformer-based models and tools for natural language processing and related machine learning tasks.
- C. DistilBERT: DistilBERT is a smaller, faster, and lighter-weight distilled version of the BERT language model designed to retain most of its performance while being more efficient for practical NLP applications.
- D. AllenNLP: AllenNLP is an open-source natural language processing research library built on PyTorch, designed to facilitate the development and evaluation of state-of-the-art NLP models.
- E. ELPI: ELPI is an implementation of the λProlog logic programming language, designed for higher-order abstract syntax and interactive theorem proving applications.
- F. None of above. (chosen)
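Both decisions selected "None of above.", which is consistent with the object SentencePiece having no disambiguated ID in the triple table. The following is a minimal Python sketch of that reading, not the pipeline's actual code: the `Decision` class and the `chosen_candidate` and `linked_candidate` functions are hypothetical names introduced for illustration, and the rule "link to the first step that picks a real candidate, otherwise leave the entity unlinked" is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

# Option labels that mean "no candidate was selected" (copied from the option lists).
NON_CANDIDATE_LABELS = {
    "None of above.",
    "Unsure - the case is ambiguous/there is not enough information to decide.",
}


@dataclass
class Decision:
    """One disambiguation step (hypothetical structure, for illustration only)."""
    step: str                # e.g. "NED1"
    options: Dict[str, str]  # option letter -> option label
    chosen: str              # letter of the option the model chose


def chosen_candidate(decision: Decision) -> Optional[str]:
    """Return the chosen candidate's name, or None if no candidate was selected."""
    label = decision.options[decision.chosen]
    return None if label in NON_CANDIDATE_LABELS else label


def linked_candidate(decisions: List[Decision]) -> Optional[str]:
    """Assumed rule: the first step that picks a real candidate decides the link."""
    for decision in decisions:
        candidate = chosen_candidate(decision)
        if candidate is not None:
            return candidate
    return None  # every step rejected all candidates, so the entity stays unlinked


# Options A-F as shown in both steps; NED1 additionally offered option G.
CANDIDATES = {
    "A": "TensorFlow Text",
    "B": "Hugging Face Transformers",
    "C": "DistilBERT",
    "D": "AllenNLP",
    "E": "ELPI",
    "F": "None of above.",
}
UNSURE = "Unsure - the case is ambiguous/there is not enough information to decide."

ned1 = Decision("NED1", {**CANDIDATES, "G": UNSURE}, chosen="F")
ned2 = Decision("NED2", dict(CANDIDATES), chosen="F")

print(linked_candidate([ned1, ned2]))  # prints None: no candidate was accepted
```

Under this assumed rule, a `None` result corresponds to the empty Disambiguated ID cell for the object.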
Provenance (2 batches)
| Stage | Batch ID | Job type | Status |
|---|---|---|---|
| creating | batch_69d8b90dba6481908e119eb9aa4ca0cb | elicitation | completed |
| NER | batch_69e4e222831081908f7d5500424e3acb | ner | completed |
Created at: April 10, 2026, 10:32 a.m.