DistilBERT

E435865

DistilBERT is a smaller, faster, and lighter-weight distilled version of the BERT language model designed to retain most of its performance while being more efficient for practical NLP applications.

All labels observed (1)

Label Occurrences
DistilBERT canonical 1

How this entity was disambiguated

Statements (49)

Predicate Object
instanceOf distilled model
neural network model
pretrained language model
transformer-based language model
architectureType Transformer
availableInLibrary Transformers NERFINISHED
basedOn BERT NERFINISHED
compatibleWith Hugging Face Tokenizers NERFINISHED
ONNX export
designedFor efficiency
faster inference
lower memory usage
developedBy Hugging Face NERFINISHED
distilledFrom BERT base uncased NERFINISHED
hasModelVariant distilbert-base-cased NERFINISHED
distilbert-base-multilingual-cased NERFINISHED
distilbert-base-uncased
distilbert-base-uncased-finetuned-sst-2-english
hiddenSize 768
implementedIn PyTorch NERFINISHED
inputType tokenized text
language English
license Apache-2.0
maintainedBy Hugging Face NERFINISHED
numberOfAttentionHeads 12
numberOfLayers 6
outputType contextualized token embeddings
sequence representation
paperArchive arXiv NERFINISHED
paperArxivId 1910.01108
paperTitle DistilBERT: a distilled version of BERT: smaller, faster, cheaper and lighter NERFINISHED
parameterCountRelativeTo about 40 percent fewer parameters than BERT base
performanceRetention retains about 97 percent of BERT base performance on GLUE
releasedBy Hugging Face NERFINISHED
releaseYear 2019
speedRelativeTo about 60 percent faster than BERT base
supportsTask feature extraction
named entity recognition
question answering
sentiment analysis
sequence labeling
text classification
tokenizerType WordPiece
trainingObjective distillation from BERT
masked language modeling
usedFor production NLP systems
resource-constrained environments
usesMechanism self-attention
vocabularySize 30522

How these facts were elicited

Referenced by (1)

Full triples — surface form annotated when it differs from this entity's canonical label.