DeepSpeed

E760434

DeepSpeed is a deep learning optimization library from Microsoft that enables efficient, large-scale training of models across distributed GPU systems.

All labels observed (1)

Label Occurrences
DeepSpeed canonical 1

Statements (49)

Predicate Object
instanceOf deep learning optimization library
open-source software project
category distributed computing library
machine learning software
developer Microsoft
Microsoft DeepSpeed team
enables scaling to thousands of GPUs
training models that do not fit in single-GPU memory
feature 3D parallelism
BF16 training
DeepSpeed-Inference
DeepSpeed-MoE
FP16 training
Mixture-of-Experts training support
ZeRO optimizer
ZeRO-Infinity
ZeRO-Offload
activation checkpointing
activation partitioning
checkpointing utilities
communication optimization
data parallelism
gradient checkpointing
mixed precision training
model parallelism
optimizer state partitioning
parameter partitioning
pipeline parallelism
sparse attention
tensor parallelism
throughput benchmarking tools
license MIT License
optimizedFor NVIDIA GPUs
programmingLanguage C++
Python
repository https://github.com/microsoft/DeepSpeed
specializesIn GPU acceleration
distributed training
large-scale model training
memory optimization
training throughput optimization
supports model parallel training of billion-parameter models
offloading to CPU memory
offloading to NVMe storage
supportsFramework PyTorch
useCase multi-GPU training
multi-node training
training transformer models
training very large language models
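
Illustrative sketch (not part of the recorded statements): the partitioning and offloading features listed above can be made concrete with a little arithmetic and a configuration dictionary. The byte counts below follow the usual accounting for mixed-precision Adam (2 bytes of FP16 weights, 2 bytes of FP16 gradients, 12 bytes of FP32 optimizer state per parameter) and cover model states only, not activations. The function names `zero_bytes_per_gpu` and `build_ds_config`, the batch-size values, and the NVMe path are hypothetical choices made for this example; the configuration keys are the ones DeepSpeed's JSON config is understood to accept, and should be checked against the installed version.

```python
import json

# Assumed per-parameter byte costs for mixed-precision Adam (model states only).
PARAM_BYTES = 2    # FP16 weights
GRAD_BYTES = 2     # FP16 gradients
OPTIM_BYTES = 12   # FP32 weights + momentum + variance


def zero_bytes_per_gpu(num_params, num_gpus, stage):
    """Estimate model-state bytes per GPU; stage 0 means plain data parallelism."""
    if stage not in (0, 1, 2, 3):
        raise ValueError("stage must be 0, 1, 2 or 3")
    params = PARAM_BYTES * num_params
    grads = GRAD_BYTES * num_params
    optim = OPTIM_BYTES * num_params
    if stage >= 1:      # stage 1 partitions optimizer states
        optim /= num_gpus
    if stage >= 2:      # stage 2 also partitions gradients
        grads /= num_gpus
    if stage >= 3:      # stage 3 also partitions parameters
        params /= num_gpus
    return params + grads + optim


def build_ds_config(stage=3, offload_device=None, nvme_path=None):
    """Build a DeepSpeed-style config dict (hypothetical helper, example values)."""
    zero = {"stage": stage}
    if offload_device in ("cpu", "nvme") and stage >= 1:
        offload = {"device": offload_device, "pin_memory": True}
        if offload_device == "nvme":
            offload["nvme_path"] = nvme_path
        zero["offload_optimizer"] = dict(offload)
        if stage == 3:  # parameter offload only applies to stage 3
            zero["offload_param"] = dict(offload)
    return {
        "train_micro_batch_size_per_gpu": 1,   # example value
        "gradient_accumulation_steps": 1,      # example value
        "bf16": {"enabled": True},
        "zero_optimization": zero,
    }


if __name__ == "__main__":
    for stage in range(4):
        gb = zero_bytes_per_gpu(7_500_000_000, 64, stage) / 1e9
        print(f"stage {stage}: {gb:.2f} GB per GPU")
    print(json.dumps(build_ds_config(3, "nvme", "/local_nvme"), indent=2))
```

A dictionary like this would typically be passed as the `config` argument of `deepspeed.initialize`, or written to a JSON file and referenced from the launcher command line. The estimate shows why partitioning matters: for a 7.5-billion-parameter model on 64 GPUs, the per-GPU model-state footprint falls from 120 GB with plain data parallelism to under 2 GB with all three kinds of state partitioned.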

Referenced by (1)

Full triples — surface form annotated when it differs from this entity's canonical label.

NCCL usedBy DeepSpeed