Optimization story: Bloom inference

Hugging Face - Blog
Hugging Face - Blog ·

Incredibly Fast BLOOM Inference with DeepSpeed and Accelerate

Hugging Face - Blog
Hugging Face - Blog ·