GPT-OSS Performance Optimizations on NVIDIA Blackwell: Pushing the Pareto Frontier

TL;DR: In collaboration with the open-source community, vLLM and NVIDIA have achieved significant performance milestones on the gpt-oss-120b model running on NVIDIA’s Blackwell GPUs. Through deep...
