GPT-OSS Performance Optimizations on NVIDIA Blackwell: Pushing the Pareto Frontier
📝
内容提要
TL;DR: In collaboration with the open-source community, vLLM + NVIDIA has achieved significant performance milestones on the gpt-oss-120b model running on NVIDIA’s Blackwell GPUs. Through deep...
➡️