高效的 FP4 混合量化扩散变换器(HQ-DiT)
📝
内容提要
Diffusion Transformers (DiTs) are improved by Hybrid Floating-point Quantization (HQ-DiT), a post-training quantization method utilizing 4-bit floating-point precision on both weights and...
➡️