高效的 FP4 混合量化扩散变换器(HQ-DiT)

📝

内容提要

Diffusion Transformers (DiTs) are improved by Hybrid Floating-point Quantization (HQ-DiT), a post-training quantization method utilizing 4-bit floating-point precision on both weights and...

➡️

继续阅读