As announced at @NVIDIAGTC today, we've collaborated to optimize Stable Diffusion 3.5 using TensorRT and FP8 quantization. Compared to the base @PyTorch models, these optimizations deliver 2.3x faster generation with SD3.5 Large and 1.7x faster generation with SD3.5 Medium.
Stable Diffusion 3.5 Optimization Delivers 2.3x Faster Generation