Flux Dev (Base)
Inference Time: 5.98s
Model Size: 33GB
Compilation Time: N/A
Cost: $4468.39*
Flux Turbo (Pruna)
Inference Time: 3.9s
Model Size: 33GB
Compilation Time: 10min
Cost: $2914.17*
* Cost of generating 1M images on an H100 on Replicate. For quantized models, the cost can be lowered further by switching to a smaller GPU.
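The cost figures above appear to follow directly from the inference times. As a rough sketch, assuming cost scales linearly with per-image inference time (cost = time per image × number of images × per-second GPU price), the per-second price implied by each row can be recovered and the speedup and savings computed. The variable names are illustrative, and the implied rate is derived from the numbers above rather than taken from Replicate's price list; one-off compilation time and cold starts are ignored.

```python
# Figures copied from the comparison above.
N_IMAGES = 1_000_000
base_time_s, base_cost_usd = 5.98, 4468.39    # Flux Dev (Base)
turbo_time_s, turbo_cost_usd = 3.9, 2914.17   # Flux Turbo (Pruna)

# Assumption: cost = time_per_image * images * price_per_second.
# Solving for the price gives the rate implied by each row.
base_rate = base_cost_usd / (base_time_s * N_IMAGES)
turbo_rate = turbo_cost_usd / (turbo_time_s * N_IMAGES)

speedup = base_time_s / turbo_time_s
savings_usd = base_cost_usd - turbo_cost_usd
savings_pct = 100 * savings_usd / base_cost_usd

print(f"Implied rate (base):  ${base_rate * 3600:.2f}/hour")
print(f"Implied rate (turbo): ${turbo_rate * 3600:.2f}/hour")
print(f"Speedup: {speedup:.2f}x")
print(f"Savings per 1M images: ${savings_usd:.2f} ({savings_pct:.1f}%)")
```

Both rows imply essentially the same hourly rate, which suggests the saving comes entirely from the shorter inference time and not from a cheaper GPU.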