Using NVIDIA GeForce RTX 3090 24G GPU, using DPM++2M Karras with steps of 201024 * 1024 to generate a graph at a speed of 2.55 it/s. Is this speed normal? #15113
hjj-lmx
started this conversation in
Optimization
Replies: 1 comment 2 replies
-
With TensorRT you will hit a max of 5 it/s, and that is the limit of the 3090 for single image. FP8 does not give speed improvements on the Ampere tensor cores, only memory savings. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
How can you optimize the speed? If you use Controlnet, the speed will be slower. Do you have any good suggestions?
Beta Was this translation helpful? Give feedback.
All reactions