Replies: 4 comments 1 reply
-
@Deminisa you have to remove the |
Beta Was this translation helpful? Give feedback.
-
@Deminisa: Download cuDNN for linux and place files in your /output directory (this is for convenience, because it is easily accessible from inside container, so you can unpack files later inside container into proper destination directory). You can do cuDNN files (libraries) are in If it works for you (this is temporary solution to check it out), than we'll think about incorporating it in regular dockerfile so everybody that can benefit from this will do so. |
Beta Was this translation helpful? Give feedback.
-
Thank you very much for the instructions! I was originally under the impression that the container wasn't persistent as I'm having issues trying to properly set up the dreambooth plugin in auto (not being able to find a CUDA shared library to point to for bitsandbytes/8-bit adam, and installing cuda-libraries with apt gets removed after each container restart), but I digress and something for another discussion. All tests were done with the v1-5-pruned-emaonly.ckpt [cc6cb27103] checkpoint (https://huggingface.co/runwayml/stable-diffusion-v1-5/blob/main/v1-5-pruned-emaonly.ckpt) Prompt: a photo of a person Default install Default install with cudnn-linux-x86_64-8.6.0.163_cuda11.8 Default install with cudnn-linux-x86_64-8.7.0.84_cuda11 Also did a couple of tests with xformers 0.0.17 built with TORCH_CUDA_ARCH_LIST="8.6+ptx" xformers-0.0.17+00afc12.d20230207-cp310-cp310-linux_x86_64.whl with cudnn-linux-x86_64-8.6.0.163_cuda11.8 xformers-0.0.17+00afc12.d20230207-cp310-cp310-linux_x86_64.whl with cudnn-linux-x86_64-8.7.0.84_cuda11 Made a few notes regarding the test: Note 1) Note 2) Note 3) Note 4) webui-docker-auto-1 | Checkpoint v1-5-pruned-emaonly.ckpt [cc6cb27103] not found; loading fallback sd-v1-5-inpainting.ckpt [c6bbc15e32] So in summary ... While anecdotal, I could see a great improvement by updating the libraries. Still not reaching the low to mid 20's it/s as reported by others though. |
Beta Was this translation helpful? Give feedback.
-
We'll figure it out. |
Beta Was this translation helpful? Give feedback.
-
Hi,
Was wondering which kind of performance other people were seeing on 40X0 cards as I'm having some issues with a 4090 reaching it/s remotely close to what others have been reporting.
I.e generating "photo of a person" with euler a, 20 steps, 512x512 I'm getting everywhere between 3.8 and 4.5 it/s with 1 batch count and 1 batch size.
System:
i7 9700k, 64GB RAM, M2 NVMe (the CPU is a few generations old, but assuming it's not making it/s that much worse)
Clean install of Windows 11 22H2
WSL 2 Ubuntu 20.04
Docker 4.16.3
Clean container from the newest commit of this repo (1e0561c) on AUTO1111 having everything on default settings
Also tried building and using xformers-0.0.17+00afc12.d20230207 but did not see any noticeable improvements.
Any tips would be appreciated! 😀
Cheers!
Beta Was this translation helpful? Give feedback.
All reactions