I only got 31 it/s on Windows with my 4090 and I am happy about that. #8721
Replies: 10 comments 19 replies
-
Out of all the people in that thread, you tagged me... I just was curious about why it is that way... So, long story short: windows sucks, better use linux. Thanks for your insight! |
Beta Was this translation helpful? Give feedback.
-
So to get the linux speedboost, would it be sufficient to run A1111 with --listen on a windows ubuntu wsl machine? And then access it from windows via browser and IP? Or does it has to be linux on boot? There are still so many users out there who I tried to convince to get a better CPU for their 4090 because the card really needs it, but nope. "SD only uses GPU there is no CPU involved!" yea...no... and memory bandwith and PCIE version also absolutely dont matter... /sarcasm off. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
With my Ryzen 9 5900X I get 30-33 it/s for batch size 1 and 49 it/s for bath size 4 on Ubuntu 22.04 WSL2 (Windows 11)
2.1
|
Beta Was this translation helpful? Give feedback.
-
I don't understand these numbers. I have 6 G of VRAM and I'm pretty happy with ~1.5 it/s. Are you able to generate one image a second or something? Is this some CPU-based metric which doesn't compare equivalently? Or have I been doing something wrong and you all have been generating real time video for weeks now? |
Beta Was this translation helpful? Give feedback.
-
And here I am on an m1 max getting 3 it/s max. Which is after optimizations, it was 1.5 before pytorch 2.0.... Now I'm depressed, thank you. |
Beta Was this translation helpful? Give feedback.
-
@aifartist How to upgrade to 2.0.0+cu118. |
Beta Was this translation helpful? Give feedback.
-
I only get 21, 31 would be amazing. Win11 4090 |
Beta Was this translation helpful? Give feedback.
-
What about Messaged Signalling Interrupts? |
Beta Was this translation helpful? Give feedback.
-
Why? Because I finally got tired of all the noise about windows users not getting the 39.5 it/s or faster that I get on Linux.
Why am I happy it wasn't as fast? Because it only took me a few minutes to repro the problem so I could see the issue myself.
I had a dual boot system since I bought my high end box in Dec. but have been on Ubuntu ever since the first week having it.
Today I booted Windows.
I had to upgrade the python there to 3.10 from the 3.9 already installed.
I did a git clone of A1111.
Did the first time start up, for it to install everything, and saw the expected slow 13 it/s.
Upgraded to torch 2.0.0+cu118, and got about 31 it/s.
Bingo! 39+ it/s on Ubuntu and 31-32 on Windows.
Using Task Manager I can see quite large amounts of system CPU usage.
You have to click "show kernel times" and it is easier if you use the per logical processor view.
My estimate based on what I saw is that, in total, 2X cores are being used instead of the 1 expected.
The main python process is using 100% of the processor it is running on but one, or sometimes two other processors just get hammered with system processing.
Turned off GPU scheduling and got to 35 it/s.
I suspect with the SD 2.1 model it'd be closer to 37 it/s.
Ran on my slower 4.3 GHz cpus, to simulate users with slower processors, trying to push a 4090, and only got 22.5.
I was a bit surprised that turning off GPU scheduling made more of a difference on slower CPU.
On a slow CPU one would think the GPU offload to a fast GPU might be better.
But I don't intend to do a deep tech dive looking at the Windows interrupt handling system(possible source of high kernel time) and thread scheduling which seems to bounce processes back and forth between CPU's more than Linux does.
Windows 10 vs Windows 11?
Windows vs Windows pro?
Windows pro vs Windows Enterprise?
Windows version?
PCI capabilities, or BIOS settings, allowing or not allowing kernel bypass without needing interrupts.
To bad I wasn't playing with this stuff just before I retired from Microsoft. I could have asked internally.
Now I'd want to know where to send the bill to figure this out. :-)
I've satisfied my curiosity. I'm back on Ubuntu now.
@SamueleLorefice
Beta Was this translation helpful? Give feedback.
All reactions