Can anyone help me troubleshoot why this workflow is so slow?

I will start off by saying I am a total noob to this. I have had ComfyUI for a little over a week and have been slogging through pixorama tutorials.

I came across this tutorial a few days ago using this workflow (Patreon link, but the workflow is free… I am using the Q5_K_M GGUF for my testing, which should align with my GPU) and have been messing with it ever since. One thing I notice is that my generations are PAINFULLY slow. The workflow took 40+ minutes to complete before I did a RAM upgrade and now takes between 24 and 35 minutes. I have an RTX 4060 Ti with 16GB VRAM. A1111 can create a 1024x1024 image in around 15ish seconds without any optimization using a larger model like RealisticVision. I would expect this workflow to take around 10ish minutes max (20 seconds per image × 30 images), but it's taking at minimum double that.
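To put rough numbers on that gap, here is a back-of-the-envelope sketch. The 30-image count and the 20 seconds per image are just my own estimates from above, and the function names are made up for illustration.

```python
def expected_minutes(images: int, seconds_per_image: float) -> float:
    """Total run time in minutes if every image takes the same time."""
    return images * seconds_per_image / 60


def implied_seconds_per_image(total_minutes: float, images: int) -> float:
    """Average seconds per image implied by an observed total run time."""
    return total_minutes * 60 / images


IMAGES = 30  # my estimate of how many images the workflow produces

print(expected_minutes(IMAGES, 20))           # what I would expect: 10.0 minutes
print(implied_seconds_per_image(24, IMAGES))  # best observed run: 48.0 s per image
print(implied_seconds_per_image(35, IMAGES))  # worst observed run: 70.0 s per image
```

So the runs I am seeing work out to somewhere between 48 and 70 seconds per image on average, versus the 20 I was hoping for.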

Things I have tried to resolve this:

  • Upgrading RAM to 32GB and enabling overclocking in BIOS for 3200 MT/s speeds (this was the only thing that significantly reduced the time, but nowhere near as much as I had hoped)

  • Putting ComfyUI into --highvram mode (currently still in highvram mode)

  • Changing GPU drivers (game vs stability, currently have game)

  • Messing with the system fallback settings in my NVIDIA control panel (driver default always works the best; no OOM errors in any of the testing I did)

Apart from the RAM upgrade, none of these have worked for me… even a little.

Things I notice when I run the workflow:

  • It seems to get hung up on the KSampler, and sometimes I do not see my GPU fire up for multiple minutes. Eventually the GPU will go to 100% and the image will generate, but it seems like it's getting hung on something before the generation kicks in.

  • The time ComfyUI tells me it took to process is way less than it actually took. Idk if Comfy is just counting time spent generating, but the number of seconds Comfy gives me at the end undercounts by around 10 minutes on average.

  • For some reason the workflow fails every single time I first load it. I need to go back in and re-select the models (not change anything, literally just re-select them even though they are already selected) and THEN the workflow will work.
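In case it helps with diagnosing the stall, here is a minimal sketch of how GPU utilization and VRAM usage could be logged with timestamps while the workflow runs. It assumes nvidia-smi is on the PATH and a single GPU; the helper names, the 60-second duration, and the one-second interval are my own choices, not anything from ComfyUI.

```python
import subprocess
import time

# Ask nvidia-smi for utilization and memory as plain comma-separated numbers.
QUERY = [
    "nvidia-smi",
    "--query-gpu=utilization.gpu,memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_line(line: str) -> dict:
    """Parse one CSV line such as '100, 15234, 16380' into named integers."""
    util, used, total = (int(field.strip()) for field in line.split(","))
    return {"util_pct": util, "vram_used_mib": used, "vram_total_mib": total}


def sample_gpu() -> dict:
    """Run nvidia-smi once and return the reading for the first GPU."""
    out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    return parse_gpu_line(out.splitlines()[0])


def log_gpu(duration_s: float = 60.0, interval_s: float = 1.0) -> None:
    """Print a timestamped reading every interval_s seconds for duration_s seconds."""
    start = time.monotonic()
    while time.monotonic() - start < duration_s:
        reading = sample_gpu()
        elapsed = time.monotonic() - start
        print(
            f"{elapsed:6.1f}s  util={reading['util_pct']:3d}%  "
            f"vram={reading['vram_used_mib']}/{reading['vram_total_mib']} MiB"
        )
        time.sleep(interval_s)
```

Calling log_gpu() from a second terminal while the workflow runs should show how long the GPU sits idle before the KSampler starts, and whether VRAM is close to full at that point. The timestamps also give a wall-clock total to compare against the time Comfy reports.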

Does anyone have any advice here? I've read about adding nodes to offload processing (I'm sure I'm saying this wrong, but I assume someone will know what I'm talking about), which could reduce the time to generate?

I appreciate any and all help!

Another noob here.

I also watched this tutorial and it's an interesting one, but I believe this workflow is slow by itself.

The guy from the tutorial says this workflow is modular.
Perhaps use that to your advantage and disable a couple of groups and try again?

Sorry, that’s all the feedback I can give.

Hope it works out for you!