As we all know, VRAM is very limited for most of us.
It's basically impossible to run an LLM and, say, Flux Dev in VRAM at the same time (even on an RTX 4090).
If you have your LLM and ComfyUI running on the same server, it's critical to be able to quickly and automatically free up VRAM.
Here's an example of how things should work to efficiently manage VRAM:
- Ask the LLM to generate an image prompt via Open-webui
- Once the prompt is generated, Open-webui automatically UNLOADS the LLM from VRAM via the Keep Alive setting (0 minutes)
- Send the freshly generated image prompt to ComfyUI via the Open-webui interface
- ComfyUI generates an amazing image
- ComfyUI should then AUTOMATICALLY UNLOAD its models from VRAM, freeing up the server for a new LLM request.
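For what it's worth, here is a minimal sketch of the two unload calls as I understand them: Ollama documents `keep_alive: 0` as "unload immediately after this request", and recent ComfyUI builds expose a `/free` endpoint for unloading models. The ports, model name, and the `/free` flags below are assumptions based on the defaults, so adjust for your setup:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434"   # default Ollama port (assumption)
COMFYUI_URL = "http://localhost:8188"   # default ComfyUI port (assumption)

def ollama_unload_payload(model: str) -> dict:
    # keep_alive: 0 tells Ollama to unload the model right after this request
    return {"model": model, "keep_alive": 0}

def comfyui_free_payload() -> dict:
    # Flags accepted by ComfyUI's /free endpoint on newer builds (assumption:
    # your ComfyUI version is recent enough to have this endpoint)
    return {"unload_models": True, "free_memory": True}

def post_json(url: str, payload: dict) -> None:
    # Plain-stdlib JSON POST, no extra dependencies
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    urllib.request.urlopen(req)

# After the image is generated, something like:
#   post_json(f"{OLLAMA_URL}/api/generate", ollama_unload_payload("llama3"))
#   post_json(f"{COMFYUI_URL}/free", comfyui_free_payload())
```

In theory a small script like this could run after each image generation, but it would obviously be nicer if Open-webui and ComfyUI did this automatically.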
Is this already possible?
If so, HOW?
If not, I really think it would make all our workflows far more efficient if this were possible.
Thanks