r/LocalLLaMA · · 1 min read

Pay attention: a few chats waiting in tray reserve 1GB VRAM for themselves.

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

If an application uses a Web-based interface and "hardware acceleration", it constructs its frame in VRAM and sometimes keeps it reserved even if the app is minimised.

On my Linux machine, Discord is the worst offender, reserving 450 MB VRAM. Steam takes 200 MB, Telegram 150 MB, and a few other apps top it up to 1 GB+.

If you are really squeezing something into VRAM, make sure to either close those apps or turn off "hardware acceleration" in their settings. But they would stutter a lot.

Also, it may make sense to have another browser with hardware acceleration turned off, and use it only when working with an LLM.

P.S. On Linux with Nvidia, I can get a list of VRAM gobblers with the command nvidia-smi.

submitted by /u/Barafu
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA