r/LocalLLaMA · June 6, 2026 · 1 min read

What are you running on 16 GB VRAM + 64 GB RAM?


I know this gets asked a lot, but I can only find threads that are at least a couple of months old, so I thought I'd ask to see what people are running these days.

I have an RTX 5080 and 64 GB of DDR5 RAM. What's the best I can run for coding? And for agentic workflows?

If you have a similar setup, I'd love to know which models you are running and at what quants. A llama.cpp command with your settings would be sweet too :)

submitted by /u/whatyathinkk
