What are you running on 16Gb VRAM + 64Gb Ram?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
I know this gets asked a lot, but I can only find threads that are at least a couple of months old, so I thought I'd ask to see what people are running these days.
I have an RTX5080 and 64Gb Ddr5 RAM. What's the best I can run for coding? And for agentic workflows?
If you have a similar setup I'd love to know what quants you are running of which models, and a llama.cpp command with your settings would be sweet too :)
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.