Added an old 2070 Super to my rig and I can't go back...worse, now I need more
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Context: I built a new system last November, before everything went to shit. I spent about 5k on a 5090, a 9800X3D and 96GB of RAM. For the last 2-3 months I've been working heavily on my local setup. Ditched Windows, went Ubuntu > Manjaro > CachyOS (now), and I'm basically building llama.cpp every day, running tests to find the optimal model quantizations, context sizes, best agent CLI + harness, etc. Most of you know the drill.
Now: I finally got around to taking my old PC apart. I saw the 2070 Super, dusted it off and put it in my new PC (just out of curiosity). LET ME TELL YOU: I was not ready for what 8GB of additional VRAM does to a mf. I can suddenly run Qwen3.6-27B at Q8_0 with a context of 144k (KV cache at q8_0 as well) and with MTP, and I still generate 40-70 tk/s. It's addicting! Now I'm looking at offers online for 5070 Tis and 3090s (because they are in the same ballpark price-wise). I mean, it's going to be the 3090 eventually, because I can't just pass on the extra 8GB of VRAM (24GB vs. 16GB), but again, I wasn't ready for this. Even a 2070 Super brings so much value if you have one lying around.
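For anyone wondering why 8GB makes such a difference, here is a rough back-of-the-envelope sketch of the VRAM math. The only hard number in it is that Q8_0 stores 32 values in 34 bytes (32 int8 values plus one fp16 scale). The attention layer count, KV head count and head dimension below are placeholder values, not taken from the real Qwen3.6-27B config, so swap in the numbers from your GGUF metadata. It also ignores compute buffers and whatever MTP needs on top.

```python
GIB = 2**30

def q8_0_bytes(n_values):
    """Storage for n_values in Q8_0: each block of 32 values takes 34 bytes."""
    return n_values * 34 / 32

def kv_cache_bytes(n_attn_layers, n_kv_heads, head_dim, n_ctx, bytes_per_elem):
    """K and V caches: one vector of n_kv_heads * head_dim per layer per token."""
    return 2 * n_attn_layers * n_kv_heads * head_dim * n_ctx * bytes_per_elem

# Weights: 27B parameters, treating every tensor as Q8_0 (a simplification).
weights = q8_0_bytes(27e9)

# KV cache at q8_0 for a 144k context.
# ASSUMPTION: the three architecture values are placeholders for illustration.
kv = kv_cache_bytes(
    n_attn_layers=16,   # hypothetical
    n_kv_heads=4,       # hypothetical
    head_dim=256,       # hypothetical
    n_ctx=144 * 1024,
    bytes_per_elem=q8_0_bytes(1),
)

total_gib = (weights + kv) / GIB
print(f"weights ~{weights / GIB:.1f} GiB, KV cache ~{kv / GIB:.1f} GiB, total ~{total_gib:.1f} GiB")
for name, budget_gib in [("5090 alone", 32), ("5090 + 2070 Super", 40)]:
    print(f"{name}: {budget_gib - total_gib:+.1f} GiB headroom before buffers")
```

The weights alone eat most of a 32GB card at Q8_0, so the context is what gets squeezed. That is where the extra card pays off.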
This experience was eye-opening: acceptable performance + bigger VRAM > amazing performance + smaller VRAM.