I hate to be this guy but: Any good, recent CODING models in the 70-80B range?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
- 3x 24GB vram.
- Qwen-coder-next is not bad. I'll continue to use it if you yell enough at me.
- I do a lot of front-end work, which develops rapidly, so the most recent the model the better.
- Larger than 80B and I'll have to sacrifice the decentish Q6 quant, or the minimum (for coding) 256k context.
- I do NOT believe that the latest 27-31B dense models can realistically beat an 80B model, even if I stomach the slowness, but change my mind.
- Slowness is an issue since I do NOT yolo. I micro-manage the heck out of the agent. It's actually more efficient than letting it rip, then having it rip again the next day because it had been climbing the wrong ladder.
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.