r/LocalLLaMA · · 1 min read

Qwen 3.6 coding choice–27B vs 35B quants

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

I've been using Qwen 3.6 35BA3B for a while in Q8_0 quant, KV Q8_0 as well. I'm trying to explore Qwen 2.6 27B. Any tips on which quant to use?

Context size is 262144

  1. Q4KM with full KV quant (fp16)

  2. Q6K with Q8_0 KV quant

  3. Stick with 35BA3B Q8_0, it's better.

View Poll

submitted by /u/siegevjorn
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA