Qwen3.6-35B-A3B vs Gemma4-26B-A4B
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Just wondering how are people's experience with both these models!
I've had some nice results with Qwen but Gemma4 runs so much faster here. I'm using a Radeon 9070 XT and always latest llama.cpp.
[link] [comments]
More from r/LocalLLaMA
-
Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job)
May 24
-
How I do use the recent llama.cpp native tools to do web rag a.k.a. web_fetch (or anything else for the matter) directly from inside the llama-server's webui
May 24
-
Why not dynamic active parameters (and other questions for the knowledgeable)
May 24
-
Choosing an abliterated version of Gemma 4 31B and 26B-A4B
May 24
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.