Gemma 4 12B: incompatible with opencode, or just awful at tool calling?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Yesterday I tried out Gemma 4 12B on a significant coding challenge, to compare it to prior results with Qwen models. I ran the 8-bit quant, so I'm not dumbing it down much at all.
Judging from the partial results, it seemed capable of grasping the task, but it burned far too much time and effort trying to successfully do basic tool calls. Over and over it would fail to specify "pattern" successfully to a "grep" tool, for instance, and the call would be rejected. Ultimately I interrupted it because it didn't feel like this was going to be productive.
Is opencode lacking in compatibility with Gemma 4 12B, or the other way around? Is there a harness with which people are seeing reliable tool calls from Gemma 4 12B?
Thanks!
[link] [comments]
More from r/LocalLLaMA
-
You guys were right - Qwen 3.6 35B IS good...and KV Cache DOES matter.
Jun 4
-
Run (your largest) local models from your iPhone
Jun 4
-
Nemotron 3 Ultra. 550 billion parameters, 55B active. 1 million context
Jun 4
-
I accidentally crippled my 4x RTX 3090 LLM rig with a hidden PCIe 2.0 x4 slot and fixing it doubled Mistral 128B performance
Jun 4
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.