r/LocalLLaMA · June 7, 2026 · 1 min read

Any smaller model than OmniCoder v2 9b that can appropriately and accurately tool call?

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Hate to ask a simple question, but I’ve looked around and I see plenty of smaller models that *can* tool call, but none of them seem to do so appropriately or agentically. Referring to this.

As a matter of fact I couldn’t seem to get any newer Qwen or Gemma models to tool call properly either, but to be fair I didn’t try very hard. OmniCoder seems to do it perfectly without prompting. Just wondering if there’s something smaller so I can hot load it quicker on my 12 GB RTX 3060.

submitted by /u/gavff64
[link] [comments]

Discussion (0)

No comments yet. Sign in and be the first to say something.

Discussion (0)

More from r/LocalLLaMA