Is there any model smaller than OmniCoder v2 9B that can tool call appropriately and accurately?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Hate to ask a simple question, but I’ve looked around, and while plenty of smaller models *can* tool call, none of them seem to do so appropriately or agentically. Referring to this.
As a matter of fact, I couldn’t get any of the newer Qwen or Gemma models to tool call properly either, but to be fair I didn’t try very hard. OmniCoder seems to do it perfectly without prompting. Just wondering if there’s something smaller so I can hot-load it more quickly on my 12 GB RTX 3060.