r/LocalLLaMA · · 1 min read

Best small model right now (~4B params) that is good with agentic tasks for personal assistant?

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Looking for suggestions. I have been experimenting with gemma-4-E2B and gemma-4-E4B but the tool calling has been not the best? My tasks are just things like:

  • Update calendar
  • Get my schedule
  • Send a WA message at 4PM

etc.

Any suggestions? If it helps, here are my server params:

./llama-server \ --host 0.0.0.0 \ --port 8080 \ -m ~/myp/models/google_gemma-4-E4B-it-Q8_0.gguf \ --temp 1.0 \ --top_p 0.95 \ --top_k 64 \ -c 65536 \ --flash-attn on \ -t 16 \ --ctx-checkpoints 4 \ --cache-ram 16384 \ --chat-template-file /home/lenny/myp/models/jinja/gemma4-improved.jinja \ -ngl 99

submitted by /u/BitGreen1270
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA