Best small model right now (~4B params) that is good with agentic tasks for personal assistant?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Looking for suggestions. I have been experimenting with gemma-4-E2B and gemma-4-E4B but the tool calling has been not the best? My tasks are just things like:
- Update calendar
- Get my schedule
- Send a WA message at 4PM
etc.
Any suggestions? If it helps, here are my server params:
./llama-server \ --host 0.0.0.0 \ --port 8080 \ -m ~/myp/models/google_gemma-4-E4B-it-Q8_0.gguf \ --temp 1.0 \ --top_p 0.95 \ --top_k 64 \ -c 65536 \ --flash-attn on \ -t 16 \ --ctx-checkpoints 4 \ --cache-ram 16384 \ --chat-template-file /home/lenny/myp/models/jinja/gemma4-improved.jinja \ -ngl 99
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.