Gemma 4 12B first coding agent test on a 4080 Super
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| Just threw the new Gemma 4 12B into VSCodium with the Pi Agent extension to see how it handles tools, and it nailed the test on the first try. I gave it a prompt to write a Python script that reads logs line-by-line, grabs the error modules, and dumps the counts to a JSON file. I also told it to make its own mock log data and run a live terminal test to verify the results. Instead of just spitting out a block of code for me to copy and paste, the agent actually went to work. It created the script, populated a dummy app.log file with a mix of random logs, opened up a terminal shell to run the code, and verified the output with zero bugs or path errors.
[link] [comments] |
More from r/LocalLLaMA
-
New Google Gemma 4 12B Claims Near-26B Performance - We Tested Both!
Jun 3
-
gemma-4-12b-it vs Qwen3.5-9B on shared benchmarks: Qwen is overall winner beating gemma in 5/8 benchmarks despite a smaller footprint
Jun 3
-
More Gemma 4 models incoming
Jun 3
-
Been a while since we had a Qwen-Coder. could use a 3.7 80B-8B
Jun 3
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.