Even Google still believes in small models for coding.
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
I've been meaning to post about this. The community has been pretty vocal in criticizing "vibe-coded" projects. I used to think the backlash was the real problem, but I've started getting annoyed by a lot of these posts myself: many are just tiny, hyper-specific tools with minimal impact. Still, I think the community and mods could create better spaces for sharing actual ideas and innovations so people can build on each other's work. A monthly mega-thread or a "top picks" roundup could help. I firmly believe that good, well-designed code fits the open-source, collaborative spirit of this community even (especially?) if it's vibe-coded.

That said, vibe coding with local models has huge potential. Even Google is now running hackathons for small models like Gemma 4 31B (see thumbnail). The hackathon is meant to celebrate their record inference speed of 1500 tokens per second, 50–100× faster than what we can do locally, but it's still telling that the big players see real value in small-model, AI-assisted software engineering.