Even Google still believes in small models for coding.
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| I've been meaning to post about this. The community has been pretty vocal in criticizing "vibe-coded" projects. I used to think the backlash was the real problem, but I've started getting annoyed by a lot of these posts myself — many are just tiny, hyper-specific tools with minimal impact. Still, I think the community and mods could create better spaces for sharing actual ideas and innovations so people can build on each other's work. A monthly mega-thread or "top picks" roundup or something like that could help. I firmly believe that good, well-designed code fits the open source collaborative spirit of this community even(specially?) if it's vibe-coded. That said, vibe coding with local models has huge potential. Even Google is now running hackathons for small models like Gemma 4 31B (see thumbnail). This is to celebrate their record inference speeds of 1500 tokens per second, 50–100× faster than what we can do locally, but it's still telling that the big players see real value in small-model AI-assisted software engineering. [link] [comments] |
More from r/LocalLLaMA
-
Palantir CEO rages against closed models
Jul 2
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
-
[Benchmark] Kimi K2.7 Code Q3 on Mac Studio M3 Ultra + RTX PRO 6000 over llama.cpp RPC: prefill improves, no changes in token generation/decode
Jul 2
-
They fit! Mostly.... 2x 3090, Thermaltake Core p3
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.