r/LocalLLaMA · June 30, 2026 · 3 min read

Vibe Coding / Agentic workflow

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Hey folks. I know that vibe coding is frowned upon pretty solidly here, and I get that, but I’m not a programmer. I just don’t realistically have the time to learn Python or C++ to the level I would need to build some of the things I’d like to create.

On a side note, I do believe that coding through natural language will be the inevitable outcome of AI adoption as the field grows and models get stronger.

My question is, what sort of workflows can you use to successfully vibe-code, using something like Qwen 27B Q8_0 and 128k context? I’ve tried a lot of different things.

My current workflow tends to go something like this: I give the LLM a plan, let’s say for a three.js stack game. I write a very in-depth plan covering the game’s structure, mechanics, and scope, something like a 6-8 paragraph document with lists and sub-lists, just how I would organize a project myself. I then let the LLM create a more granular version of the plan that includes the entire file and directory structure and the technical details on how to achieve the plan’s goals, plus a phase/task list that breaks down all the necessary building stages of the project.

In my last project, I gave instructions to use config files with templates for game objects. That way the LLM could write the game code in a more horizontal way, and I could come in behind it and add depth by defining game objects through the configs. This has worked for me previously in a word-based TUI RPG I vibe coded.
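A minimal sketch of that config-plus-template idea, in Python for brevity. Everything here (the `TEMPLATES` table, the `CONFIG_JSON` entries, the `GameObject` fields, and `build_objects`) is made up for illustration and is not the actual project code; in a real project the templates and entries would presumably live in separate config files.

```python
import json
from dataclasses import dataclass, field

# Hypothetical base templates: shared defaults for a kind of object.
TEMPLATES = {
    "enemy": {"hp": 10, "speed": 1.0, "tags": ["hostile"]},
}

# Hypothetical config entries: each one names a template and overrides
# only the fields that differ from it.
CONFIG_JSON = """
[
  {"id": "slime", "template": "enemy"},
  {"id": "wolf", "template": "enemy", "hp": 25, "speed": 2.5}
]
"""


@dataclass
class GameObject:
    id: str
    hp: int
    speed: float
    tags: list = field(default_factory=list)


def build_objects(config_text, templates):
    """Merge each config entry over its template and build the objects."""
    objects = {}
    for entry in json.loads(config_text):
        merged = dict(templates[entry["template"]])
        merged.update({k: v for k, v in entry.items() if k != "template"})
        # Copy the tag list so objects never share one mutable list.
        merged["tags"] = list(merged.get("tags", []))
        objects[merged["id"]] = GameObject(**merged)
    return objects


OBJECTS = build_objects(CONFIG_JSON, TEMPLATES)
```

The point of the split is that the engine code only ever needs to know about `GameObject`, so adding a new creature means adding one config entry rather than asking the model to touch game logic again.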

As the workflow continues, I have the LLM complete the task list in pieces while I babysit it, watching for loops and prompting the model to update the task list. I start a new session once the context starts getting too high.

The issue is I’m getting really sub-par results. For example, in the first phase of a build the controls don’t work, and a couple of sessions later the LLM still can’t diagnose its own three.js code to find the problem.

I understand that some people will tell me to just learn to code myself, but I see videos on here of the same LLMs one-shotting games that function substantially better than my well-planned projects do after 10-20 sessions.

What can I do to improve my workflow? Do I really have to commit to using frontier cloud models to come in behind and resolve problems in the code? These aren’t huge asks of my model compared to what I see some people ask. I tried getting my LLM to create a PI extension that uses a Python script to prompt the LLM to save its progress to memory and then start a fresh session with a given prompt when context gets too high, and it was completely unsuccessful. I attempted to debug it alongside the LLM over multiple sessions and finally scrapped the project.
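For reference, the core of that save-and-restart idea can be sketched independently of any harness. This is only a rough Python outline under stated assumptions: the roughly-4-characters-per-token estimate, the 75% rotation threshold, the JSON progress file, and all function names are hypothetical, and nothing here calls a real pi or llama.cpp API. Wiring it into an actual session would still be up to the harness.

```python
import json
from pathlib import Path

CONTEXT_LIMIT = 128_000  # the 128k context window mentioned in the post
ROTATE_AT = 0.75         # assumed: hand off at 75% of the window


def estimate_tokens(text):
    """Very rough token estimate (assumes about 4 characters per token)."""
    return len(text) // 4


def should_rotate(transcript, limit=CONTEXT_LIMIT, ratio=ROTATE_AT):
    """Return True once the session transcript nears the context limit."""
    return estimate_tokens(transcript) >= limit * ratio


def save_progress(path, done, todo, notes):
    """Write completed tasks, remaining tasks, and notes to a JSON file."""
    state = {"done": done, "todo": todo, "notes": notes}
    Path(path).write_text(json.dumps(state, indent=2), encoding="utf-8")
    return state


def build_handoff_prompt(path):
    """Build the opening prompt for a fresh session from the saved state."""
    state = json.loads(Path(path).read_text(encoding="utf-8"))
    lines = ["Continue the project from the saved state below.", "Completed tasks:"]
    lines += [f"- {task}" for task in state["done"]]
    lines.append("Remaining tasks:")
    lines += [f"- {task}" for task in state["todo"]]
    lines.append(f"Notes: {state['notes']}")
    return "\n".join(lines)
```

Keeping the state in a plain file rather than in the model's own summary means the handoff does not depend on the model remembering to save anything: the script decides when to rotate and what the next session sees first.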

I’m looking for advice. I’m running Ubuntu, llama.cpp, and the pi harness with 32 GB VRAM and 48 GB RAM. To anyone who managed to read all of this, thanks for chiming in. I’m sure I’m not the only one who has struggled with this. This might just be the limit of these small local models.

submitted by /u/UniqueIdentifier00