r/LocalLLaMA · May 30, 2026 · 1 min read

Fulloch V2: 100% Local Voice Assistant for Home Assistant & Obsidian (Runs on 16GB VRAM)

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Fulloch V2: 100% Local Voice Assistant for Home Assistant & Obsidian (Runs on 16GB VRAM)

Hey everyone, following up on my r/LocalLLaMA post from a while back, I have spent some time testing how far I can push my 5060ti as a personal voice assistant.

The stack is Qwen3.5-9B GGUF Q5_K_M, Qwen3-1.7B ASR, and Qwen3-1.7B TTS, delivering fast, real-time responses with acoustic barge-in and follow up for better conversations. On top of driving your Home Assistant, V2 now features agentic long-term memory and seamlessly integrates with your local Obsidian vault (or other markdown notes) to read, write and append notes (it won't delete or modify anything). Semantic search of your markdown notes is also available through voice search using the bge embedding model.

Public repo at https://github.com/liampetti/fulloch

I've linked a quick video demo showing the response speed, conversational, barge-in, and semantic note searching features through an included Chat UI.

It also has a bash/bat file for creating your own voices and you can add your own custom wakeword by just typing it into the config (no special wakeword models needed). Everything tested on Linux but Windows supported.

submitted by /u/liampetti
[link] [comments]

Discussion (0)

No comments yet. Sign in and be the first to say something.

Discussion (0)

More from r/LocalLLaMA