Fulloch V2: 100% Local Voice Assistant for Home Assistant & Obsidian (Runs on 16GB VRAM)
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| Hey everyone, following up on my r/LocalLLaMA post from a while back, I have spent some time testing how far I can push my 5060ti as a personal voice assistant. The stack is Qwen3.5-9B GGUF Q5_K_M, Qwen3-1.7B ASR, and Qwen3-1.7B TTS, delivering fast, real-time responses with acoustic barge-in and follow up for better conversations. On top of driving your Home Assistant, V2 now features agentic long-term memory and seamlessly integrates with your local Obsidian vault (or other markdown notes) to read, write and append notes (it won't delete or modify anything). Semantic search of your markdown notes is also available through voice search using the bge embedding model. Public repo at https://github.com/liampetti/fulloch I've linked a quick video demo showing the response speed, conversational, barge-in, and semantic note searching features through an included Chat UI. It also has a bash/bat file for creating your own voices and you can add your own custom wakeword by just typing it into the config (no special wakeword models needed). Everything tested on Linux but Windows supported. [link] [comments] |
More from r/LocalLLaMA
-
6x P40 running Minimax M2.7_Q3_XL
Jul 2
-
Fine-tuned Gemma-4-31B specifically for Copywriting & Creative Writing Tasks (Scored +290 Elo over base using EqBench3)
Jul 2
-
Gemma 4 WebGPU Kernels 255 tok/s by x/@xenovacom
Jul 2
-
openlumara, my manually coded super-token-efficient harness, now works across any UI that can connect to an openAI endpoint! koboldlite, openwebui, you name it. basically, openAI bridge. yay!
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.