Self-hosted STT better than Whisper Large V3 Turbo that matches AssemblyAI quality?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
I’m already using Whisper Large V3 Turbo self-hosted, but the accuracy still isn’t where I need it. I like AssemblyAI’s quality and want something self-hosted that:
- Is clearly better than Whisper Large V3 Turbo
- Can match or get close to AssemblyAI’s transcription quality
- Runs locally (no cloud API)
Is there a self-hosted model or stack that realistically beats Whisper Large V3 Turbo and gets close to AssemblyAI? Or is AssemblyAI’s own self-hosted offering the only real option at that quality level?
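For anyone who wants to make "clearly better" measurable: one common yardstick is word error rate (WER), scored against a hand-corrected reference transcript of your own audio. The sketch below is only an illustration in plain Python. The function name `word_error_rate` and the lowercase-and-split normalization are assumptions, not something any of the tools mentioned above prescribes. Real comparisons usually also strip punctuation and normalize numbers before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    # Assumed normalization: lowercase and split on whitespace only.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript is empty")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts, purely for illustration.
    reference = "please schedule the meeting for tuesday"
    candidates = {
        "model_a": "please schedule the meeting for tuesday",
        "model_b": "please schedule a meeting tuesday",
    }
    for name, text in candidates.items():
        print(name, round(word_error_rate(reference, text), 3))
```

Running every candidate over the same set of clips and comparing average WER gives a like-for-like number instead of a gut feeling.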