My reasons to run local models
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
- I can finetune any model on any dataset I want.
- I can use techniques like speculative decoding and other sota approaches to get the max tps
- The llm provides like anthropic and openai are not getting access to my data
- The hardware is reusable for vision text speech, and I can run any blend of models for free as much as I want
- I can curate any dataset/content that I want without worrying about the costs
- I like watching Dario go up in flames
[link] [comments]
More from r/LocalLLaMA
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
-
[Benchmark] Kimi K2.7 Code Q3 on Mac Studio M3 Ultra + RTX PRO 6000 over llama.cpp RPC: prefill improves, no changes in token generation/decode
Jul 2
-
They fit! Mostly.... 2x 3090, Thermaltake Core p3
Jul 2
-
Making LLMs Better at Creative Writing using Entropy
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.