Anyone still doing fine-tunes on consumer grade hardware?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Felt like there used to be a thriving fine-tuning community a few years back - and then once we started getting models that were smart enough and generalist enough (i.e. post Llama-3-8b era) things kind of dropped off a little. Less need for fine-tunes when prompt-tweaking can get you most of the way if your base is smart enough I suppose? I do miss it - felt like more or less every week I'd open this sub to find some new weird and wonderful thing going on with home brewed models trained on Unsloth or MLX or what-have-you
My gut says that there are still plenty of people doing this, and that the posts just don't surface as much as they used to lol
Bonus question; are there any other subs out there that are more dedicated to training models locally that I just haven't come across yet?
[link] [comments]
More from r/LocalLLaMA
-
What's in your RAG?
Jul 2
-
Palantir CEO rages against closed models
Jul 2
-
A cheap trick for reliable structured output: feed the validation error back into the retry
Jul 2
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.