Anyone still doing fine-tunes on consumer-grade hardware?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Felt like there used to be a thriving fine-tuning community a few years back, and then once we started getting models that were smart enough and generalist enough (i.e. the post-Llama-3-8b era) things kind of dropped off a little. Less need for fine-tunes when prompt-tweaking can get you most of the way if your base is smart enough, I suppose? I do miss it. It felt like more or less every week I'd open this sub to find some new weird and wonderful thing going on with home-brewed models trained on Unsloth or MLX or what-have-you.
My gut says that there are still plenty of people doing this, and that the posts just don't surface as much as they used to lol
Bonus question: are there any other subs out there more dedicated to training models locally that I just haven't come across yet?