KLD is flawed in abliteration.
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
I've noticed while creating my abliteration engine that KL is a flawed metric because it can be represented so many different ways, it depends completely on eval prompts, and lots of people use first token KL to make their models appear better than others. So I'm curious what do you guys think is the best way to measure the difference between an abliterated model and the base. Do you guys agree or disagree with me?
[link] [comments]
More from r/LocalLLaMA
-
Palantir CEO rages against closed models
Jul 2
-
A cheap trick for reliable structured output: feed the validation error back into the retry
Jul 2
-
SenseNova-U1-8b-MoT-Infographic-V2 (released yesterday) - An open source SOTA beast for infographic design and image editing.
Jul 2
-
[Benchmark] Kimi K2.7 Code Q3 on Mac Studio M3 Ultra + RTX PRO 6000 over llama.cpp RPC: prefill improves, no changes in token generation/decode
Jul 2
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.