r/LocalLLaMA · June 26, 2026 · 1 min read

KLD is flawed in abliteration.

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

I've noticed while creating my abliteration engine that KL is a flawed metric because it can be represented so many different ways, it depends completely on eval prompts, and lots of people use first token KL to make their models appear better than others. So I'm curious what do you guys think is the best way to measure the difference between an abliterated model and the base. Do you guys agree or disagree with me?

submitted by /u/AccountAntique9327
[link] [comments]

Discussion (0)

No comments yet. Sign in and be the first to say something.

Discussion (0)

More from r/LocalLLaMA