r/LocalLLaMA · · 1 min read

I'm still surprised on how good the kv quantization has become

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

I'm still surprised on how good the kv quantization has become

https://preview.redd.it/78b1nuc63f7h1.png?width=1164&format=png&auto=webp&s=e4b7202b92026083d470e340260165ff8503ee57

https://preview.redd.it/ryl4v2ym3f7h1.png?width=1167&format=png&auto=webp&s=9e429648a3582dcf6ac12b5286b437e64889a3a9

kv at q4_0 (even the drafter is q4_0 kv) and still manages to find the info accurately in a 100k context

https://preview.redd.it/txk7y4gibf7h1.png?width=823&format=png&auto=webp&s=309f68ad167607fe440e4ce13db940db091b482d

EDIT: as many pointed out that HP are probably training data here is the quote: "obscure knowledge of a 2026 book" and in italian that i bought

submitted by /u/DeepBlue96
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA