r/LocalLLaMA · · 1 min read

LM Studio finally added support for MTP Speculative Decoding

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

LM Studio finally added support for MTP Speculative Decoding

https://preview.redd.it/1uuzjm0ll72h1.png?width=923&format=png&auto=webp&s=1af7d7594be1e08ff7ad6797e2bc53e9410769a3

update to 0.4.14 Build 2 (Beta) and make sure your llama.cpp engine is 2.15.0

https://preview.redd.it/x0vdwjb3n72h1.png?width=742&format=png&auto=webp&s=6367de44208004d2f50194d78a542c46b040dceb

you also must select "Manually choose model load parameters" and enable MTP in those before loading the model it is NOT on by default

submitted by /u/pigeon57434
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA