LM Studio finally added support for MTP Speculative Decoding
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Update to 0.4.14 Build 2 (Beta) and make sure your llama.cpp engine is 2.15.0. You must also select "Manually choose model load parameters" and enable MTP there before loading the model. It is NOT on by default.
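If it helps, the checklist above can be written down as a tiny script. This is only a sketch: it does not talk to LM Studio at all, you type in the versions you see in the app yourself, and the names `parse_version` and `mtp_ready` are made up for illustration. It also assumes the versions in the post are minimums, which the post does not actually state.

```python
# Hypothetical helper: checks hand-entered values against the post's checklist.
# Nothing here queries LM Studio; all inputs are typed in by the user.

REQUIRED_APP = "0.4.14"      # LM Studio version named in the post
REQUIRED_ENGINE = "2.15.0"   # llama.cpp engine version named in the post


def parse_version(text: str) -> tuple[int, ...]:
    """Turn a dotted version like '2.15.0' into a comparable tuple (2, 15, 0)."""
    return tuple(int(part) for part in text.strip().split("."))


def mtp_ready(app_version: str, engine_version: str, mtp_enabled: bool) -> list[str]:
    """Return a list of unmet requirements; an empty list means all three are met."""
    problems = []
    # Assumption: newer versions than the ones in the post also work.
    if parse_version(app_version) < parse_version(REQUIRED_APP):
        problems.append(f"update LM Studio to {REQUIRED_APP} or newer")
    if parse_version(engine_version) < parse_version(REQUIRED_ENGINE):
        problems.append(f"update the llama.cpp engine to {REQUIRED_ENGINE} or newer")
    if not mtp_enabled:
        problems.append("enable MTP under 'Manually choose model load parameters'")
    return problems


if __name__ == "__main__":
    for item in mtp_ready("0.4.13", "2.15.0", mtp_enabled=False):
        print("TODO:", item)
```

Comparing tuples of integers rather than raw strings matters here, since a plain string comparison would rank "2.9.0" above "2.15.0".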