r/LocalLLaMA · June 13, 2026 · 1 min read

llama-launcher v1.3 release -> Bayesian Optimisation

#model-release #version-bump #developer-tool

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

llama-launcher v1.3 release -> Bayesian Optimisation

Hello everyone, some of you may have seen a post of mine from a few days ago about my app, llama-launcher, a lightweight point-and-click GUI to create llama-server commands without the constant need for typing them up. Well, I've just added an optimisation feature that uses Tree-Structured Parzen estimation through optuna's framework. It uses llama-server to tune a pre-determined set of parameters to try to squeeze the last bit of juice out of your system, completely hands-free. I've been using this to get the last bit of performance from my MTP models without having to sit at my desk tuning, loading, prompting, and unloading manually and repeatedly. So far, I've seen upto a 15% improvement in speeds (as seen in the images) versus baseline commands with no tuning with Gemma 12B MTP during testing. Without any human interaction at all during the optimisation process. It's still in it's early stages so there are many improvements to be made but any suggestions you may have please let me know.

You can check the repo out here: https://github.com/SolaryKryptic/llama-launcher

submitted by /u/Solary_Kryptic
[link] [comments]

Discussion (0)

No comments yet. Sign in and be the first to say something.

Discussion (0)

More from r/LocalLLaMA