r/LocalLLaMA · June 10, 2026 · 1 min read

Lemonade v10.7 release and project organization update

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Today's v10.7 release is the start of an exciting new chapter for the Lemonade project, so I thought I should share an project-level update.

Lemonade's roadmap and development is now driven by 6 working groups, 4 of which are led by non-AMDers. Here are highlights from 3 of the groups in the v10.7 release, which had 19 contributors.

Local Omni Models

True omni-modal chat, including image gen/editing, by seamlessly combining multiple backends and models. v10.7 makes these LMX-Omni virtual models compatible with Open WebUI and other OpenAI clients that support multimedia rendering.

Auto Tuning

Every system should get the best performance, without users worrying about optimizing flags. v10.7 kicks this off by adding the lemonade bench CLI tool, which collects apples-to-apples LLM performance data across llama.cpp, FastFlowLM, and vLLM.

Cross-Vendor Support

Lemonade has its best chance at its mission of advancing local AI if it gives a great experience on every platform. v10.7 adds CUDA backends for llama.cpp and stable-diffusion.cpp, as well as Vulkan for sd-cpp, with more to come.

As of v10.7, the LMX-Omni virtual models are now GPU accelerated on AMD, Apple Silicon, Nvidia, and Intel systems.

What's Next

You can check out the working group roadmaps here.

If you like what we're up to, please give me your feedback here, star the repo, and join the bi-weekly public meetings on the Lemonade Discord!

submitted by /u/jfowers_amd
[link] [comments]

Discussion (0)

No comments yet. Sign in and be the first to say something.

Local Omni Models

Auto Tuning

Cross-Vendor Support

What's Next

Discussion (0)

More from r/LocalLLaMA