Lemonade v10.7 release and project organization update
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Today's v10.7 release is the start of an exciting new chapter for the Lemonade project, so I thought I should share an project-level update.
Lemonade's roadmap and development is now driven by 6 working groups, 4 of which are led by non-AMDers. Here are highlights from 3 of the groups in the v10.7 release, which had 19 contributors.
Local Omni Models
True omni-modal chat, including image gen/editing, by seamlessly combining multiple backends and models. v10.7 makes these LMX-Omni virtual models compatible with Open WebUI and other OpenAI clients that support multimedia rendering.
Auto Tuning
Every system should get the best performance, without users worrying about optimizing flags. v10.7 kicks this off by adding the lemonade bench CLI tool, which collects apples-to-apples LLM performance data across llama.cpp, FastFlowLM, and vLLM.
Cross-Vendor Support
Lemonade has its best chance at its mission of advancing local AI if it gives a great experience on every platform. v10.7 adds CUDA backends for llama.cpp and stable-diffusion.cpp, as well as Vulkan for sd-cpp, with more to come.
As of v10.7, the LMX-Omni virtual models are now GPU accelerated on AMD, Apple Silicon, Nvidia, and Intel systems.
What's Next
You can check out the working group roadmaps here.
If you like what we're up to, please give me your feedback here, star the repo, and join the bi-weekly public meetings on the Lemonade Discord!
[link] [comments]
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.