r/LocalLLaMA

Orthrus (diffusion head) trained Qwen 3.5/3.6 and Gemma 4 models are dropping soon

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

"Hi all, we are finalized with our testing and are preparing the release pipeline. We will be releasing support for the Qwen3.5, Qwen3.6, and Gemma4 very soon. Alongside the model checkpoints, we will be open-sourcing our complete end-to-end training and evaluation code. Stay tuned, we are pushing the updates to the repository very shortly!"

https://huggingface.co/chiennv/Orthrus-Qwen3-8B

I don't think anyone is working on llama.cpp support yet.

submitted by /u/oxygen_addiction
