r/LocalLLaMA · · 1 min read

MLX engine comparison… and oMLX is the top choice.

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

MLX engine comparison… and oMLX is the top choice.

Just stumbled on this blog. A very interesting read if you are picking inference engine.

M5 Max 64GB with mlx-community/Qwen3.6-35B-A3B-4bit.

The MTPLX in the article use 3.6 27B so it's not apple to apple.

https://preview.redd.it/huxhasc4gx1h1.png?width=990&format=png&auto=webp&s=88cf7828b18eb8dea7a4c92c041f2b5c795f1824

https://preview.redd.it/fhevre6agx1h1.png?width=990&format=png&auto=webp&s=7bbc9aecbb5684aeeedf712e5a1017d0aab68fa7

https://www.largitdata.com/blog_detail/20260511

submitted by /u/Beamsters
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA