MLX engine comparison… and oMLX is the top choice.
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| Just stumbled on this blog. A very interesting read if you are picking inference engine. M5 Max 64GB with mlx-community/Qwen3.6-35B-A3B-4bit. The MTPLX in the article use 3.6 27B so it's not apple to apple. [link] [comments] |
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.