Anyone holding out for m5 ultra?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
-Faster memory
-M5 Ultra, have floating-point (FP) hardware built directly into the silicon at multiple levels.
-more cores
[link] [comments]
More from r/LocalLLaMA
-
G4-Meromero-31B-Uncensored-Heretic Is Out Now, a Finetune of Gemma 4 31B It Designed for Creative Tasks, With Kld of 0.0100 and 15/100 Refusals!
May 17
-
Ran the same models across Strix Halo, RTX 3090, and RTX 5070 because I wanted my own numbers
May 16
-
Anyone else running one of the pre-release branches of MTP support to maintain the higher speeds?
May 16
-
Now that MTP is merged... What's the best outputs you're getting on Qwen 3.6 35B on 2x3090s?
May 16
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.