For users have have both 6000 PRO MaxQ and Workstation Edition (or Server Edition), how much slower is the MaxQ vs the WS/SV on compute? (Prompt processing, Diffusion, etc)
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Hello guys, hoping you are doing fine!
I'm torn on the choice of either a RTX 6000 PRO MaxQ (on stock on Chile right now) or waiting 3~ months and get a RTX 6000 PRO Workstation Edition.
I have sold 3x5090 I purchased time ago near MSRP and got for one of these. I have a open case setup.
I have read on multiple places that tasks that depends only of bandwidth, like token generation, the difference is about -5 to -15% on the MaxQ vs the Workstation Edition (or Server Edition). I guess it makes sense since it has max 300W vs 600W.
But I haven't seen someone posting a difference on compute heavy tasks, like prompt processing or diffusion (txt2image, txt2video, etc). Only a comment from some months ago that mentions that is 50% slower: https://www.reddit.com/r/LocalLLaMA/comments/1t6ji0q/comment/oks3398/
EDIT: Found a comparison between SE 600W vs MaxQ and it seems to be indeed 50% faster: https://www.reddit.com/r/LocalLLaMA/comments/1pt9czu/comment/nvfkahn/
Does someone have a test or an actual difference between these 2 cards to make a final decision?
Thanks in advance!
[link] [comments]
More from r/LocalLLaMA
-
BitCPM-CANN: Native 1.58-Bit Large Language Model Training on Ascend NPU
May 24
-
GPU VRAM only for small models with llama.cpp: is it possible?
May 24
-
Qwen3.6-35B-A3B vs Gemma4-26B-A4B
May 24
-
Qwen Plays ̶p̶̶o̶̶k̶̶e̶̶m̶̶o̶̶n̶ ? / QWEN PLAYS DCSS! - qwen3.6-35b-a3b@q4_k_xl plays open source roguelike adventure DCSS (and does a decent job)
May 24
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.