r/LocalLLaMA · · 1 min read

For users have have both 6000 PRO MaxQ and Workstation Edition (or Server Edition), how much slower is the MaxQ vs the WS/SV on compute? (Prompt processing, Diffusion, etc)

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Hello guys, hoping you are doing fine!

I'm torn on the choice of either a RTX 6000 PRO MaxQ (on stock on Chile right now) or waiting 3~ months and get a RTX 6000 PRO Workstation Edition.

I have sold 3x5090 I purchased time ago near MSRP and got for one of these. I have a open case setup.

I have read on multiple places that tasks that depends only of bandwidth, like token generation, the difference is about -5 to -15% on the MaxQ vs the Workstation Edition (or Server Edition). I guess it makes sense since it has max 300W vs 600W.

But I haven't seen someone posting a difference on compute heavy tasks, like prompt processing or diffusion (txt2image, txt2video, etc). Only a comment from some months ago that mentions that is 50% slower: https://www.reddit.com/r/LocalLLaMA/comments/1t6ji0q/comment/oks3398/

EDIT: Found a comparison between SE 600W vs MaxQ and it seems to be indeed 50% faster: https://www.reddit.com/r/LocalLLaMA/comments/1pt9czu/comment/nvfkahn/

Does someone have a test or an actual difference between these 2 cards to make a final decision?

Thanks in advance!

submitted by /u/panchovix
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA