r/LocalLLaMA · · 1 min read

Cannot get NCCL test to run in docker with 2 x 6000 Pro connected x8 to AM4 CPU

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

nvidia-smi topo -m is showing the both GPU as PHB (i.e. via CPU) connected as expected but I cannot get NCCL all_reduce_perf to run at all, it always hangs after starting up. It seems that vllm won't work with TP=2 until I can fix this.

Is there any reason why this setup would not work (it's X570 based)?

TIA

submitted by /u/NaiRogers
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA