Strix Halo or GPUs?
I want to build my own AI server. I already have multiple servers at home, but none of them have GPUs, nor are they powerful enough to host 4B+ models.
I'd like to be able to host dense 27-30B-parameter models, or a MoE with ~3B active parameters.
Let's say I could spend about 2k. What would be the best route, and what token speeds should I expect?
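For a rough sense of the numbers involved: single-stream decoding is usually memory-bandwidth-bound, so a crude upper bound is tokens/s ≈ usable bandwidth ÷ bytes of weights read per token. Below is a minimal back-of-envelope sketch, assuming roughly 256 GB/s for Strix Halo's LPDDR5X and roughly 936 GB/s for a used RTX 3090; the bandwidth figures, the 60% efficiency factor, and Q4 quantization are all assumptions, not benchmarks.

```python
# Back-of-envelope decode-speed estimate. Decoding one token requires reading
# (roughly) all active weights once, so throughput is bandwidth-limited.
# All hardware numbers below are rough assumptions, not measured results.

def est_tokens_per_sec(active_params_b: float, bits_per_weight: float,
                       bandwidth_gb_s: float, efficiency: float = 0.6) -> float:
    """Crude upper-bound decode throughput from memory bandwidth."""
    bytes_per_token = active_params_b * 1e9 * bits_per_weight / 8
    return efficiency * bandwidth_gb_s * 1e9 / bytes_per_token

# Assumed bandwidths (GB/s): Strix Halo ~256, used RTX 3090 ~936.
for name, bw in [("Strix Halo", 256), ("RTX 3090", 936)]:
    dense = est_tokens_per_sec(27, 4, bw)  # 27B dense model at Q4
    moe = est_tokens_per_sec(3, 4, bw)     # MoE with ~3B active params at Q4
    print(f"{name}: ~{dense:.0f} tok/s (27B dense), ~{moe:.0f} tok/s (3B-active MoE)")
```

Under these assumptions the sketch lands around 10 tok/s for a 27B dense model on Strix Halo versus ~40 tok/s on a 3090, with MoE models several times faster on both; real numbers depend on the runtime, context length, and quantization.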