2x RX 9060xt 16gb, is it worth it?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
I'm planning to buy 2x RX 9060xt with 16gb each to run Qwen 3.6 27B and alike. Would it be a good investment? How much tk/s should i expect in generation and prefill? I'm planning to use this as a coding agent in a large codebase.
Currently I'm running this on my i7 64gb laptop and I'm getting 3~4 tk/s with MTP and ~50 tk/s prefill. The generation speed is kind of ok, but 50 tk/s prefill is just unusable in my use case... Every read tool call i have to wait 1~2min just for the prefill
[link] [comments]
More from r/LocalLLaMA
-
Been running Qwen3.6-27B through a 3-critic harness. The harness matters more than I thought
Jun 30
-
I Hate Dario Amodei, and everything he stands for.
Jun 29
-
Introducing LongCat-2.0 - , a large-scale MoE language model with 1.6 trillion total parameters and ~48 billion activated per token. This was the stealth model that was on Openrouter under the name 'owl-alpha'.
Jun 29
-
Krea-2-Turbo Image Model - Easy to be fully uncensored, but it can also EDIT Images!
Jun 29
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.