r/LocalLLaMA · · 1 min read

Qwen 3.6 27B MTP speed on 3080ti (getting 4.5 t/s)

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Using LM Studio with 3080ti (12gb of VRAM) and 128gb of ddr4.

Model version: Qwen 3.6 27B MTP UD q4_k_xl

Is this my hardware limit?

Is there anyway to speed this up using the current hardware?

submitted by /u/yehiaserag
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA