r/LocalLLaMA · May 24, 2026 · 1 min read

Qwen 3.6 27B MTP speed on 3080ti (getting 4.5 t/s)

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Using LM Studio with 3080ti (12gb of VRAM) and 128gb of ddr4.

Model version: Qwen 3.6 27B MTP UD q4_k_xl

Is this my hardware limit?

Is there anyway to speed this up using the current hardware?

Discussion (0)

No comments yet. Sign in and be the first to say something.