Qwen3.6 27B quants
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| I have a project (picture) entirely made with bartowski 27B IQ3 XXS, turbo3 (and some parts with unsloth IQ3 XXS turbo4 when MTP became available). I've read so many arguments around minimum quantization to still get good quality, that I went ahead an made a small test to get some peace of mind. Am I missing big architectural and code quality advantages using this low quant model? Wouldn't be better to take some more time and get responses from a stronger tier? So I made a simple request/prompt: I put this through the Qwen3.6 27B (unsloth) in two variants (5070ti 16Gb):
I then use the same model to make a comparison table with the differences between both plans. Not to bother you with the full table results, I'm just going to leave here the final conclusion the model put after the table: _______________
_______________ My take: IQ3 XXS is good enough (I would say very good) for ordinary coding tasks - if you only have 16Gb, you won't be missing all that much: good judgement and good prompt are way more important on a project like this [link] [comments] |
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.