Qwen3.6-35B-A3B-Uncensored-Genesis-APEX-MTP
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
Here model: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Genesis-V2-APEX-MTP-GGUF
Safetensors: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Genesis-V2-FP8-Safetensors
Testing results in Open Code on hardware (Beelink gtr9 pro + Strix Halo) done by my friend on Q8_K_P - MTP quant:
- 5 sessions with 200k context, not a single glitch, no loops, no repeated tool calls.
- After 120k tokens he suddenly gave another task that doesn't intersect with what it was doing at all, and it calmly picked up and solved it correctly.
- Uncensored with MTP support with APEX quantization.
Recommended quant: APEX, MTP-APEX
Recommended settings for LM Studio:
Or use this minimal string as the first line:
You are Qwen, created by Alibaba Cloud. You are a helpful assistant.
Then add anything you want after. Model may underperform without this first line.
Settings:
| Parameter | Value |
|---|---|
| Temperature | 0.7 |
| Top K Sampling | 20 |
| Presence Penalty | 1.5 |
| Repeat Penalty | 1.0 |
| Top P Sampling | 0.8 |
| Min P Sampling | 0 |
| Seed | 42 |
Enjoy 😄
[link] [comments]
More from r/LocalLLaMA
-
TTS Benchmark Comparison (all known TTS up until May 2026)
May 24
-
Anyone down to test this? Just uploaded a model using rys
May 24
-
Vision-capable LLMs vs. OCR for long-document (including charts, images, tables, etc.) QA
May 24
-
Is there any reason for an uncensored model if you have no interest in roleplaying?
May 24
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.