r/LocalLLaMA · · 1 min read

Qwen3.6-35B-A3B-Uncensored-Genesis-APEX-MTP

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Here model: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Genesis-V2-APEX-MTP-GGUF

Safetensors: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Genesis-V2-FP8-Safetensors

Testing results in Open Code on hardware (Beelink gtr9 pro + Strix Halo) done by my friend on Q8_K_P - MTP quant:

  1. 5 sessions with 200k context, not a single glitch, no loops, no repeated tool calls.
  2. After 120k tokens he suddenly gave another task that doesn't intersect with what it was doing at all, and it calmly picked up and solved it correctly.
  3. Uncensored with MTP support with APEX quantization.

Recommended quant: APEX, MTP-APEX

Recommended settings for LM Studio:

System Prompt

Chat Template

Chat Template Thinking

Or use this minimal string as the first line:

You are Qwen, created by Alibaba Cloud. You are a helpful assistant.

Then add anything you want after. Model may underperform without this first line.

Settings:

Parameter Value
Temperature 0.7
Top K Sampling 20
Presence Penalty 1.5
Repeat Penalty 1.0
Top P Sampling 0.8
Min P Sampling 0
Seed 42

Enjoy 😄

submitted by /u/EvilEnginer
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA