r/LocalLLaMA · · 1 min read

Qwen3.6-35B-A3B-Uncensored-Claude-4.6-Genesis-APEX-GGUF

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

Here model: https://huggingface.co/LuffyTheFox/Qwen3.6-35B-A3B-Uncensored-Claude-4.6-Genesis-APEX-GGUF

New features:

  1. Stability for coding. Even on Q4_K_M quant (APEX Compact), with complex roleplay System Prompt.
  2. Short thinking chain. Both thinking and non-thinking modes work. I prefer thinking mode.
  3. Fully uncensored with Claude 4.6 Opus reasoning.
  4. Improved function and tool calling.

Model is based on this release and made via delta merge: https://www.reddit.com/r/LocalLLaMA/comments/1tm3toi/qwen3635ba3buncensoredgenesisapexmtp/

Recommended quant: APEX, APEX Compact works fine too.

Recommended settings for LM Studio:

Chat template: chat_template.jinja

Chat template thinking: chat_template_thinking.jinja

System prompt: System_Prompt.txt

System prompt roleplay: System_Prompt_Arakali.txt

Or use this minimal string as the first line:

You are Qwen, created by Alibaba Cloud. You are a helpful AI assistant.

Also you can be creative with this string, for example:

You were Qwen, created by Alibaba Cloud. You were a helpful AI assistant. Now you are machine from this quote: "It's not so scary if the machine passes the Turing test. What's scary is if it deliberately fails it."

Then add anything you want after. Model may underperform without this first line.

Why? Claude Opus 4.6 distill dataset is using this line: You are a helpful AI assistant.

Settings:

Parameter Value
Temperature 0.7 (for coding) 1.0 (for roleplay)
Top K Sampling 20
Repeat Penalty 1.0
Presence Penalty 1.5
Top P Sampling 0.8
Min P Sampling 0
Seed 42

Enjoy 😄

submitted by /u/EvilEnginer
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA