r/LocalLLaMA · 1 min read

SIQ-1 Qwen3.6 for autoresearch and autonomous agency

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

I took Qwen-35B-A3 and trained it with PPO on a verifiable reward, and honestly this is the first time I've ever seen PPO actually pull its weight.
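
For anyone who hasn't seen the setup before, here is a rough sketch of what "PPO with a verifiable reward" usually means: a programmatic checker scores each output, and the policy is updated with PPO's clipped surrogate objective. This is a generic toy illustration, not the SIQ-1 training code. The exact-match checker, the mean-reward baseline (standing in for a learned value function), the clip value, and all names are assumptions.

```python
import math

def verifiable_reward(answer: str, expected: str) -> float:
    """Hypothetical checker: 1.0 on an exact match, else 0.0."""
    return 1.0 if answer.strip() == expected.strip() else 0.0

def ppo_clip_loss(logp_new, logp_old, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate loss (a quantity to minimize)."""
    total = 0.0
    for ln, lo, adv in zip(logp_new, logp_old, advantages):
        ratio = math.exp(ln - lo)  # pi_new / pi_old for this sample
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return -total / len(advantages)

# Toy batch of two sampled answers to the same prompt (made-up values).
rewards = [verifiable_reward("42", "42"), verifiable_reward("41", "42")]
# Assumption: mean reward as a baseline instead of a learned value function.
baseline = sum(rewards) / len(rewards)
advantages = [r - baseline for r in rewards]
loss = ppo_clip_loss([-1.0, -2.0], [-1.2, -1.9], advantages)
```

The clipping is what keeps a single high-reward sample from pushing the policy too far in one update; in a real run the log-probabilities come from the model and the loss is backpropagated through them.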

So:
On karpathy/autoresearch for parameter-golf, it beats GLM-5.2 and Qwen-350B, and the ideas it spits out feel Opus4.8-like.
On bullshit-bench, it beats NEX and GPT-5.5.

Model + GGUF: https://huggingface.co/AlexWortega/SIQ-1-35B
Agent and demo to play on ZeroGPU: https://huggingface.co/spaces/AlexWortega/hermes-agent-zerogpu

submitted by /u/Mysterious_Hearing14