r/LocalLLaMA · · 1 min read

I scaled test-time compute for Qwen-3.6-27B and Gemma-4-31B to surpass Claude Mythos in code optimizations and speedups.

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

I scaled test-time compute for Qwen-3.6-27B and Gemma-4-31B to surpass Claude Mythos in code optimizations and speedups.

The scaffold uses ~25-40x more compute on the original baseline model to attempt the same problem. I put it into max mode by setting the branches exploration breadth to 5, iterative corrections loop depth to 10 and 6 branch aware selective hypothesis that are revised after every 2 iterations. These hypotheses tests various claims, local speedups or completely different algorithmic designs independently and are selectively injected in a specific branch context. The most useful component of this entire system is solution pool which adds structured noise to the iterative corrections loop so that the LLMs don't get stuck in the local minima. All the agents have access to python environment so they can instantly check up their work programmatically and see if their ideas are actually organic and a real improvement.

Because both these models (Gemma & Qwen) don't have stable reasoning over long context windows, the performance actually starts dropping significantly at iteration 4 and 5, or after the PQF update, in the iteration 9 and 10. Like these are genuine regressions, we can't stop at say iteration 3 because sometimes the updated/evolved branch has more chances of doing better than all other branches so far. Can't do memory bank distillation after every 3 iterations either because that'd be too narrow search (and frontier LLMs do well in that). So I gave them branch history separately and asked them to judge and pick the most performing/optimized candidate in each branch and then select the best one from each and give it to the final judge.

Original Paper Link: https://arxiv.org/abs/2605.15222

Github repo link for this scaffold: https://github.com/ryoiki-tokuiten/Iterative-Contextual-Refinements

submitted by /u/Ryoiki-Tokuiten
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA