How can you stop your model from looping
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
So i thought this is a small model issue but when i added a new gpu and i am able to run low mid model like Qwen 3.6 35b q4 or q5 this issue still exists now its not as much as small model but it does break when linking the model to copilot chat or Hermes the model mid task will start loop thinking or looping generating more than 40k token or generating a wrong tool call
[link] [comments]
More from r/LocalLLaMA
-
AMD Powers Next-Generation Agent Computers with New Ryzen AI Halo Developer Platform and Ryzen AI Max PRO 400 Series Processors
May 21
-
Qwen3.6 27B and llama.cpp appreciation post
May 21
-
Same task in github-copilot, pi, claude-code, and opencode with Qwen3.6 27B
May 21
-
Training a vision model from scratch on iPod touch 4 images
May 21
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.