r/LocalLLaMA

Does llama.cpp split mode tensor cause issues?

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

I split Qwen 27B and Gemma 4 26B (MoE) across a 5080 and 2x 5060 Ti. I noticed that setting split mode to tensor causes looping in OpenCode, either during tool calls or within the reasoning traces. Has anyone else seen this, or does anyone understand why? Split mode layer seems to work fine.
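One way to make the "looping" report easier to compare between the two split modes is to measure it on saved output rather than eyeball it. Here is a minimal Python sketch, assuming you have dumped the raw generated text (tool-call turns or reasoning traces) from each run to a string. The function name `find_loop` and its thresholds are made up for illustration and are not part of llama.cpp or OpenCode.

```python
from typing import Optional, Tuple


def find_loop(text: str, min_period: int = 20, min_repeats: int = 3) -> Optional[Tuple[int, int]]:
    """Return (start, period) for the first block of at least `min_period`
    characters that repeats back-to-back at least `min_repeats` times,
    or None if no such block exists. Hypothetical helper for illustration."""
    n = len(text)
    for period in range(min_period, n // min_repeats + 1):
        run = 0  # consecutive positions where text[i] == text[i + period]
        for i in range(n - period):
            if text[i] == text[i + period]:
                run += 1
                # `min_repeats` adjacent copies give (min_repeats - 1) * period matches
                if run >= period * (min_repeats - 1):
                    return i + 1 - run, period
            else:
                run = 0
    return None


# Made-up sample strings standing in for saved model output.
healthy = "The model reads the file, edits one function, runs the tests, and reports the result."
looping = "Plan:\n" + "I should call the read tool again. " * 4

print("healthy:", find_loop(healthy))
print("looping:", find_loop(looping))
```

Running the same prompts with the same seed and sampling settings under split mode layer and split mode tensor, then passing both outputs through a check like this, would at least show whether the loops appear only in tensor mode or are also sensitive to sampling.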

submitted by /u/MapSensitive9894