So qwen3.7-4b when?
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.