God dammit Qwen
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| I guess it's my fault for being an idiot. [link] [comments] |
More from r/LocalLLaMA
-
G7 agrees on shared language around open-source AI and open weights AI
May 31
-
I ported NVIDIA Parakeet (speech-to-text) to ggml: same output as NeMo, faster, GGUF-quantized, no Python
May 31
-
What's this sub geebral opinion on quantisizing the KV cache
May 31
-
Whats actually happening when a model spills out of VRAM into system memory?
May 31
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.