r/LocalLLaMA
r/LocalLLaMA community 3d ago
vulkan: make TP viable by pwilkin · Pull Request #25051 · ggml-org/llama.cpp
The legend Piotr has taken a pass at making Vulkan tensor parallelism (TP) somewhat usable. Really looking forward to seeing this evolve. Submitted by /u/TKGaming_11
11 -
r/LocalLLaMA community 3d ago
Local LLM Peeps
I am 80% done with a harness that works with both local models and APIs but is local-first. The harness has some interesting logic around multiple agents, which I'm holding back on until it is open source on GitHub. I have been local for 6 months and built out EVERYTHING I could think of to…
28 -
r/LocalLLaMA community 3d ago
Made an interactive explainer about speculative decoding/MTP
Submitted by /u/undefdev
36 -
r/LocalLLaMA community 4d ago
US Govt to individually approve who gets GPT 5.6.
Submitted by /u/AtlanticHM
16 -
r/LocalLLaMA community 4d ago
Ornith-1.0 released on Hugging Face
Includes 9B Dense, 31B Dense, 35B MoE, and 397B MoE models, with reported SOTA results on various benchmarks (let's see if this holds). https://huggingface.co/collections/deepreinforce-ai/ornith-10 Submitted by /u/paf1138
26 -