r/LocalLLaMA
500 articles archived · Visit source ↗ · RSS
-
-
-
-
r/LocalLLaMA community 14d ago
Finally - 4xRTX 5060TI
nvtop showing clocks and PCIe speed while running gpu_burn I wrote a while ago about my plans to put together a quad 5060ti 16gb based system after finding them nicely discounted. Everything got delayed due to issues with CPU seating (damn re-used stock cooler with plastic push…
32 -
r/LocalLLaMA community 14d ago
Reason to run local agents instead #645
  submitted by   /u/ToastFetish [link]   [comments]
18 -
r/LocalLLaMA community 14d ago
Stop using Ollama
  submitted by   /u/zxyzyxz [link]   [comments]
12 -
r/LocalLLaMA community 14d ago
Local VibeCoding is a lot of fun..
Hi everyone! I don’t consider myself a professional, even though my current position is officially called "programmer." I’ve been writing code for many years, using different languages and technologies, most of which I’ve already forgotten) I decided to put together (to…
37 -
-
-
r/LocalLLaMA community 14d ago
About the Rio model
As a Brazilian, I was proud that a Brazilian team was capable to bring innovation and a useful model to the table. It was a cold water bath what came next with the wrong model uploaded. ​ That is a chance that it is real and it would be a major improvement for local AI. I…
18 -
-
-
-
-
-
-
r/LocalLLaMA community 15d ago
What's the lesson chat?
  submitted by   /u/ill_be_productive [link]   [comments]
22 -
r/LocalLLaMA community 15d ago
moar QAT stuff and hairy ticks
tldr; finally got to a point where we can publish some of the ggufs with a more accurate process. in these repos: https://huggingface.co/idkwhattoputherenow/gemma-4-12B-it-qat-q4_0-maxerr https://huggingface.co/idkwhattoputherenow/gemma-4-31B-it-qat-q4_0-maxerr this is a…
27 -
-
-
-
-
r/LocalLLaMA community 15d ago
EAGLE support merged into llama.cpp
  submitted by   /u/Diablo-D3 [link]   [comments]
18 -
-
-
r/LocalLLaMA community 15d ago
Voice-to-voice chatbot update
I've been working on this after hours for a few months continuously improving it. Now at a point where the chatbot is close to real-time (thanks to SSE streaming) and also interruptible while preserving context of what was last said. 100% local and powered by Qwen3.5-397B…
33 -
r/LocalLLaMA community 15d ago
Nex claims Rio 3.5 is Nex 2.5 PRO in trench coat
  submitted by   /u/Specter_Origin [link]   [comments]
18 -