Suggestion - this sub should have post flairs that mention the amount of vram/unified ram
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
The amount of fast ram is the single most important factor for llm use.
There are lots of people that run setups with massive amounts of ram. Reading a post about how model X performs, it'd really help to know the kind of setup being used, otherwise its not relevant for a lot of people.
It will also allow easy filtering of posts relevant to the hardware you have, right now thats very hard to do.
[link] [comments]
More from r/LocalLLaMA
-
I implemented KVarN in my llama.cpp fork and ran KLD benchmarks. It's promising!
Jun 5
-
[NEW MODEL] SupraLabs just released a new model! - Supra-50M-Reasoning
Jun 5
-
Bringing Gemma 4 12B to your Laptop: Unlocking Local, Agentic Workflows with Google AI Edge
Jun 5
-
What is your current go-to stack for running a fully local AI agent?
Jun 5
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.