Jetson Orin NX Build for Hermes Agent + Benchmarking
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| I had a huge LLM server, and now I have a tiny one! I had a Jetson Orin NX gathering dust from a long dead robotics project, from back in the Llama-7B days. I figured now with MoE and smaller models doing well, it was time to mess with it again. Goal:
With those constraints, I had to take a hacksaw to the stock heatsink and make a new case. Then I tested way too many models (the expected, Gemma-4's and Qwen 3.6's), but with too many quant variations. It's all written up in the blog! TL;DR: Gemma 4 26B A4B UD Q2_K_XL gives:
Hope this comes in handy! [link] [comments] |
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.