r/LocalLLaMA · · 1 min read

HRM 1B

Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.

HRM 1B

HRM 1B Base model (not Instruct).

The authors have released the training code in their Github (https://github.com/sapientinc/HRM-Text) and claim some wild things in their paper (https://arxiv.org/pdf/2605.20613):

- "Despite utilizing roughly 100-900x fewer training tokens and 96-432x less estimated compute than standard baselines, HRM-Text performs competitively with 2–7B parameter open models."

- The 1B model can be trained in 16 H100s (x2 nodes) in about 46 hours with ~$1472).

From a quick look, training seems as a combination of pretraining and instruction tuning, so the model can be prompted to function a bit like a chatbot.

I believe it would be very interesting to see how the model would function after undergoing SFT+RL. TBH, I don't quite understand the limitations of this particular architecture.

submitted by /u/pol_phil
[link] [comments]

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from r/LocalLLaMA