The frontier reasoning race is starting to look like a crowded subway station
Mirrored from r/LocalLLaMA for archival readability. Support the source by reading on the original site.
| We went from chasing GPT4 to looking at graphs with GPT5.4 xhigh, Gemini 3.1Pro, and now Hy3 preview completely shaking up the leaderboard. Look at that CHSBO 2025 chart Hy3 preview scoring 87.8 over Gemini and GPT. What a time to be alive, but honestly, my brain can't keep up with the version numbers anymore. What's your take? Is Hy3 actually punching at this level in real-world coding/math, or is it just benchmark hardening? [link] [comments] |
More from r/LocalLLaMA
-
Local LLMs on Refurb M4 Max vs new M5 Max
May 28
-
Gemma-4-Harmonia-31B-Uncensored-Heretic Is Out Now, a Merge of Multiple gemma-4-31B-it Finetunes Designed for a Targeted Approach to Deep Neural Consolidation, Minimizing Regression While Amplifying Unique Capability Boundaries. With KLD 0.0047 and 9/100 Refusals!
May 28
-
Vulnerability found in framework used by VLLM, many MCP servers, and other LLM tools
May 28
-
CrankGPT by Squeez Labs - hand-cranked edge AI - talk about local AI!!!
May 27
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.