Verbosity is not faithfulness: an architectural argument that reasoning models cannot perform faithful inference [D]
Mirrored from r/MachineLearning for archival readability. Support the source by reading on the original site.
Essay argues that reasoning models cannot perform faithful inference because their reasoning trace and final answer come from the same operation. Engages with Lanham/Turpin/Mirzadeh in empirical critique, and with HRM, TRM, GRAM, AlphaProof, and Kona/Aleph as the contrasting architectural lineage.
Curious what this subreddit makes of the constraint-vs-influence framing.
[link] [comments]
More from r/MachineLearning
-
[P] have a couple technical questions for my LLM router. [P]
May 26
-
Added a Chrome Dino-style game to my research tool's pipeline wait screen driven by real SSE events [P]
May 26
-
[D] Dlib or pytorch to CNN? [D]
May 26
-
[P] Built a portable GPU ISA after reading too many architecture manuals [P]
May 26
Discussion (0)
Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.
Sign in →No comments yet. Sign in and be the first to say something.