abs/2506.21734

水木社区手机版

主题:abs/2506.21734
楼主|tgfbeta|2025-09-11 16:57:49|只看此ID
Hierarchical Reasoning Model

HRM executes sequential reasoning tasks in a single forward pass without explicit supervision of the intermediate process, through two interdependent recurrent modules: a high-level module responsible for slow, abstract planning, and a low-level module handling rapid, detailed computations. With only 27 million parameters, HRM achieves exceptional performance on complex reasoning tasks using only 1000 training samples. The model operates without pre-training or CoT data, yet achieves nearly perfect performance on challenging tasks including complex Sudoku puzzles and optimal path finding in large mazes. Furthermore, HRM outperforms much larger models with significantly longer context windows on the Abstraction and Reasoning Corpus (ARC), a key benchmark for measuring artificial general intelligence capabilities
--
修改:tgfbeta FROM 221.198.64.*
FROM 221.198.64.*
1楼|db16122|2025-09-22 15:00:51|只看此ID
deepseek?
--
FROM 112.54.232.*

BYR-Team©2010. KBS Dev-Team©2011 登录完整版