HRMACT Sudoku Solver

This repository hosts checkpoints for the Hierarchical Reasoning Model with Adaptive Computation Time (HRMACT), trained on Sudoku puzzles.

  • Training steps: 3,000
  • Batch size: 512
  • Total training time: ~18 hours
  • Checkpoint format: .safetensors
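
Because the checkpoints are stored as .safetensors files, their contents can be listed without loading any weights. The sketch below reads only the file header using the Python standard library, relying on the published safetensors layout (an 8-byte little-endian header length followed by a JSON header). The function names `read_safetensors_header` and `summarize` are illustrative and are not part of this repository.

```python
import json
import struct


def read_safetensors_header(path):
    """Return the JSON header of a .safetensors file as a dict."""
    with open(path, "rb") as f:
        # First 8 bytes: little-endian unsigned 64-bit header length.
        (header_len,) = struct.unpack("<Q", f.read(8))
        # Next header_len bytes: UTF-8 JSON describing every tensor.
        return json.loads(f.read(header_len).decode("utf-8"))


def summarize(header):
    """Map tensor name -> (dtype, shape), skipping the optional metadata entry."""
    return {
        name: (info["dtype"], tuple(info["shape"]))
        for name, info in header.items()
        if name != "__metadata__"
    }
```

Calling `summarize(read_safetensors_header(path))` on a downloaded checkpoint file should give a quick overview of parameter names and shapes; the actual tensor names depend on the training code and are not documented here.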

📌 Note: The original HRM paper recommends training for 10,000+ steps for best performance. These checkpoints are intended as a lightweight, educational reference.


🔗 Code & Usage

For the full implementation, training pipeline, and evaluation tools, see the main repo: 👉 ZoneTwelve/HRM-Sudoku

To run evaluation on a checkpoint:

python evaluate.py <checkpoint>

πŸ” Evaluation

Evaluation is performed with evaluate.py. Each checkpoint is evaluated 100 times per difficulty level to estimate its average accuracy.
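
How evaluate.py decides whether a prediction counts as solved is not described here. As a minimal sketch of one plausible criterion, the hypothetical function below accepts a predicted grid only if every row, column, and 3x3 box contains the digits 1 to 9 exactly once and, when the original puzzle is supplied, all given clues are preserved. The grid representation (lists of ints, 0 for blanks) is an assumption made for illustration.

```python
def is_valid_solution(grid, puzzle=None):
    """Return True if grid is a complete, valid 9x9 Sudoku solution.

    grid: 9 rows of 9 ints in 1..9 (assumed representation).
    puzzle: optional 9x9 grid with 0 for blanks; givens must be preserved.
    """
    digits = set(range(1, 10))
    if len(grid) != 9 or any(len(row) != 9 for row in grid):
        return False
    rows = [set(row) for row in grid]
    cols = [set(grid[r][c] for r in range(9)) for c in range(9)]
    boxes = [
        set(grid[r][c] for r in range(br, br + 3) for c in range(bc, bc + 3))
        for br in (0, 3, 6)
        for bc in (0, 3, 6)
    ]
    # Every row, column, and box must contain exactly the digits 1..9.
    if any(unit != digits for unit in rows + cols + boxes):
        return False
    if puzzle is not None:
        # Each clue in the puzzle must be unchanged in the solution.
        return all(
            puzzle[r][c] in (0, grid[r][c])
            for r in range(9)
            for c in range(9)
        )
    return True
```

Under this criterion a grid with even one wrong cell counts as a failure, which would be consistent with the low scores on harder difficulties, though the real script may score differently.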

Example format for results:

Model            very-easy   easy     medium   hard     extreme
checkpoint-1     0.00%       0.00%    0.00%    0.00%    0.00%
checkpoint-250   0.00%       0.00%    0.00%    0.00%    0.00%
checkpoint-500   0.00%       0.00%    0.00%    0.00%    0.00%
checkpoint-750   0.00%       0.00%    0.00%    0.00%    0.00%
checkpoint-1000  2.00%       0.00%    0.00%    0.00%    0.00%
checkpoint-1250  15.00%      4.00%    0.00%    0.00%    0.00%
checkpoint-1500  42.00%      18.00%   1.00%    0.00%    0.00%
checkpoint-1750  61.00%      40.00%   1.00%    1.00%    0.00%
checkpoint-2000  64.00%      28.00%   1.00%    1.00%    0.00%
checkpoint-2250  84.00%      67.00%   5.00%    2.00%    0.00%
checkpoint-2500  80.00%      66.00%   22.00%   8.00%    0.00%
checkpoint-2750  91.00%      81.00%   42.00%   13.00%   3.00%
checkpoint-3000  98.00%      95.00%   38.00%   13.00%   1.00%
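
With only 100 evaluations per cell, small differences between neighbouring checkpoints can be sampling noise. The sketch below estimates the binomial standard error of an accuracy figure, assuming each evaluation is an independent pass/fail trial (an assumption; the evaluation script may sample differently). The helper name `accuracy_with_stderr` is illustrative.

```python
import math


def accuracy_with_stderr(successes, trials):
    """Return (accuracy, standard error) for independent pass/fail trials."""
    if trials <= 0:
        raise ValueError("trials must be positive")
    p = successes / trials
    # Binomial standard error of a proportion.
    stderr = math.sqrt(p * (1.0 - p) / trials)
    return p, stderr


# Medium difficulty for the last two checkpoints in the table above,
# assuming 100 trials each.
for name, solved in [("checkpoint-2750", 42), ("checkpoint-3000", 38)]:
    acc, se = accuracy_with_stderr(solved, 100)
    print(f"{name}: {acc:.0%} +/- {se:.1%}")
```

Under that assumption the standard error near 40% accuracy is about 5 percentage points, so the drop from 42% to 38% on medium puzzles between checkpoint-2750 and checkpoint-3000 is within one standard error and need not indicate a regression.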

[Figure: Accuracy chart]


📖 Citation

If you use this model in academic work, please cite:

@misc{wang2025hierarchicalreasoningmodel,
      title={Hierarchical Reasoning Model},
      author={Guan Wang and Jin Li and Yuhao Sun and Xing Chen and Changling Liu and Yue Wu and Meng Lu and Sen Song and Yasin Abbasi Yadkori},
      year={2025},
      eprint={2506.21734},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2506.21734},
}

📜 License

Apache License 2.0
