strict accuracy on IFEval (0-Shot)
Open LLM Leaderboard 40.380
normalized accuracy on BBH (3-Shot)
Open LLM Leaderboard 47.700
exact match on MATH Lvl 5 (4-Shot)
Open LLM Leaderboard 21.370
acc_norm on GPQA (0-shot)
Open LLM Leaderboard 16.000
acc_norm on MuSR (0-shot)
Open LLM Leaderboard 17.040
accuracy on MMLU-PRO (5-shot)
test set Open LLM Leaderboard 49.520