Benchmarks

#3
by sapbot - opened

I benchmarked this LLM; here are the results:

  • ARC (Challenge): 3.3%
  • MMLU: 9.06%
  • TruthfulQA: 16.28%

Raw data:

"raincandy-u/rain-100m":{
    "arc":3.3,
    "mmlu":9.06,
    "truthfulqa":16.28
}
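
For anyone who wants to check the summary against the raw data, here is a minimal sketch that parses the fragment above and prints one line per benchmark. The fragment is not a complete JSON document on its own, so the sketch wraps it in braces first. The `LABELS` mapping and the `summarize` helper are illustrative names, not part of any benchmark tool.

```python
import json

# Raw fragment as posted; it is a key/value pair, not a full JSON object.
raw = '''
"raincandy-u/rain-100m":{
    "arc":3.3,
    "mmlu":9.06,
    "truthfulqa":16.28
}
'''

# Assumption: display names chosen to mirror the summary list in the post.
LABELS = {"arc": "ARC (Challenge)", "mmlu": "MMLU", "truthfulqa": "TruthfulQA"}


def summarize(fragment: str) -> list[str]:
    """Wrap the fragment in braces, parse it, and format each score as a percentage."""
    scores = json.loads("{" + fragment + "}")
    lines = []
    for model, results in scores.items():
        for key, value in results.items():
            lines.append(f"{model} {LABELS.get(key, key)}: {value}%")
    return lines


for line in summarize(raw):
    print(line)
```

Unknown benchmark keys fall back to their raw name, so the same helper should work if more scores are added to the fragment later.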

Source: https://obscureai.mooo.com/raincandy-u/rain-100m
Screenshot from 2026-05-26 09-14-12
(Pinned screenshot of the TruthfulQA benchmark)
