Benchmarks

by sapbot - opened 13 days ago

I benchmarked this LLM, and this is what I have:

Raw data:

"raincandy-u/rain-100m":{
    "arc":3.3,
    "mmlu":9.06,
    "truthfulqa":16.28
}

Source: https://obscureai.mooo.com/raincandy-u/rain-100m

(Pinned screenshot of truthfulQA benchmark)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment