OpenEvals

community

Activity Feed

AI & ML interests

LLM evaluation

Recent Activity

nielsr submitted a paper about 20 hours ago

Duration Aware Scheduling for ASR Serving Under Workload Drift

nielsr submitted a paper 17 days ago

Ultralytics YOLO26: Unified Real-Time End-to-End Vision Models

nielsr submitted a paper 24 days ago

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

View all activity

OpenEvals 's Spaces 11

Benchmark Finder

📚

A space to view and inspect all the tasks in lighteval

329

Evaluation Guidebook

📝

Explore LLM benchmark scores over time

147

Find a leaderboard

🔍

Explore and discover all leaderboards from the HF community

HF Hub Benchmark Dashboard

🏆

Live dashboard for HF Hub benchmark leaderboards

Official Benchmarks Leaderboard 2026

🏆

Explore and compare AI model scores across official benchmarks

README

⚖

Aa Omniscience

🐠

Display and inspect log files

InferenceProviderTestingBackend

📈

Launch and monitor model evaluation jobs

Evals

🐨

Run your LLM evaluations on the hub

🐢

Generate a command to run model evaluations

Tokenizers Languages

🐠

Compare tokenization lengths across languages