Running on CPU Upgrade 18 BigCodeBench Evaluator 🥇 18 Evaluate code samples using specified parameters
Running on CPU Upgrade 13.7k Open LLM Leaderboard 🏆 13.7k Track, rank and evaluate open LLMs and chatbots