Berkeley Function Calling Leaderboard
Display Berkeley Function-Calling Leaderboard
Gotta rank 'em all!
Display Berkeley Function-Calling Leaderboard
More advanced and challenging multi-task evaluation
Compact LLM Battle Arena: Frugal AI Face-Off!
Text to Video and Image to Video Arena & Leaderboard
AI Music Arena & Leaderboard (Suno, Udio, Google, Meta, +)
Ranking of LLMs for agentic tasks
Uncensored General Intelligence Leaderboard
Compare two AI models' answers to document questions
Explore and analyze code completion benchmarks
Can AI Code? An LLM leaderboard inclquantized models.
Duplicate this leaderboard to initialize your own!
Embedding Leaderboard
Explore hardware performance for LLMs
View and filter LLM hallucination leaderboard
VLMEvalKit Evaluation Results Collection
Explore LLM performance with a leaderboard
Display hardware performance leaderboard
A Comprehensive Benchmark for Generative Writing
Evaluating LLMs by their role-playing capabilities.
View and request speech models benchmark data