Leaderboards - a Felladrin Collection

Felladrin 's Collections

Trained Models 🏋️

Frequently Used Spaces

Foundation Text-Generation Models Below 360M Parameters

Leaderboards

updated 12 days ago

Gotta rank 'em all!

Running

121

Berkeley Function Calling Leaderboard

🏃

121

Display Berkeley Function-Calling Leaderboard
Running on CPU Upgrade

238

MMLU-Pro Leaderboard

🥇

238

More advanced and challenging multi-task evaluation
Running

296

GPU Poor LLM Arena

🏆

296

Compact LLM Battle Arena: Frugal AI Face-Off!
Running

181

Video Generation Leaderboard

📊

181

Text to Video and Image to Video Arena & Leaderboard
Running

Featured

83

Music Arena Leaderboard

🎵

83

AI Music Arena & Leaderboard (Suno, Udio, Google, Meta, +)
Running on CPU Upgrade

435

Agent Leaderboard

💬

435

Ranking of LLMs for agentic tasks
Running

1.35k

UGI Leaderboard

📢

1.35k

Uncensored General Intelligence Leaderboard
Running on Zero

31

SLM RAG Arena

🤼

31

Compare two AI models' answers to document questions
Running

226

BigCodeBench Leaderboard

🥇

226

Explore and analyze code completion benchmarks
Running

450

Can Ai Code Results

🏆

450

Can AI Code? An LLM leaderboard inclquantized models.
Running

9

Web Bench Leaderboard

🥇

9

Duplicate this leaderboard to initialize your own!
Running on CPU Upgrade

6.85k

MTEB Leaderboard

🥇

6.85k

Embedding Leaderboard
Running

Featured

579

LLM-Perf Leaderboard

🏆

579

Explore hardware performance for LLMs
Running on CPU Upgrade

183

LLM Hallucination Leaderboard

🚀

183

View and filter LLM hallucination leaderboard
Running on CPU Upgrade

952

Open VLM Leaderboard

🌎

952

VLMEvalKit Evaluation Results Collection
Running

16

LLM Inference Benchmark

🥇

16

Explore LLM performance with a leaderboard
Running

18

Edge LLM Leaderboard

🌖

18

Display hardware performance leaderboard
Running

14

WritingBench

🏆

14

A Comprehensive Benchmark for Generative Writing
Running

2

RPEval

🏆

2

Evaluating LLMs by their role-playing capabilities.
Running on CPU Upgrade

Featured

1.18k

Open ASR Leaderboard

🏆

1.18k

View and request speech models benchmark data