Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
prometheus-eval
university
Activity Feed
Follow
109
AI & ML interests
None defined yet.
Recent Activity
hyungjoochae
authored
a paper
2 days ago
Safe and Scalable Web Agent Learning via Recreated Websites
hyungjoochae
submitted
a paper
3 days ago
Safe and Scalable Web Agent Learning via Recreated Websites
amphora
submitted
a paper
about 1 month ago
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math
View all activity
Team members
56
+22
+9
prometheus-eval
's Spaces
2
Sort: Recently updated
Running
16
BiGGen Bench Leaderboard
😻
Display model performance leaderboard
Running
README
🐨