Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy Paper • 2507.01352 • Published Jul 2 • 56
IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance Paper • 2502.08395 • Published Feb 12
IFBench Collection Datasets for IFBench benchmark and paper! • 3 items • Updated about 13 hours ago • 9
IFBench Collection Datasets for IFBench benchmark and paper! • 3 items • Updated about 13 hours ago • 9