IFDecorator - a guox18 Collection

IFDecorator

updated Aug 7, 2025

Dataset and models for "IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards"

guox18/IFDecorator

Preview • Updated Aug 8, 2025 • 248 • 1

Note: Datasets

guox18/Qwen2.5-7B-Instruct-IFDecorator

Text Generation • 8B • Updated Aug 8, 2025 • 5
guox18/Llama3.1-8B-Instruct-IFDecorator

8B • Updated Aug 10, 2025 • 12
guox18/Qwen3-8B-IFDecorator

8B • Updated Aug 7, 2025 • 8
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Paper • 2508.04632 • Published Aug 6, 2025 • 2