ARPO - a dongguanting Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

dongguanting 's Collections

AEPO

ARPO

ARPO

updated Oct 15

The official datasets and model checkpoints of ARPO

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158
dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29 • 9 • 2
dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated Aug 12 • 18 • 5
dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated Aug 19 • 925 • 2
dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated Aug 12 • 16 • 1
dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated Aug 12 • 13 • 3
dongguanting/ARPO-SFT-54K

Viewer • Updated Oct 17 • 54.6k • 203 • 14
dongguanting/ARPO-RL-Reasoning-10K

Viewer • Updated Oct 17 • 10k • 128 • 3
dongguanting/ARPO-RL-DeepSearch-1K

Viewer • Updated Oct 17 • 1.07k • 109 • 5

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs