mradermacher/Qwen3-14B-ARPO-DeepSearch-GGUF Reinforcement Learning • 15B • Updated Aug 12, 2025 • 31 • 2
mradermacher/Qwen3-14B-ARPO-DeepSearch-i1-GGUF Reinforcement Learning • 15B • Updated 17 days ago • 1.71k • 1