Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Kwai-Klear 's Collections
mini-swe-agent-plus
Klear-AgentForge
Klear1.0
KlearReasoner
RLEP

Klear-AgentForge

updated 28 days ago

Effective supervised fine-tuning (SFT) with synthetic data followed by multi-turn reinforcement learning (RL) for boosting agentic models.

Upvote
3

  • Kwai-Klear/Klear-AgentForge-8B-SFT

    308k • Updated Oct 23 • 591 • 3

  • Kwai-Klear/SWE-smith-mini_swe_agent_plus-trajectories-66k

    Viewer • Updated Nov 6 • 66k • 1.43k • 9

  • Kwai-Klear/Klear-AgentForge-8B

    8B • Updated 29 days ago • 19 • 1
Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs