Renjie-Ranger/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_110
Safetensors
qwen2
No model card
Downloads last month: 3
Safetensors
Model size: 2B params
Tensor type: F32
Chat template
Inference Providers: This model isn't deployed by any Inference Provider.
Collection including Renjie-Ranger/verl-grpo-8k-Qwen2.5-1.5B-Instruct-global_step_110:
Long_CoT_Degradation_RL (collection, 119 items, updated Nov 11, 2025)