Uploaded model
- Developed by: natukundaphiionah
- License: apache-2.0
- Finetuned from model : jq/sunflower-14b-bs64-lr1e-4-250919
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- -
Model tree for natukundaphiionah/sunflower-grpo-merged
Base model
jq/sunflower-14b-bs64-lr1e-4-250919