arxiv:2410.01180
Kangda Wei
kangdawei
models 50
kangdawei/DRA-GRPO-8B
8B • Updated • 2
kangdawei/DRA-GRPO-7B
Text Generation • 8B • Updated • 4
kangdawei/MMR-Sigmoid-DAPO-7B
Text Generation • 8B • Updated • 10
kangdawei/MMR-Sigmoid-DR-GRPO-8B
Text Generation • 8B • Updated • 2
kangdawei/MMR-Sigmoid-DAPO-8B
Text Generation • 8B • Updated • 174
kangdawei/MMR-Sigmoid-DAPO
Text Generation • 2B • Updated • 3
kangdawei/MMR-Sigmoid-GRPO-8B
Text Generation • 8B • Updated • 3 • 1
kangdawei/MMR-Sigmoid-GRPO-7B
Text Generation • 8B • Updated • 4
kangdawei/MMR-Sigmoid-DR-GRPO-7B
Text Generation • 8B • Updated • 4
kangdawei/DAPO-8B
Text Generation • 8B • Updated • 3
datasets 0
None public yet