HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-5-gamma-share-expert Text Generation • 16B • Updated Sep 25, 2025 • 1 • 1
HectorHe/DeepSeek-V2-Lite-aux-free-sft-commonsense-1epoch-1e-4-gamma-share-expert Text Generation • 16B • Updated Sep 24, 2025 • 2 • 1
HectorHe/Qwen1.5-MOE-sft-coommonsense15k-aux-free-share-experts Text Generation • 14B • Updated Sep 23, 2025 • 1
HectorHe/OLMoE-1B-7B-0125-aux-free-sft-commonsense15k-share-expert Text Generation • 7B • Updated Sep 24, 2025 • 4 • 1
HectorHe/DeepSeek-V2-Lite-aux-free-sft-math7k-1epoch-1e-4-gamma-share-experts-2nd-epoch Text Generation • 16B • Updated Sep 19, 2025 • 2 • 1
HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-5e-5-gamma Text Generation • 14B • Updated Sep 15, 2025 • 1 • 1
HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-6-gamma Text Generation • 14B • Updated Sep 15, 2025 • 1 • 1
HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-remov-aux-only Text Generation • 14B • Updated Sep 15, 2025 • 4 • 1
HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-4-gamma Text Generation • 14B • Updated Sep 15, 2025 • 3 • 1
HectorHe/Qwen1.5-MOE-aux-free-sft-math7k-1e-2-gamma Text Generation • 14B • Updated Sep 15, 2025 • 1 • 1
HectorHe/Deepseek-Coder-V2-Lite-13B-Instruct-sft-math7k Text Generation • 16B • Updated Aug 17, 2025 • 3 • 2