FuseChat 3.0
Preference Optimization for Implicit Model Fusion
-
Paper • 2412.03187 • Published • 12
FuseAI/FuseChat-Llama-3.1-8B-Instruct
Text Generation • 8B • Updated • 33 • 12Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.1-8B-Instruct.
FuseAI/FuseChat-Llama-3.2-3B-Instruct
3B • Updated • 411 • 7Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-3B-Instruct.
FuseAI/FuseChat-Llama-3.2-1B-Instruct
1B • Updated • 30 • 6Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-1B-Instruct.
FuseAI/FuseChat-Qwen-2.5-7B-Instruct
8B • Updated • 18 • 14Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Qwen-2.5-7B-Instruct.
FuseAI/FuseChat-Gemma-2-9B-Instruct
9B • Updated • 15 • 7Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Gemma-2-9B-Instruct.
FuseAI/FuseChat-Llama-3.1-8B-SFT
8B • Updated • 25 • 2Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.1-8B-Instruct.
FuseAI/FuseChat-Llama-3.2-3B-SFT
3B • Updated • 19 • 3Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-3B-Instruct.
FuseAI/FuseChat-Llama-3.2-1B-SFT
1B • Updated • 14Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-1B-Instruct.
FuseAI/FuseChat-Qwen-2.5-7B-SFT
8B • Updated • 2.65k • 2Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Qwen-2.5-7B-Instruct.
FuseAI/FuseChat-Gemma-2-9B-SFT
9B • Updated • 11 • 4Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Gemma-2-9B-Instruct.