FuseChat 3.0
Preference Optimization for Implicit Model Fusion
- Paper • 2412.03187 • Published • 11
FuseAI/FuseChat-Llama-3.1-8B-Instruct
Updated • 262 • 10Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.1-8B-Instruct.
FuseAI/FuseChat-Llama-3.2-3B-Instruct
Updated • 54 • 5Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-3B-Instruct.
FuseAI/FuseChat-Llama-3.2-1B-Instruct
Updated • 21 • 4Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-1B-Instruct.
FuseAI/FuseChat-Qwen-2.5-7B-Instruct
Updated • 224 • 9Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Qwen-2.5-7B-Instruct.
FuseAI/FuseChat-Gemma-2-9B-Instruct
Updated • 57 • 5Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Gemma-2-9B-Instruct.
FuseAI/FuseChat-Llama-3.1-8B-SFT
Updated • 112 • 1Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.1-8B-Instruct.
FuseAI/FuseChat-Llama-3.2-3B-SFT
Updated • 42 • 3Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-3B-Instruct.
FuseAI/FuseChat-Llama-3.2-1B-SFT
Updated • 26Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-1B-Instruct.
FuseAI/FuseChat-Qwen-2.5-7B-SFT
Updated • 25 • 2Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Qwen-2.5-7B-Instruct.
FuseAI/FuseChat-Gemma-2-9B-SFT
Updated • 24 • 2Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Gemma-2-9B-Instruct.
FuseAI/FuseChat-3.0-SFT-Data
Viewer • Updated • 94.5k • 62Note SFT dataset for FuseChat-3.0.
FuseAI/FuseChat-3.0-DPO-Data
Viewer • Updated • 64.1k • 59Note DPO dataset for FuseChat-3.0.