view article Article Process Reinforcement through Implicit Rewards By ganqu and 1 other • Jan 3 • 23
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 514
jpwahle/evol_instruct_70k_train_gpt-4o-mini_2_agents_2_turns Viewer • Updated Aug 10, 2024 • 7.13k • 10
Running on CPU Upgrade 12.4k 12.4k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots