trl-internal-testing/tiny-DeepseekV3ForCausalLM Text Generation • 5.52M • Updated 26 days ago • 2.5k • 3
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF Text Generation • 480B • Updated Jul 31, 2025 • 3.53k • 163