MobileQuant: Mobile-friendly Quantization for On-device Language Models Paper • 2408.13933 • Published Aug 25, 2024 • 16
ondevicellm/tinyllama_mole_sft_routeraux_ultrachat_ep3 Text Generation • 1B • Updated Jan 30, 2024 • 5