Llama3-8B-1.58

A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture!

HF1BitLLM/Llama3-8B-1.58-100B-tokens • Text Generation • 3B • Updated Sep 19, 2024 • 1.51k • 192
HF1BitLLM/Llama3-8B-1.58-Linear-10B-tokens • Text Generation • 3B • Updated Sep 18, 2024 • 26 • 10
HF1BitLLM/Llama3-8B-1.58-Sigmoid-k100-10B-tokens • Text Generation • 3B • Updated Sep 18, 2024 • 2 • 9