mistralai/Mistral-Small-24B-Instruct-2501 Text Generation • Updated 9 days ago • 338k • 704
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • Updated 3 days ago • 300k • 522
Running on CPU Upgrade • 32 • OpenLLM French leaderboard 🇫🇷 🥇 • Explore and compare LLM benchmarks and submit models for evaluation
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • Updated Oct 25, 2024 • 179k • 2.02k
Running • 14 • GPU Memory Calculator LLMTraining 💬 • Calculate GPU memory consumption for LLM training
Running • 664 • FineWeb: decanting the web for the finest text data at scale 🍷 • Generate high-quality web text data for LLM training