🐺🐦‍⬛ LLM Comparison/Test: Phi-4, Qwen2 VL 72B Instruct, Aya Expanse 32B in my updated MMLU-Pro CS benchmark By wolfram • Jan 10 • 4
🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • Jan 2 • 39
They Said It Couldn’t Be Done By Pclanglais and 2 others • Dec 5, 2024 • 80
🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 76
Turning Home Assistant into an AI Powerhouse: Amy's Guide By wolfram • Dec 1, 2024 • 3
Releasing the largest multilingual open pretraining dataset By Pclanglais and 2 others • Nov 13, 2024 • 98