Running on CPU Upgrade 148 148 Open LLM Progress Tracker ๐ฌ Visualize Open vs. Proprietary LLM Progress
Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks Paper โข 2406.12066 โข Published Jun 17, 2024 โข 8
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias Paper โข 2405.05506 โข Published May 9, 2024 โข 1
Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks Paper โข 2406.12066 โข Published Jun 17, 2024 โข 8
Running on CPU Upgrade 5.06k 5.06k MTEB Leaderboard ๐ฅ Select benchmarks and languages for text embeddings evaluation
Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens Paper โข 2401.17377 โข Published Jan 30, 2024 โข 36
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Paper โข 2304.01373 โข Published Apr 3, 2023 โข 9
Running on CPU Upgrade 12.7k 12.7k Open LLM Leaderboard ๐ Track, rank and evaluate open LLMs and chatbots