-
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
Paper • 2309.04662 • Published • 23 -
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Paper • 2309.11998 • Published • 25 -
YanweiLi/MGM-Pretrain
Viewer • Updated • 1.27M • 33 • 16 -
YanweiLi/MGM-Instruction
Updated • 96 • 17
Collections
Discover the best community collections!
Collections including paper arxiv:2309.11998
-
TheBirdLegacy/FreeLoaderLM
Text Generation • Updated -
CofeAI/FLM-101B
Text Generation • Updated • 107 • 91 -
FLM-101B: An Open LLM and How to Train It with $100K Budget
Paper • 2309.03852 • Published • 44 -
Composable Function-preserving Expansions for Transformer Architectures
Paper • 2308.06103 • Published • 20