GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper β’ 2403.03507 β’ Published Mar 6, 2024 β’ 186 β’ 15
SaulLM-7B: A pioneering Large Language Model for Law Paper β’ 2403.03883 β’ Published Mar 6, 2024 β’ 80 β’ 5
PersianMind: A Cross-Lingual Persian-English Large Language Model Paper β’ 2401.06466 β’ Published Jan 12, 2024 β’ 3 β’ 3
GRATH: Gradual Self-Truthifying for Large Language Models Paper β’ 2401.12292 β’ Published Jan 22, 2024 β’ 2 β’ 2