MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published about 1 month ago • 273