Submitted by Jingfeng Yao 95 Towards Scalable Pre-training of Visual Tokenizers for Generation MiniMax 367 4