---
pipeline_tag: reinforcement-learning
tags:
- deep
- reinforcement
- learning
- world
- models
---

# M3: A Modular World Model over Streams of Tokens

📄 [Paper](https://arxiv.org/abs/2502.11537) ▪️ 💾 [Code](https://github.com/leor-c/M3)

🧠 This repository contains the trained model weights for the Atari 100K, DeepMind Control Suite Proprioceptive 500K, and Craftax (Symbolic) 1M benchmarks.