pipeline_tag: reinforcement-learning
tags:
- deep
- reinforcement
- learning
- world
- models
# M<sup>3</sup>: A Modular World Model over Streams of Tokens

📄 [Paper](https://arxiv.org/abs/2502.11537) ▪️ 💾 [Code](https://github.com/leor-c/M3)
🧠 Trained model weights for the Atari 100K, DeepMind Control Suite Proprioceptive 500K, and Craftax (Symbolic) 1M benchmarks.