---
pipeline_tag: reinforcement-learning
tags:
- deep
- reinforcement
- learning
- world
- models
---
# M<sup>3</sup>: A Modular World Model over Streams of Tokens
📄 [Paper](https://arxiv.org/abs/2502.11537) ▪️ 💾 [Code](https://github.com/leor-c/M3)
🧠 This repository hosts the trained model weights for the Atari 100K, DeepMind Control Suite Proprioceptive 500K, and Craftax (Symbolic) 1M benchmarks.
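
One way to fetch the weights locally is with the `huggingface_hub` client. The snippet below is a minimal sketch, not an official loading procedure: the helper name `download_weights` is hypothetical, and the repository id `leorc/M3` is an assumption inferred from this page, so substitute the id shown in the model page URL. How the downloaded files map to each benchmark is not described here; see the code repository for loading instructions.

```python
from typing import Optional


def download_weights(repo_id: str, local_dir: Optional[str] = None) -> str:
    """Download every file in a Hugging Face model repo and return the local path.

    `download_weights` is a hypothetical helper written for this example.
    """
    # Imported lazily so this module can be imported without huggingface_hub.
    from huggingface_hub import snapshot_download

    # With local_dir=None the files land in the default Hugging Face cache.
    return snapshot_download(repo_id=repo_id, local_dir=local_dir)


if __name__ == "__main__":
    # ASSUMPTION: "leorc/M3" is a guess at this repository's id, not confirmed
    # by the README. Replace it with the id from the model page URL.
    path = download_weights("leorc/M3")
    print(f"Weights downloaded to: {path}")
```

Running this requires `pip install huggingface_hub` and network access.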