leorc/M3

	---
	pipeline_tag: reinforcement-learning
	tags:
	- deep-reinforcement-learning
	- world-models
	---
	# M<sup>3</sup>: A Modular World Model over Streams of Tokens
	📄 [Paper](https://arxiv.org/abs/2502.11537) ▪️ 💾 [Code](https://github.com/leor-c/M3)


	🧠 This repository hosts the trained model weights for three benchmarks: Atari 100K, DeepMind Control Suite Proprioceptive 500K, and Craftax (Symbolic) 1M.
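
The weights can likely be fetched with the `huggingface_hub` client. The snippet below is a minimal sketch, not the authors' documented workflow: it assumes the repository id is `leorc/M3` (taken from this page's header) and makes no assumption about the checkpoint file names or layout, so it downloads the whole repository and lists what arrived. The helper names `download_weights` and `list_files` and the `m3_weights` target folder are illustrative only. How to load the checkpoints is described in the code repository linked above.

```python
from pathlib import Path

# Assumption: the Hub repository id shown in this page's header.
REPO_ID = "leorc/M3"


def download_weights(local_dir: str = "m3_weights", repo_id: str = REPO_ID) -> Path:
    """Download every file in the model repository into local_dir."""
    # Imported here so the listing helper below works without the package.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=repo_id, local_dir=local_dir))


def list_files(root: Path) -> list[str]:
    """Return the relative paths of all files under root, skipping hidden folders."""
    found = []
    for path in Path(root).rglob("*"):
        relative = path.relative_to(root)
        hidden = any(part.startswith(".") for part in relative.parts)
        if path.is_file() and not hidden:
            found.append(relative.as_posix())
    return sorted(found)
```

Calling `list_files(download_weights())` returns the downloaded file paths, which is a quick way to see which checkpoint belongs to which benchmark before pointing the training or evaluation code at it.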