metadata
datasets:
- commaai/commabody
pipeline_tag: robotics
This model has been trained on a larger version of the commabody dataset.
It includes a vqgan encoder/decoder fine tuned from imagenet. It compresses images of size 250x160 to 16x10 tokens.
It also includes a GPT2 model trained to predict the next frame, wheel speeds and actions. It can be used either as a simulator or as a policy.