rail-berkeley/octo-base

rail-berkeley committed on Dec 14, 2023

Commit c00abd5 · 1 Parent(s): fbc5da5

Update README.md

Files changed (1)

README.md +2 -1

@@ -1,5 +1,6 @@
 # Octo Base
-This model is trained with a window size of 2, predicting 7-dimensional actions 4 steps into the future using a diffusion policy.
 Observations and tasks conform to the following spec:
 Observations:

 # Octo Base
+This model is trained with a window size of 2, predicting 7-dimensional actions 4 steps into the future using a diffusion policy. The model is a Transformer with 93M parameters (equivalent to a ViT-B). Images are tokenized by preprocessing with a lightweight convolutional encoder, then grouped into 16x16 patches. Language is tokenized by applying the T5 tokenizer, and then applying the T5-Base language encoder.
 Observations and tasks conform to the following spec:
 Observations: