Duplicated from multimodalart/Cosmos-Predict2-2B
For more detail on the models, please refer to the docs.