Does this only supports image?

#6
by 2U1 - opened

I'm not sure because that the model has 2 version Image and video.
Does this vision encoder supports video?

If so, can I get some example how can I get the embedding from videos.
Also, Can I get some examples for multi-image too?

Sign up or log in to comment