Does this only supports image?
#6
by
2U1
- opened
I'm not sure because that the model has 2 version Image and video.
Does this vision encoder supports video?
If so, can I get some example how can I get the embedding from videos.
Also, Can I get some examples for multi-image too?