Generate captions for images in various styles
Prompt with Images in flux[dev]
a tiny vision language model