
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
•
156k
•
1.08k
https://huggingface.co/papers/2501.03006
Detect and annotate poses in images and videos
Generate text with detailed prompts