Chat with DeepSeek-VL2-small
Generate text based on images and prompts
Find similar images from a dataset
Unified Framework for Generalized Video Face Restoration
FitDiT is a high-fidelity virtual try-on model.
https://huggingface.co/papers/2501.03006
Upgraded to v1.0!
Detect and annotate poses in images and videos
Audio Conditioned LipSync with Latent Diffusion Models
Gaze detection using Moondream
Create real-time lip-synchronized videos from audio
Image generator/identifier/reposer
Execute custom code from environment variable