Create videos with FFmpeg + Qwen2.5-Coder
Audio Conditioned LipSync with Latent Diffusion Models
Display Hugging Face status and loading animation
Convert text to speech in multiple languages