Janus Pro WebGPU
In-browser unified multimodal understanding and generation.
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Interpret and execute code with responses
Generate text from audio recordings
Realtime implementation of Whisper large turbo
Transcribe or translate audio and YouTube videos