Joh Bo PRO
johnnyb0y
AI & ML interests
None yet
Recent Activity
liked a model 3 months ago: internlm/Spark-VL-7B
replied to ACloudCenter's post 4 months ago
I've really been into testing the various ASR, TTS, and other audio-related models. This Space showcases the Nvidia Canary-Qwen 2.5B model. The model transcribes incredibly fast and combines with Qwen for queries about the transcript.
All audio example files were generated with my adjacent VibeVoice Conference Generator Space. Another really cool model!!
https://huggingface.co/spaces/ACloudCenter/canary-qwen-transcriber-2.5b
Organizations
None yet