Yatharth Sharma's picture

Yatharth Sharma

YaTharThShaRma999

·

AI & ML interests

None yet

Recent Activity

updated a model about 4 hours ago

YaTharThShaRma999/voices

liked a Space about 7 hours ago

sesame/csm-1b

reacted to AtAndDev's post with 🔥 1 day ago

Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.

View all activity

Organizations

None yet

YaTharThShaRma999's activity

updated a model about 4 hours ago

YaTharThShaRma999/voices

Updated about 4 hours ago • 1

liked a Space about 7 hours ago

Sesame CSM

Conversational speech generation

reacted to AtAndDev's post with 🔥 1 day ago

Post

949

Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.

updated a model 7 days ago

YaTharThShaRma999/SparkTTS-LLM

Text Generation • Updated 7 days ago • 14

published a model 7 days ago

YaTharThShaRma999/SparkTTS-LLM

Text Generation • Updated 7 days ago • 14

upvoted a paper 8 days ago

A Multimodal Symphony: Integrating Taste and Sound through Generative AI

Paper • 2503.02823 • Published 9 days ago • 2

liked a model 9 days ago

SparkAudio/Spark-TTS-0.5B

Text-to-Speech • Updated 7 days ago • 7.82k • 388

upvoted a paper 10 days ago

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Paper • 2503.01183 • Published 11 days ago • 26

upvoted a paper 11 days ago

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Paper • 2502.20583 • Published 14 days ago • 11

reacted to hexgrad's post with 👍 14 days ago

Post

5058

hexgrad/Kokoro-82M-v1.1-zh

reacted to AdinaY's post with 👍🚀🔥 16 days ago

Post

2712

Wan2.1 🔥📹 new OPEN video model by Alibaba Wan team!

Model: Wan-AI/Wan2.1-T2V-14B
Demo: Wan-AI/Wan2.1

✨Apache 2.0
✨8.19GB VRAM, runs on most GPUs
✨Multi-Tasking: T2V, I2V, Video Editing, T2I, V2A
✨Text Generation: Supports Chinese & English
✨Powerful Video VAE: Encode/decode 1080P w/ temporal precision

1 reply

·

reacted to stefan-it's post with 👍 18 days ago

Post

5069

She arrived 😍

[Expect more models soon...]

2 replies

·

updated a model 20 days ago

YaTharThShaRma999/sound_effects

Updated 20 days ago

published a model 20 days ago

YaTharThShaRma999/sound_effects

Updated 20 days ago

New activity in stabilityai/stable-diffusion 21 days ago

2.1 elsewhere?

#20546 opened about 1 month ago by

liked a model 21 days ago

YaTharThShaRma999/voices

Updated about 4 hours ago • 1

upvoted a paper 22 days ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published 23 days ago • 37

liked a model 24 days ago

Skywork/SkyReels-V1-Hunyuan-I2V

Image-to-Video • Updated 18 days ago • 53.2k • 248