❤️
Darío Muñoz Prudant PRO
prudant
AI & ML interests
Tech enthusiast, avid AI learner, and perpetual seeker of new knowledge.
Recent Activity
commented on
an
article
22 days ago
SmolVLM Grows Smaller – Introducing the 250M & 500M Models!
new activity
24 days ago
M-Chimiste/Llama-3-8B-prime-graph-exp-1_merged:3.1 llama version
new activity
about 1 month ago
tiesenx14/Llama-3.1-8B-IT-GraphRAG-finetuned:help =)
Organizations
prudant's activity
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
commented on
SmolVLM Grows Smaller – Introducing the 250M & 500M Models!
22 days ago
3.1 llama version
#1 opened 24 days ago
by
prudant
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
New activity in
NLDoc/lilt-xlm-roberta-base-finetuned-DocLayNet-large_paragraphs_ml512-v1-cp5000
about 1 month ago
how to use this model?
#1 opened about 1 month ago
by
prudant
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
Finetune code
1
#1 opened about 1 month ago
by
prudant
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
reacted to
reach-vb's
post with 👀
3 months ago
Post
1705
Smol TTS models are here! OuteTTS-0.1-350M - Zero shot voice cloning, built on LLaMa architecture, CC-BY license! 🔥
> Pure language modeling approach to TTS
> Zero-shot voice cloning
> LLaMa architecture w/ Audio tokens (WavTokenizer)
> BONUS: Works on-device w/ llama.cpp ⚡
Three-step approach to TTS:
> Audio tokenization using WavTokenizer (75 tok per second)
> CTC forced alignment for word-to-audio token mapping
> Structured prompt creation w/ transcription, duration, audio tokens
The model is extremely impressive for 350M parameters! Kudos to the
OuteAI team on such a brilliant feat - I'd love to see this be applied on larger data and smarter backbones like SmolLM 🤗
Check out the models here: OuteAI/outetts-6728aa71a53a076e4ba4817c
> Pure language modeling approach to TTS
> Zero-shot voice cloning
> LLaMa architecture w/ Audio tokens (WavTokenizer)
> BONUS: Works on-device w/ llama.cpp ⚡
Three-step approach to TTS:
> Audio tokenization using WavTokenizer (75 tok per second)
> CTC forced alignment for word-to-audio token mapping
> Structured prompt creation w/ transcription, duration, audio tokens
The model is extremely impressive for 350M parameters! Kudos to the
OuteAI team on such a brilliant feat - I'd love to see this be applied on larger data and smarter backbones like SmolLM 🤗
Check out the models here: OuteAI/outetts-6728aa71a53a076e4ba4817c
awq quant
1
#1 opened 3 months ago
by
prudant
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
How to use visual grounding with this model ?
1
#25 opened 5 months ago
by
r4hul77
Documentation?
1
#1 opened 11 months ago
by
ThewindMom
A practical use case from your great job for the spanish language
4
#9 opened 8 months ago
by
prudant
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
upvoted
an
article
4 months ago
Article
¡Lanzamiento de la Comunidad Latinoamericana de NLP en Hugging Face! 🌟
By
•
•
7![](https://cdn-avatars.huggingface.co/v1/production/uploads/62d2de4a26213de379a2c33c/ow2Uh4Rvoz24rMoPMf2B_.png)
published
an
article
4 months ago
Article
¡Lanzamiento de la Comunidad Latinoamericana de NLP en Hugging Face! 🌟
By
•
•
7where is the model?
3
#1 opened 7 months ago
by
moyanxinxu
![](https://cdn-avatars.huggingface.co/v1/production/uploads/652f6e60da91a2e197ba5e5b/cSPJEGOPbzNSciu5gFrO2.jpeg)
Collaboration?
10
#10 opened 5 months ago
by
dnhkng
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62cbf98d48c278718d06d07c/hk1oW6SQpmWDdKjAvIrnW.png)
License
3
#4 opened 5 months ago
by
jameshuntercarter
License?
4
#1 opened 5 months ago
by
nbroad
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1639773384591-5f353bb37e58354338621655.jpeg)
Quote for commercial use
6
#71 opened 7 months ago
by
faisalahmedsifat