Hey @nicccobb,
I see that you feel the post and code are lacking.
Could you specify which parts seem incorrectly coded? I’m willing to revise and give more depth where needed. Also, I’ve already discovered one clear mistake: my calculations for the time span each CTC token covers (ms per token) were off.
I’d love to hear any other points you think should be fixed or expanded upon. Constructive feedback is always appreciated, and I’m open to correcting and learning.
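
As a rough sketch of the ms-per-token calculation mentioned above: the time step between consecutive CTC output frames is the encoder's total downsampling factor divided by the sample rate. The model and its strides are not stated in this thread, so the numbers below (wav2vec 2.0-style convolution strides and 16 kHz audio) are assumptions for illustration, and `ms_per_ctc_frame` is a hypothetical helper name.

```python
from math import prod


def ms_per_ctc_frame(conv_strides, sample_rate_hz):
    """Return the hop, in milliseconds, between consecutive CTC output frames."""
    # Total downsampling factor: audio samples consumed per output frame.
    samples_per_frame = prod(conv_strides)
    return 1000.0 * samples_per_frame / sample_rate_hz


# Assumed values, not taken from the post: wav2vec 2.0-style feature
# encoder strides and 16 kHz input audio.
strides = (5, 2, 2, 2, 2, 2, 2)
frame_ms = ms_per_ctc_frame(strides, 16_000)
print(frame_ms)  # 20.0
```

This is the hop between frames, not the receptive field of each frame, which is usually somewhat wider. If the model in the post uses different strides or a different sample rate, the figure changes accordingly.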