nvidia/parakeet-tdt-0.6b-v2 Automatic Speech Recognition β’ 0.6B β’ Updated Jun 26 β’ 647k β’ 1.27k
view reply It's not prompted. The source Audio had that emotional context and the model simply copied it.