Anuj Diwan

ajd12342
https://ajd12342.github.io/
  • anuj_diwan
  • ajd12342

Organizations

University of Texas at Austin

authored a paper 8 months ago

Rhapsody: A Dataset for Highlight Detection in Podcasts

Paper • 2505.19429 • Published May 26, 2025 • 1
authored a paper 10 months ago

Scaling Rich Style-Prompted Text-to-Speech Datasets

Paper • 2503.04713 • Published Mar 6, 2025 • 1
authored 6 papers about 1 year ago

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 3

Textless Speech-to-Speech Translation With Limited Parallel Data

Paper • 2305.15405 • Published May 24, 2023

Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality

Paper • 2211.00768 • Published Nov 1, 2022

Continual Learning for On-Device Speech Recognition using Disentangled Conformers

Paper • 2212.01393 • Published Dec 2, 2022 • 1

Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages

Paper • 2010.09322 • Published Oct 19, 2020

Multilingual and code-switching ASR challenges for low resource Indian languages

Paper • 2104.00235 • Published Apr 1, 2021