Anuj Diwan's picture

4 1 7

Anuj Diwan

ajd12342

·

https://ajd12342.github.io/

AI & ML interests

None yet

Organizations

authored a paper 8 months ago

Rhapsody: A Dataset for Highlight Detection in Podcasts

Paper • 2505.19429 • Published May 26, 2025 • 1

authored a paper 10 months ago

Scaling Rich Style-Prompted Text-to-Speech Datasets

Paper • 2503.04713 • Published Mar 6, 2025 • 1

authored 6 papers about 1 year ago

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 3

Textless Speech-to-Speech Translation With Limited Parallel Data

Paper • 2305.15405 • Published May 24, 2023

Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality

Paper • 2211.00768 • Published Nov 1, 2022

Continual Learning for On-Device Speech Recognition using Disentangled Conformers

Paper • 2212.01393 • Published Dec 2, 2022 • 1

Reduce and Reconstruct: ASR for Low-Resource Phonetic Languages

Paper • 2010.09322 • Published Oct 19, 2020

Multilingual and code-switching ASR challenges for low resource Indian languages

Paper • 2104.00235 • Published Apr 1, 2021