
Steven Zheng

Steveeeeeeen

AI & ML interests

speech & audio

Recent Activity

updated a dataset 9 minutes ago

Steveeeeeeen/whisper-leaderboard-evals

upvoted an article about 12 hours ago

Open-R1: Update #1

upvoted an article about 12 hours ago

Open-R1: a fully open reproduction of DeepSeek-R1



Steveeeeeeen's activity

upvoted 2 articles about 12 hours ago

Open-R1: Update #1

Article • and 7 others • Feb 2 • 295

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 803

upvoted an article 2 days ago

Open R1: Update #3

Article • and 9 others • 2 days ago • 197

upvoted an article 4 days ago

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

Article • and 3 others • 4 days ago • 116

upvoted an article 7 days ago

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

Article • 7 days ago • 31

upvoted an article 15 days ago

SigLIP 2: A better multilingual vision language encoder

Article • 21 days ago • 134

upvoted an article 16 days ago

Deploying Speech-to-Speech on Hugging Face

Article • Oct 22, 2024 • 38

upvoted 2 collections 16 days ago

OWLS: Scaling Laws for Speech Recognition and Translation

Collection

🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 7 items • Updated 4 days ago • 4

Open Whisper-style Speech Models (OWSM)

Collection

Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/ • 15 items • Updated Feb 6 • 5

upvoted a paper 17 days ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published 22 days ago • 66

upvoted a paper 22 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 22 days ago • 163

upvoted an article 22 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Article • Jan 23 • 154

upvoted a paper 22 days ago

Presumed Cultural Identity: How Names Shape LLM Responses

Paper • 2502.11995 • Published 24 days ago • 10

upvoted an article 23 days ago

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

Article • 24 days ago • 93

upvoted a collection 26 days ago

Feb 14 Releases 💌

Collection

23 items • Updated 27 days ago • 7

upvoted 3 articles 29 days ago

1 Billion Classifications

Article • 29 days ago • 42

Efficient Controllable Generation for SDXL with T2I-Adapters

Article • Sep 8, 2023 • 7

Introduction to the Open Leaderboard for Japanese LLMs

Article • Nov 20, 2024 • 35

upvoted an article 30 days ago

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Article • and 1 other • about 1 month ago • 26

upvoted an article about 1 month ago

Open R1: Update #2

Article • and 6 others • Feb 10 • 202