Chuanming Liu's picture

In a Training Loop 🔄

Chuanming Liu

Chuanming

·

Chuanming

AI & ML interests

Artificial Intelligence, AGI, NLP, LLMs, Multimodality, MLSys. Python/Golang/C/C++/Shell/awk&sed

Recent Activity

upvoted an article about 3 hours ago

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

liked a model about 8 hours ago

PaddlePaddle/PaddleOCR-VL-1.5

liked a model 3 days ago

moonshotai/Kimi-K2.5

View all activity

Organizations

upvoted an article about 3 hours ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

25 days ago

•

74

upvoted an article 14 days ago

Article

Open Responses: What you need to know

+2

15 days ago

•

101

upvoted an article about 1 month ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

Dec 18, 2025

•

119

upvoted a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 102

upvoted 2 articles 3 months ago

Article

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Nov 3, 2022

•

347

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

301

upvoted 2 papers 4 months ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12, 2025 • 134

Finite Scalar Quantization Enables Redundant and Transmission-Robust Neural Audio Compression at Low Bit-rates

Paper • 2509.09550 • Published Sep 11, 2025 • 3

upvoted 2 collections 4 months ago

Qwen3Guard

7 items • Updated about 1 month ago • 62

Qwen3-Omni

6 items • Updated about 1 month ago • 181

upvoted an article 5 months ago

Article

Understanding Vector Quantization in VQ-VAE

Aug 28, 2024

•

52

upvoted a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 316

upvoted an article 5 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9, 2025

•

78

upvoted 2 collections 5 months ago

PP-StructureV3

PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and do cument images to Markdown and JSON. • 17 items • Updated Sep 15, 2025 • 12

PP-OCRv5

PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 52

upvoted a paper 5 months ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22, 2025 • 73

upvoted 3 collections 5 months ago

Marvis-TTS-250m-v0.1

5 items • Updated Aug 26, 2025 • 26

AFM-Datasets

Training datasets of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 6 items • Updated Aug 6, 2025 • 5

AFM-Models

The models and training dataset of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 12 items • Updated Aug 6, 2025 • 16

upvoted a paper 5 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129