3 21 5

Hao Zhang

zhisbug

haozhangml

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

upvoted a paper about 2 months ago

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

commented on a paper 2 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

View all activity

Organizations

upvoted a paper 17 days ago

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Paper • 2512.14681 • Published 19 days ago • 39

upvoted a paper about 2 months ago

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published Nov 12, 2025 • 76

upvoted a paper 2 months ago

Efficient Long-context Language Model Training by Core Attention Disaggregation

Paper • 2510.18121 • Published Oct 20, 2025 • 122

upvoted a paper 3 months ago

Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs

Paper • 2510.11062 • Published Oct 13, 2025 • 28

upvoted a paper 5 months ago

Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing

Paper • 2508.09192 • Published Aug 8, 2025 • 30

upvoted a paper 6 months ago

Scaling Speculative Decoding with Lookahead Reasoning

Paper • 2506.19830 • Published Jun 24, 2025 • 12

upvoted 2 papers 8 months ago

lmgame-Bench: How Good are LLMs at Playing Games?

Paper • 2505.15146 • Published May 21, 2025 • 20

Faster Video Diffusion with Trainable Sparse Attention

Paper • 2505.13389 • Published May 19, 2025 • 37

upvoted a paper 9 months ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11, 2025 • 130

upvoted 3 papers 11 months ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14, 2025 • 55

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Paper • 2502.06155 • Published Feb 10, 2025 • 10

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6, 2025 • 51

upvoted 2 papers about 1 year ago

Specifications: The missing link to making the development of LLM systems an engineering discipline

Paper • 2412.05299 • Published Nov 25, 2024 • 1

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 36

upvoted 3 papers over 1 year ago

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 20

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2, 2024 • 64

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12, 2024 • 66

upvoted a paper almost 2 years ago

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7, 2024 • 39

upvoted 2 papers over 2 years ago

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Paper • 2309.11998 • Published Sep 21, 2023 • 25

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 26

Hao Zhang

AI & ML interests

Recent Activity

Organizations

zhisbug's activity