6 88 52

Rui Zhao

ruizhaocv

https://ruizhaocv.github.io/

AI & ML interests

Multimodal and GenAI

Recent Activity

upvoted a paper about 2 hours ago

History-Guided Video Diffusion

upvoted a paper about 2 hours ago

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

upvoted a paper about 2 hours ago

LM2: Large Memory Models

View all activity

Organizations

ruizhaocv's activity

upvoted 3 papers about 2 hours ago

History-Guided Video Diffusion

Paper • 2502.06764 • Published 1 day ago • 10

CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers

Paper • 2502.06527 • Published 1 day ago • 6

LM2: Large Memory Models

Paper • 2502.06049 • Published 2 days ago • 16

upvoted a paper 2 days ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published 5 days ago • 61

upvoted 2 papers 5 days ago

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published 6 days ago • 27

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

Paper • 2502.04299 • Published 5 days ago • 14

liked a Space 6 days ago

202

DeepSeek R1 Chat Assistant Web Search

📚

DeepSeek R1 Chat Assistant Web Search

upvoted 2 papers 6 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 11 days ago • 99

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 6 days ago • 49

liked a Space 6 days ago

296

Chat with DeepSeek-VL2-small

🌍

Generate text based on images and prompts

upvoted 2 papers 7 days ago

MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation

Paper • 2502.01572 • Published 8 days ago • 20

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models

Paper • 2502.01639 • Published 8 days ago • 24

upvoted a paper 8 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 9 days ago • 168

upvoted a paper 9 days ago

PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding

Paper • 2501.16411 • Published 15 days ago • 17

upvoted a paper 15 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 19 days ago • 48

liked a Space 15 days ago

1.68k

Chat With Janus-Pro-7B

🌍

A unified multimodal understanding and generation model.

upvoted a paper 18 days ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published 19 days ago • 34

upvoted 3 papers 21 days ago

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 24 days ago • 23

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Paper • 2501.11733 • Published 22 days ago • 27

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published 28 days ago • 61