Zijian Zhou PRO

franciszzj

https://sites.google.com/view/zijian-zhou/home

AI & ML interests

None yet

Organizations

None yet

franciszzj's activity

upvoted a paper about 14 hours ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published 3 days ago • 31

upvoted a paper 4 days ago

EgoLife: Towards Egocentric Life Assistant

Paper • 2503.03803 • Published 8 days ago • 34

upvoted a paper 14 days ago

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published 15 days ago • 58

upvoted 2 papers 21 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 21 days ago • 129

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 22 days ago • 163

upvoted a paper 2 months ago

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published Jan 7 • 19

upvoted a collection 2 months ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 208

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352

upvoted a collection 3 months ago

AI Paper of the Day

Collection

A collection of papers that I think are interesting, one added each day • 313 items • Updated about 4 hours ago • 37

upvoted 2 papers 3 months ago

LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Paper • 2412.08580 • Published Dec 11, 2024 • 45

Learning Flow Fields in Attention for Controllable Person Image Generation

Paper • 2412.08486 • Published Dec 11, 2024 • 34

upvoted 2 papers 5 months ago

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published Oct 26, 2024 • 23

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 92

upvoted an article 6 months ago

Breaking resolution curse of vision-language models

Article • Feb 24, 2024 • 14

upvoted a collection 7 months ago

Playground v2

Collection

Collection of Playground v2 models • 4 items • Updated Dec 6, 2023 • 7

upvoted 2 papers 8 months ago

OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

Paper • 2407.11213 • Published Jul 15, 2024 • 3

OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person

Paper • 2407.16224 • Published Jul 23, 2024 • 28

upvoted 3 papers 9 months ago

SF-V: Single Forward Video Generation Model

Paper • 2406.04324 • Published Jun 6, 2024 • 25

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6, 2024 • 74

Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

Paper • 2403.06728 • Published Mar 11, 2024 • 2