Tianheng Cheng's picture

Tianheng Cheng

wondervictor

·

https://github.com/wondervictor

AI & ML interests

Computer vision, visual perception, multimodal models

Recent Activity

published a model 9 days ago

wondervictor/YOLO-World-V2.1

authored a paper 22 days ago

Knowledge Mining with Scene Text for Fine-Grained Recognition

authored a paper 22 days ago

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

View all activity

Organizations

wondervictor's activity

published a model 9 days ago

wondervictor/YOLO-World-V2.1

Updated Jan 25 • 2

authored 3 papers 22 days ago

Knowledge Mining with Scene Text for Fine-Grained Recognition

Paper • 2203.14215 • Published Mar 27, 2022

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

Paper • 2412.13193 • Published Dec 17, 2024

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published 23 days ago • 36

upvoted a paper 23 days ago

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published 23 days ago • 36

updated a model about 2 months ago

wondervictor/YOLO-World-V2.1

Updated Jan 25 • 2

upvoted an article 2 months ago

Article

Process Reinforcement through Implicit Rewards

By

and 1 other •

Jan 3

• 25

upvoted a paper 2 months ago

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

Paper • 2501.01423 • Published Jan 2 • 37

liked a Space 3 months ago

LuminaBrush

Execute commands from environment

updated a Space 3 months ago

ControlAR-XL

Controllable Autoregressive Image Generation

liked 2 Spaces 3 months ago

IC Light V2-Vary

Execute commands based on environment variables

ControlAR-XL

Controllable Autoregressive Image Generation

New activity in wondervictor/ControlAR 3 months ago

Frequently GPU Task Aborted

#1 opened 3 months ago by

updated a model 3 months ago

wondervictor/ControlAR

Updated Dec 11, 2024 • 2

updated a Space 3 months ago

ControlAR-XL

Controllable Autoregressive Image Generation

updated a model 3 months ago

wondervictor/ControlAR

Updated Dec 11, 2024 • 2

updated 3 Spaces 3 months ago

Mask-Adapter-SAM2

Segment objects in images using text and points

EVF-SAM-2

Segment objects in images and videos using text prompts

Mask-Adapter-SAM2

Segment objects in images using text and points

liked a Space 3 months ago

Mask-Adapter-SAM2

Segment objects in images using text and points