Art Atk

ArtAtk

AI & ML interests

Multimodal Models

Organizations

None yet

ArtAtk's activity

upvoted 2 papers 1 day ago

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published 4 days ago • 19

QuEST: Stable Training of LLMs with 1-Bit Weights and Activations

Paper • 2502.05003 • Published 5 days ago • 36

upvoted 3 papers 4 days ago

Weak-to-Strong Diffusion with Reflection

Paper • 2502.00473 • Published 11 days ago • 19

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published 5 days ago • 30

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published 6 days ago • 20

upvoted a paper 6 days ago

VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models

Paper • 2502.02492 • Published 8 days ago • 49

upvoted a paper 8 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 9 days ago • 168

upvoted 2 papers 18 days ago

DiffuEraser: A Diffusion Model for Video Inpainting

Paper • 2501.10018 • Published 26 days ago • 13

Temporal Preference Optimization for Long-Form Video Understanding

Paper • 2501.13919 • Published 19 days ago • 21

upvoted a paper 19 days ago

Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step

Paper • 2501.13926 • Published 19 days ago • 34

upvoted 2 papers 20 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 21 days ago • 91

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 21 days ago • 315

upvoted 3 papers 21 days ago

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Paper • 2501.12224 • Published 22 days ago • 46

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published 22 days ago • 33

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 23 days ago • 90

upvoted a paper 22 days ago

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published 28 days ago • 61

upvoted an article 22 days ago

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Article • 22 days ago • 60

upvoted a paper 22 days ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published 26 days ago • 43

upvoted a paper 23 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 26 days ago • 105

upvoted a paper 25 days ago

Do generative video models learn physical principles from watching videos?

Paper • 2501.09038 • Published 28 days ago • 32