Submitted by akhaliq 19 NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models · 7 authors
Submitted by akhaliq 17 I2VEdit: First-Frame-Guided Video Editing via Image-to-Video Diffusion Models · 5 authors
Submitted by akhaliq 16 Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning · 7 authors
Submitted by akhaliq 15 Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer · 5 authors
Submitted by akhaliq 15 Looking Backward: Streaming Video-to-Video Translation with Feature Banks · 6 authors 2
Submitted by akhaliq 12 Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels · 6 authors 3
Submitted by akhaliq 11 Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control · 7 authors
Submitted by akhaliq 11 LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters · 4 authors 2
Submitted by akhaliq 8 Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models · 24 authors