Submitted by akhaliq 2 X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages · 7 authors 7
Submitted by akhaliq 2 Locally Attentional SDF Diffusion for Controllable 3D Shape Generation · 6 authors
Submitted by akhaliq 2 COLA: How to adapt vision-language models to Compose Objects Localized with Attributes? · 6 authors 1
Submitted by akhaliq 1 A Variational Perspective on Solving Inverse Problems with Diffusion Models · 4 authors
Submitted by akhaliq 1 Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos · 5 authors
Submitted by akhaliq 1 A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding · 8 authors 4