Submitted by akhaliq 84 SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis · 8 authors 6
Submitted by akhaliq 34 DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models · 5 authors 5
Submitted by akhaliq 22 Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning · 4 authors 1
Submitted by akhaliq 15 mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding · 13 authors 1
Submitted by akhaliq 12 What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? · 8 authors
Submitted by akhaliq 11 Robots That Ask For Help: Uncertainty Alignment for Large Language Model Planners · 14 authors 1
Submitted by akhaliq 11 Building Cooperative Embodied Agents Modularly with Large Language Models · 8 authors
Submitted by akhaliq 8 DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation · 7 authors
Submitted by akhaliq 7 Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks · 7 authors 2