Submitted by akhaliq 17 Rethinking FID: Towards a Better Evaluation Metric for Image Generation · 6 authors 2
Submitted by akhaliq 17 Improving fine-grained understanding in image-text pre-training · 11 authors 1
Submitted by akhaliq 14 SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild · 11 authors 1
Submitted by akhaliq 13 FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder · 5 authors 1
Submitted by akhaliq 9 CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects · 7 authors 1