Submitted by akhaliq 33 OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models · 16 authors 2
Submitted by akhaliq 21 Scaling Relationship on Learning Mathematical Reasoning with Large Language Models · 6 authors
Submitted by akhaliq 18 MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies · 6 authors
Submitted by akhaliq 12 The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World · 14 authors
Submitted by akhaliq 12 HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions · 7 authors
Submitted by akhaliq 7 Ambient Adventures: Teaching ChatGPT on Developing Complex Stories · 5 authors
Submitted by akhaliq 3 TDMD: A Database for Dynamic Color Mesh Subjective and Objective Quality Explorations · 5 authors