AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information? Paper • 2412.02611 • Published Dec 3, 2024 • 24
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control Paper • 2307.00117 • Published Jun 30, 2023 • 6