DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences Paper • 2406.03008 • Published Jun 5, 2024
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models Paper • 2407.07035 • Published Jul 9, 2024
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors Paper • 2502.13311 • Published 23 days ago • 1
Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions Paper • 2406.09264 • Published Jun 13, 2024 • 1
Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue Paper • 2305.11271 • Published May 18, 2023
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation Paper • 2402.16846 • Published Feb 26, 2024
DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents Paper • 2210.12511 • Published Oct 22, 2022
World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models Paper • 2306.08685 • Published Jun 14, 2023 • 1
DANLI: Deliberative Agent for Following Natural Language Instructions Paper • 2210.12485 • Published Oct 22, 2022
Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models Paper • 2310.19619 • Published Oct 30, 2023
CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation Paper • 2310.13165 • Published Oct 19, 2023