EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published about 22 hours ago • 17
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published about 22 hours ago • 17 • 1
Teaching Language Models to Critique via Reinforcement Learning Paper • 2502.03492 • Published 10 days ago • 21
Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback Text Classification • Updated 9 days ago • 1.78k • 11