Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning Paper • 2502.06060 • Published 5 days ago • 29
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 7 days ago • 89
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 5 items • Updated 8 days ago • 33
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 11 days ago • 95
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 15 days ago • 24
Article Janus Pro: DeepSeek's Revolutionary Multimodal AI Model By LLMhacker • 18 days ago • 31
Albertina Collection Albertina family of encoders for Portuguese • 9 items • Updated Jul 26, 2024 • 2
Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 22 days ago • 62
🍃 MINT-1T Collection Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 58
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated 24 days ago • 22
PixMo Collection A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 4 days ago • 59
Sparsh Collection Models and datasets for Sparsh: Self-supervised touch representations for vision-based tactile sensing • 15 items • Updated Oct 24, 2024 • 12