Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Paper โข 2402.14207 โข Published Feb 22, 2024 โข 8
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper โข 2501.17161 โข Published 22 days ago โข 106
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 โข 3 items โข Updated 23 days ago โข 350
Deepseek Papers Collection Deepseek papers collection โข 18 items โข Updated about 22 hours ago โข 126
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper โข 2501.06186 โข Published Jan 10 โข 61
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models โข 11 items โข Updated Dec 6, 2024 โข 649