new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Dec 24

Submitted by

luojunyu

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

·
6 authors

Submitted by

AndrewZeng

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

·
6 authors

Submitted by

PeterV09

Diving into Self-Evolving Training for Multimodal Reasoning

·
6 authors

Submitted by

fjxmlzn

Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching

·
4 authors

Submitted by

akhaliq

OpenAI o1 System Card

·
265 authors

Submitted by

luyangl

Deliberation in Latent Space via Differentiable Cache Augmentation

·
5 authors

Submitted by

jinheon

Revisiting In-Context Learning with Long Context Language Models

·
7 authors

Submitted by

Yingqing

Large Motion Video Autoencoding with Cross-modal Video VAE

·
7 authors

Submitted by

akhaliq

LearnLM: Improving Gemini for Learning

·
46 authors

Submitted by

akhaliq

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

·
4 authors

Submitted by

zhuohaoyu

Outcome-Refining Process Supervision for Code Generation

·
7 authors

Submitted by

lwaekfjlk

ResearchTown: Simulator of Human Research Community

·
8 authors

Submitted by

Vfrz

PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World

·
8 authors

Submitted by

nonstopfor

Agent-SafetyBench: Evaluating the Safety of LLM Agents

·
7 authors

Submitted by

sdzy

OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning

·
6 authors

Submitted by

ColorfulAI

Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

·
6 authors

Submitted by

DonJoey

NILE: Internal Consistency Alignment in Large Language Models

·
10 authors