Daily Papers

by AK and the research community

Apr 8

Submitted by

akhaliq

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

·
6 authors

Submitted by

akhaliq

Stream of Search (SoS): Learning to Search in Language

·
7 authors

Submitted by

akhaliq

No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

·
8 authors

Submitted by

akhaliq

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

·
4 authors

Submitted by

akhaliq

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

·
11 authors

Submitted by

akhaliq

Social Skill Training with Large Language Models

·
6 authors

Submitted by

akhaliq

RL for Consistency Models: Faster Reward Guided Text-to-Image Generation

·
5 authors

Submitted by

akhaliq

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

·
16 authors

Submitted by

akhaliq

Robust Gaussian Splatting

·
4 authors

Submitted by

akhaliq

Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

·
7 authors