- openchat/openchat-3.5-1210
  Text Generation • Updated • 1.27k • 273
- MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
  Paper • 2401.04081 • Published • 71
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 107
- Babelscape/rebel-large
  Text2Text Generation • Updated • 30k • 216
Collections
Collections including paper arxiv:2402.03300
- A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
  Paper • 2312.08578 • Published • 20
- ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks
  Paper • 2312.08583 • Published • 12
- Vision-Language Models as a Source of Rewards
  Paper • 2312.09187 • Published • 14
- StemGen: A music generation model that listens
  Paper • 2312.08723 • Published • 48
- Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
  Paper • 2311.06720 • Published • 9
- System 2 Attention (is something you might need too)
  Paper • 2311.11829 • Published • 42
- TinyGSM: achieving >80% on GSM8k with small language models
  Paper • 2312.09241 • Published • 39
- ReFT: Reasoning with Reinforced Fine-Tuning
  Paper • 2401.08967 • Published • 30
- From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning
  Paper • 2308.12032 • Published • 1
- Know thy corpus! Robust methods for digital curation of Web corpora
  Paper • 2003.06389 • Published • 1
- Self-Alignment with Instruction Backtranslation
  Paper • 2308.06259 • Published • 42
- The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation
  Paper • 2305.06156 • Published • 2
- Attention Is All You Need
  Paper • 1706.03762 • Published • 55
- FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
  Paper • 2307.08691 • Published • 8
- Mixtral of Experts
  Paper • 2401.04088 • Published • 158
- Mistral 7B
  Paper • 2310.06825 • Published • 46
- KwaiYiiMath: Technical Report
  Paper • 2310.07488 • Published • 2
- Forward-Backward Reasoning in Large Language Models for Mathematical Verification
  Paper • 2308.07758 • Published • 4
- Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
  Paper • 2309.10814 • Published • 3
- MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning
  Paper • 2310.03731 • Published • 29
- Moral Foundations of Large Language Models
  Paper • 2310.15337 • Published • 1
- Specific versus General Principles for Constitutional AI
  Paper • 2310.13798 • Published • 3
- Contrastive Preference Learning: Learning from Human Feedback without RL
  Paper • 2310.13639 • Published • 25
- RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
  Paper • 2309.00267 • Published • 48
- Text-to-3D using Gaussian Splatting
  Paper • 2309.16585 • Published • 30
- FP8-LM: Training FP8 Large Language Models
  Paper • 2310.18313 • Published • 33
- Zephyr: Direct Distillation of LM Alignment
  Paper • 2310.16944 • Published • 123
- Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
  Paper • 2312.06585 • Published • 29
- Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model
  Paper • 2309.03550 • Published • 12
- Memory Augmented Language Models through Mixture of Word Experts
  Paper • 2311.10768 • Published • 18
- GAIA: a benchmark for General AI Assistants
  Paper • 2311.12983 • Published • 192
- GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
  Paper • 2311.12631 • Published • 15