Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2402.10176

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Paper • 2402.06332 • Published Feb 9, 2024 • 19
Augmenting Math Word Problems via Iterative Question Composing

Paper • 2401.09003 • Published Jan 17, 2024 • 2
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible Pipeline

Paper • 2401.08190 • Published Jan 16, 2024
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent

Paper • 2312.08926 • Published Dec 14, 2023 • 8

AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts

Paper • 2402.07625 • Published Feb 12, 2024 • 14
Rethinking Data Selection for Supervised Fine-Tuning

Paper • 2402.06094 • Published Feb 8, 2024 • 1
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 48
TnT-LLM: Text Mining at Scale with Large Language Models

Paper • 2403.12173 • Published Mar 18, 2024 • 20

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 146
ReFT: Reasoning with Reinforced Fine-Tuning

Paper • 2401.08967 • Published Jan 17, 2024 • 30
Tuning Language Models by Proxy

Paper • 2401.08565 • Published Jan 16, 2024 • 22
TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10, 2024 • 69

abacusai/MetaMathFewshot

Viewer • Updated Jan 17, 2024 • 395k • 135 • 26
math-ai/StackMathQA

Viewer • Updated Sep 17, 2024 • 6.2M • 745 • 86
meta-math/MetaMathQA

Viewer • Updated Dec 21, 2023 • 395k • 7.05k • 353
argilla/distilabel-math-preference-dpo

Viewer • Updated Jul 16, 2024 • 2.42k • 131 • 80

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Paper • 2312.06585 • Published Dec 11, 2023 • 29
TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 38
SciPhi/AgentSearch-V1

Viewer • Updated Jan 14, 2024 • 70k • 4.34k • 86
Data Filtering Networks

Paper • 2309.17425 • Published Sep 29, 2023 • 6

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions

Paper • 2312.08578 • Published Dec 14, 2023 • 17
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

Paper • 2312.08583 • Published Dec 14, 2023 • 9
Vision-Language Models as a Source of Rewards

Paper • 2312.09187 • Published Dec 14, 2023 • 12
StemGen: A music generation model that listens

Paper • 2312.08723 • Published Dec 14, 2023 • 48

Awesome SFT datasets

A curated list of interesting datasets to fine-tune language models with.

bjoernp/ultrachat_de

Viewer • Updated Dec 2, 2023 • 959 • 108 • 8
openchat/openchat_sharegpt4_dataset

Updated Jul 1, 2023 • 523 • 166
imone/OpenOrca_FLAN

Viewer • Updated Dec 8, 2023 • 1.58M • 80 • 15
tiedong/goat

Viewer • Updated May 25, 2023 • 1.75M • 84 • 34

Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models

Paper • 2311.06783 • Published Nov 12, 2023 • 27
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Paper • 2311.07574 • Published Nov 13, 2023 • 15
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding

Paper • 2401.04575 • Published Jan 9, 2024 • 15
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 62

Research Papers

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15, 2024 • 36
Beyond Language Models: Byte Models are Digital World Simulators

Paper • 2402.19155 • Published Feb 29, 2024 • 50
Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 11

KwaiYiiMath: Technical Report

Paper • 2310.07488 • Published Oct 11, 2023 • 2
Forward-Backward Reasoning in Large Language Models for Mathematical Verification

Paper • 2308.07758 • Published Aug 15, 2023 • 4
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Paper • 2309.10814 • Published Sep 19, 2023 • 3
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

Paper • 2310.03731 • Published Oct 5, 2023 • 29

Previous
1
2
3
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs