Collections

Collections including paper arxiv:2405.18952

SyntheticDataPrep

Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29, 2024 • 10

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 393 • 76
argilla/intel-orca-dpo-pairs-helm-instruct

Viewer • Updated Feb 29, 2024 • 5 • 77 • 1
argilla/OpenHermes2.5-dpo-binarized-alpha

Viewer • Updated Feb 10, 2024 • 9.79k • 144 • 64
argilla/ultrafeedback-critique

Viewer • Updated Dec 15, 2023 • 253k • 71 • 4

Data generation

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 48
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows

Paper • 2402.10379 • Published Feb 16, 2024 • 31
Automatic Data Curation for Self-Supervised Learning: A Clustering-Based Approach

Paper • 2405.15613 • Published May 24, 2024 • 15
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29, 2024 • 10

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 88
NEFTune: Noisy Embeddings Improve Instruction Finetuning

Paper • 2310.05914 • Published Oct 9, 2023 • 14
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 57
Soaring from 4K to 400K: Extending LLM's Context with Activation Beacon

Paper • 2401.03462 • Published Jan 7, 2024 • 27

Using Captum to Explain Generative Language Models

Paper • 2312.05491 • Published Dec 9, 2023 • 4
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets

Paper • 2405.18952 • Published May 29, 2024 • 10
