Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2307.09288

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244

MagicCollection

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244

general descriptions of LLMs

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123
Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 46
Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244
flax-community/gpt-2-spanish

Text Generation • Updated Apr 1, 2024 • 911 • 27

the newest info

mistralai/Mistral-7B-v0.1

Text Generation • Updated Jul 24, 2024 • 348k • • 3.65k
Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244
togethercomputer/RedPajama-Data-V2

Updated Nov 21, 2024 • 3.03k • 359
Running on CPU Upgrade

9.69k

9.69k

AI Comic Factory

👩

Create your own AI comic with a single prompt

LLM-Exploration-CTAKES

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244

Training & Architectures

Attention Is All You Need

Paper • 1706.03762 • Published Jun 12, 2017 • 55
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

Paper • 2307.08691 • Published Jul 17, 2023 • 8
Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 158
Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 46

Llemma: An Open Language Model For Mathematics

Paper • 2310.10631 • Published Oct 16, 2023 • 53
Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 46
Qwen Technical Report

Paper • 2309.16609 • Published Sep 28, 2023 • 35
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model

Paper • 2309.11568 • Published Sep 20, 2023 • 10

Running on CPU Upgrade

12.8k

12.8k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots
open-web-math/open-web-math

Viewer • Updated Oct 17, 2023 • 6.32M • 11.9k • 304
mistralai/Mistral-7B-v0.1

Text Generation • Updated Jul 24, 2024 • 348k • • 3.65k
lmsys/lmsys-chat-1m

Viewer • Updated Jul 27, 2024 • 1M • 3.1k • 640

Research on LLM

When can transformers reason with abstract symbols?

Paper • 2310.09753 • Published Oct 15, 2023 • 4
In-Context Pretraining: Language Modeling Beyond Document Boundaries

Paper • 2310.10638 • Published Oct 16, 2023 • 30
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model

Paper • 2310.09520 • Published Oct 14, 2023 • 12
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 53

PaLI-3 Vision Language Models: Smaller, Faster, Stronger

Paper • 2310.09199 • Published Oct 13, 2023 • 27
A Zero-Shot Language Agent for Computer Control with Structured Reflection

Paper • 2310.08740 • Published Oct 12, 2023 • 16
Personality Traits in Large Language Models

Paper • 2307.00184 • Published Jul 1, 2023 • 20
An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 13

Previous
1
...
4
5
6
7
8
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs