Collections
Discover the best community collections!
Collections including paper arxiv:2307.09288
-
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 123 -
Mistral 7B
Paper • 2310.06825 • Published • 46 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 244 -
flax-community/gpt-2-spanish
Text Generation • Updated • 911 • 27
-
mistralai/Mistral-7B-v0.1
Text Generation • Updated • 348k • 3.65k -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 244 -
togethercomputer/RedPajama-Data-V2
Updated • 3.03k • 359 -
9.69k
AI Comic Factory
Create your own AI comic with a single prompt
-
Attention Is All You Need
Paper • 1706.03762 • Published • 55 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper • 2307.08691 • Published • 8 -
Mixtral of Experts
Paper • 2401.04088 • Published • 158 -
Mistral 7B
Paper • 2310.06825 • Published • 46
-
Llemma: An Open Language Model For Mathematics
Paper • 2310.10631 • Published • 53 -
Mistral 7B
Paper • 2310.06825 • Published • 46 -
Qwen Technical Report
Paper • 2309.16609 • Published • 35 -
BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model
Paper • 2309.11568 • Published • 10
-
When can transformers reason with abstract symbols?
Paper • 2310.09753 • Published • 4 -
In-Context Pretraining: Language Modeling Beyond Document Boundaries
Paper • 2310.10638 • Published • 30 -
Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
Paper • 2310.09520 • Published • 12 -
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 53
-
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper • 2310.09199 • Published • 27 -
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper • 2310.08740 • Published • 16 -
Personality Traits in Large Language Models
Paper • 2307.00184 • Published • 20 -
An Emulator for Fine-Tuning Large Language Models using Small Language Models
Paper • 2310.12962 • Published • 13