Collections
Collections including paper arxiv:2311.12983
-
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
Paper • 2311.12022 • Published • 31
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 192
gorilla-llm/APIBench
Updated • 156 • 66
Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models
Paper • 2312.04724 • Published • 20
-
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models
Paper • 2311.06783 • Published • 28
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
Paper • 2311.07361 • Published • 14
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 192
teknium/openhermes
Viewer • Updated • 243k • 777 • 208
-
Efficient Streaming Language Models with Attention Sinks
Paper • 2309.17453 • Published • 13
Simple and Controllable Music Generation
Paper • 2306.05284 • Published • 149
FinGPT: Large Generative Models for a Small Language
Paper • 2311.05640 • Published • 32
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
Paper • 2305.07185 • Published • 9
-
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
Paper • 2211.05100 • Published • 29
CsFEVER and CTKFacts: Acquiring Czech data for fact verification
Paper • 2201.11115 • Published
Training language models to follow instructions with human feedback
Paper • 2203.02155 • Published • 17
FinGPT: Large Generative Models for a Small Language
Paper • 2311.05640 • Published • 32