ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
answerdotai/ModernBERT-base • Fill-Mask • 0.1B • Updated Jan 15 • 904k • 887
answerdotai/ModernBERT-large • Fill-Mask • 0.4B • Updated Jan 15 • 141k • 407
Paper: Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference • 2412.13663 • Published Dec 18, 2024 • 151

Multi-Vector Retrievers
answerdotai/answerai-colbert-small-v1 • 0.0B • Updated Nov 18, 2024 • 4.12M • 153
answerdotai/JaColBERTv2.5 • Sentence Similarity • 0.1B • Updated Jul 31, 2024 • 890 • 21

Japanese Retrieval
answerdotai/JaColBERTv2.5 • Sentence Similarity • 0.1B • Updated Jul 31, 2024 • 890 • 21
answerdotai/JaColBERTv2.4 • Sentence Similarity • 0.1B • Updated Jul 31, 2024 • 15 • 3
answerdotai/MMARCO-japanese-32-scored-triplets • Updated Jul 31, 2024 • 8.64M • 203 • 6

CLA-Experiments
answerdotai/llama3-8b-instruct-CLA-3 • Text Generation • Updated Jul 18, 2024 • 18 • 1
answerdotai/llama3-8b-instruct-CLA-2 • Text Generation • Updated Jul 18, 2024 • 9 • 1

Quantized-FT-Orca-Math
Models trained during quantization-aware fine-tuning experiments using PyTorch's FSDP.
answerdotai/llama-7b-orca-math-10k-full • Text Generation • Updated Mar 28, 2024 • 5 • 2
answerdotai/llama-7b-orca-math-10k-bnb-qlora • Updated Mar 28, 2024
answerdotai/llama-7b-orca-math-10k-bnb-qdora • Updated Mar 28, 2024
answerdotai/llama-7b-orca-math-10k-bnb-llama-pro • Updated Mar 28, 2024 • 1

Function Calling
A list of datasets, models and papers for making LLMs better at function calling and tool usage
argilla/Synth-APIGen-v0.1 • Updated Oct 10, 2024 • 49.4k • 125 • 63
Salesforce/xlam-function-calling-60k • Updated Jan 24 • 60k • 4.02k • 474
sanjay920/gemma-function-calling • Updated Feb 21, 2024 • 112k • 22 • 8
gorilla-llm/Berkeley-Function-Calling-Leaderboard • Updated Feb 14 • 1.7k • 79