Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Together AI
Hyperbolic
Novita
Nebius AI Studio
Fireworks
Replicate
Cerebras
SambaNova
fal
HF Inference API
Misc
Reset Misc
arxiv:
2408.15237
AutoTrain Compatible
Inference Endpoints
text-generation-inference
Misc with no match
Eval Results
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
21
Full-text search
Edit filters
Sort: Trending
Active filters:
2408.15237
Clear all
JunxiongWang/mamba_0_5_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
12
JunxiongWang/mamba_0_5_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
16
JunxiongWang/mamba_0_875_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
17
•
1
JunxiongWang/mamba_0_875_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
7
JunxiongWang/mamba_0_75_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
12
JunxiongWang/mamba_0_75_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
10
JunxiongWang/MambaInLlama_0_50
Updated
Sep 2, 2024
•
34
JunxiongWang/Mamba2InLlama_0_50
Updated
Sep 2, 2024
•
128
JunxiongWang/MambaInLlama_0_75
Updated
Sep 2, 2024
•
30
JunxiongWang/Mamba2InLlama_0_75
Updated
Sep 2, 2024
•
62
JunxiongWang/Mamba2InLlama_0_875
Updated
Sep 2, 2024
•
90
JunxiongWang/MambaInLlama_0_875
Updated
Sep 2, 2024
•
31
JunxiongWang/Mamba2InLlama_1
Updated
Sep 2, 2024
•
66
•
1
JunxiongWang/Llama3.2-Mamba2-3B-distill
Updated
Nov 17, 2024
•
168
JunxiongWang/Llama3.2-Mamba2-3B-dpo
Updated
Nov 17, 2024
•
79
JunxiongWang/Llama3.1-Mamba2-8B-distill
Updated
Nov 17, 2024
•
14
JunxiongWang/Llama3.2-Mamba-3B-distill
Updated
Nov 17, 2024
•
40
JunxiongWang/Llama3.1-Mamba-8B-distill
Updated
Nov 17, 2024
•
19
JunxiongWang/Llama3.1-Mamba2-8B-dpo
Updated
Nov 17, 2024
•
13
JunxiongWang/Llama3.1-Mamba-8B-dpo
Updated
Nov 17, 2024
•
21
JunxiongWang/Llama3.2-Mamba-3B-dpo
Updated
Nov 17, 2024
•
25